JP5422934B2

JP5422934B2 - Palindromic structure detection system in genome sequence

Info

Publication number: JP5422934B2
Application number: JP2008188755A
Authority: JP
Inventors: 俊人塩谷
Original assignee: Toppan Inc
Current assignee: Toppan Inc
Priority date: 2007-09-21
Filing date: 2008-07-22
Publication date: 2014-02-19
Anticipated expiration: 2028-07-22
Also published as: JP2009093621A

Description

本発明は、ゲノム配列に含まれる回文構造を高速に検出するゲノム配列における回文構造検出システムに関する。 The present invention relates to a palindrome structure detection system in a genome sequence that detects a palindrome structure included in the genome sequence at high speed.

ＤＮＡ（Deoxyribonucleic acid）のゲノム配列データは、Ａ（アデニン）、Ｔ（チミン）、Ｇ（グアニン）、Ｃ（シトシン）の４種類の塩基の配列が直列に億単位で並ぶ膨大なデータ群である。
近年、ヒトの遺伝子やその他重要な生物の遺伝子におけるゲノム配列のデータが急速に解析されつつある。
ゲノム配列の公開データにおいては、上記Ａ，Ｔ，Ｃ，Ｇ４種類の塩基が延々と続くデータを羅列しているだけであり、配列のいずれの箇所が重要なデータを含んでいるかを知るどころか、何れの領域に遺伝子が存在するかが殆ど判明していないというのが実情である。 The genome sequence data of DNA (Deoxyribonucleic acid) is an enormous data group in which sequences of four types of bases A (adenine), T (thymine), G (guanine), and C (cytosine) are arranged in 100 million units in series. .
In recent years, genome sequence data on human genes and genes of other important organisms has been rapidly analyzed.
In the public data of the genome sequence, the above A, T, C, and G4 types of bases are merely enumerated, and rather than knowing which part of the sequence contains important data, The fact is that it is hardly known in which region the gene exists.

特に、１本鎖のＤＮＡ及びＲＮＡ（Ribonucleic acid）においては、ステムループ構造という回文（“たけやぶやけた”、“しんぶんし”等の前後から読んでも同一の読みとなる）構造（パリンドローム構造）をとり、リプレッサタンパクを結合させないことによってｍ（messenger）ＲＮＡの構造などを変化させている。
例えば、図２０に示す例において、図２０（ａ）の配列のｍＲＮＡ（トリプトファンの生成に対応する）は、リボソームのようなリプレッサタンパクが結合していない場合、図２０（ｂ）のようなステムループ構造を有する。
ただし、図２０（ａ）の配列のｍＲＮＡは、トリブトファンが多く供給されて高濃度に存在する場合、図２１（ａ）のステムループ構造を有し、一方、トリプトファンの需要が多く低濃度に存在する場合、図２１（ｂ）の構造を有する。 In particular, in single-stranded DNA and RNA (Ribonucleic acid), a stem loop structure pamphlet (the same reading is obtained even before and after “Takeya Bakeyake”, “Shinbushi”), etc. (palindromic) The structure of m (messenger) RNA is changed by not binding the repressor protein.
For example, in the example shown in FIG. 20, the mRNA of the sequence of FIG. 20 (a) (corresponding to the generation of tryptophan) is as shown in FIG. 20 (b) when a repressor protein such as ribosome is not bound. It has a stem loop structure.
However, the mRNA of the sequence of FIG. 20 (a) has the stem loop structure of FIG. 21 (a) when a large amount of tributophan is supplied and is present at a high concentration, while the demand for tryptophan is large and present at a low concentration. In this case, the structure shown in FIG.

上述したように、回文構造が異なる状態、すなわち特定の状態におけるステムループ構造は現状において各種見つかっている。
このステムループ構造となると、実際のゲノム配列が読めない状態となり、その部位の解析を行うことができない。
ｍＲＮＡやＤＮＡのゲノム配列内に潜んでいるステムループ構造が起こる可能性のある部位を探ることは、ゲノム配列に対応したリボソームにおけるアミノ酸の産生状況を探るなどの重要な意味を有している。
特に、ＰＣＲ（Polymerase Chain Reaction）によって生成されたｍＲＮＡ等は、ステムループ構造を構成しやすく、どのようなステムループ構造になっているかを知るためには、そのｍＲＮＡのステムループ構造が起こる可能性のある部位を予め知っておく必要がある。 As described above, various types of stem-loop structures in different palindrome structures, that is, specific states have been found at present.
If this stem-loop structure is used, the actual genome sequence cannot be read, and the site cannot be analyzed.
Searching for a site where a stem loop structure lurking in the genomic sequence of mRNA or DNA may occur has important implications such as searching for the status of amino acid production in the ribosome corresponding to the genomic sequence.
In particular, mRNA generated by PCR (Polymerase Chain Reaction) can easily form a stem loop structure, and in order to know what kind of stem loop structure it is possible that the stem loop structure of the mRNA may occur. It is necessary to know a certain part in advance.

このため、塩基Ａと塩基Ｔと、また塩基Ｃと塩基Ｇとが相補的な関係にあり塩基対を形成するため、ゲノム配列における塩基の並びを解析することにより、ゲノム配列のいずれの箇所が回文構造であり、ステムループ構造を起こす可能性がある部位であるか否かの検出が行われている（特許文献１）。
特開２００５−３１６９２４号公報 For this reason, since base A and base T and base C and base G are in a complementary relationship to form a base pair, by analyzing the base sequence in the genome sequence, any part of the genome sequence It is detected whether it is a palindromic structure and a part that may cause a stem loop structure (Patent Document 1).
JP 2005-316924 A

しかしながら、上述した従来例においては、コンピュータ処理にて行うアルゴリズムについて記載されているが、塩基Ａ，Ｔ，Ｇ，Ｃの配列の関係が示されているのみであり、直感的にいずれの箇所が回文構造であるかの判定が行えない欠点がある。
従来からもコンピュータ処理にて、いずれの箇所に回文構造が含まれているかを検出するため、塩基Ａ，Ｔ，Ｇ，Ｃ各々に対して、それぞれ１，２，３，４のような数字を付与して（２ビットにて示す場合、例えば、Ａ＝００，Ｔ＝０１，Ｇ＝１０，Ｃ＝１１のように付与して）、回文構造を検出するための演算を行っている。 However, in the above-described conventional example, an algorithm performed by computer processing is described, but only the sequence relationship of bases A, T, G, and C is shown. There is a drawback that it is not possible to determine whether it is a palindrome structure.
Conventionally, numbers such as 1, 2, 3, and 4 for bases A, T, G, and C, respectively, are detected by computer processing to detect where palindrome structures are included. (In the case of 2 bits, for example, A = 00, T = 01, G = 10, C = 11), and an operation for detecting the palindrome structure is performed. .

回文構造を検出する演算に際し、上述したように各塩基の定義を行うと、塩基Ａ，Ｔ，Ｇ，Ｃの羅列として出力される検出結果において、それぞれの相補的関係が視覚的に分かりにくく、塩基の羅列が何を意味しているのかが直感的に判断できない。
本発明は、このような事情に鑑みてなされたものであり、ゲノム配列における大量のデータの解析が簡単に行え、またゲノム配列における回文構造の解析結果が、視覚的にも重要な部位がすぐに判別可能な結果として得られるゲノム配列における回文構造検出装置を提供することを目的とする。 When the bases are defined as described above in the operation for detecting the palindrome structure, the complementary relations in the detection results output as a sequence of bases A, T, G, and C are difficult to visually understand. I cannot intuitively judge what the enumeration of bases means.
The present invention has been made in view of such circumstances, and it is possible to easily analyze a large amount of data in a genome sequence, and the analysis result of palindrome structure in the genome sequence has a visually important part. An object of the present invention is to provide a palindromic structure detection apparatus for genome sequences obtained as a result that can be discriminated immediately.

本発明のゲノム配列における回文構造検出システムは、ゲノム配列にて塩基対を形成する塩基各々に同一の数字をあてがい、それぞれに対して極性の異なる符号を付し、ゲノム配列における回文構造を検出する回文構造検出システムであり、入力されるゲノム配列における各塩基を、対応する符号の付された数字に置き換える塩基／数字変換部と、ゲノム配列において、解析範囲内の両端から中央方向に塩基を一つずつずらして、該解析範囲内の各両端からｎ個ずつの塩基を両端からの位置に対応して組み合わせて、予め設定されたｎ個の塩基対を生成する塩基対生成部と、該塩基対生成部から出力される塩基対それぞれの塩基に対応する数字を独立して加算するｎ個の第１の加算部と、該第１の加算部それぞれの出力に設けられ、加算結果の絶対値を算出する絶対値部と、前記解析範囲内の各加算結果の絶対値を加算する第２の加算部とを有する回文演算部とを有する。 The palindrome structure detection system in the genome sequence of the present invention assigns the same number to each base that forms a base pair in the genome sequence, and attaches a sign with a different polarity to each base, thereby identifying the palindrome structure in the genome sequence. This is a palindromic structure detection system that detects a base / number conversion unit that replaces each base in the input genome sequence with a number with a corresponding sign, and in the genome sequence from both ends in the analysis range toward the center. A base pair generation unit that shifts the bases one by one and combines n bases from each end in the analysis range corresponding to the positions from both ends to generate n base pairs set in advance; , N first adders for independently adding numbers corresponding to the bases of the base pairs output from the base pair generator, and outputs of the first adders, and the addition result Having an absolute value unit for calculating an absolute value, a palindromic calculation unit and a second adder for adding the absolute values of the addition result in the analysis range.

本発明のゲノム配列における回文構造検出システムは、前記第２の加算部が０を出力した場合、前記ｎ個の塩基対による回文構造があることを検出した検出信号であることを特徴とする。 The palindrome structure detection system in the genome sequence of the present invention is a detection signal that detects the presence of the palindrome structure by the n base pairs when the second adder outputs 0. To do.

本発明のゲノム配列における回文構造検出システムは、前記回文演算部が、異なる塩基数からなる解析範囲毎に複数設けられていることを特徴とする。 The palindrome structure detection system in the genome sequence of the present invention is characterized in that a plurality of palindromic operation units are provided for each analysis range having a different number of bases.

本発明のゲノム配列における回文構造検出システムは、前記ゲノム配列における前記解析範囲を１塩基ずつ順次シフトさせ、シフト毎に回文構造の検出処理を行うことを特徴とする。 The palindrome structure detection system in the genome sequence of the present invention is characterized in that the analysis range in the genome sequence is sequentially shifted one base at a time, and palindrome structure detection processing is performed at each shift.

本発明のゲノム配列における回文構造検出システムは、前記回文演算部が前記解析範囲内において両端からｎ個の塩基にて挟まれる非加算塩基（回文構造に含まれるが回文塩基を構成しない非回文塩基）数を偶数として回文構造を検出する、前記第１の加算部と第２の加算部とを有する偶数回文演算部と、前記解析範囲内において両端からｎ個の塩基にて挟まれる非加算塩基数を奇数として回文構造を検出する、前記第１の加算部と第２の加算部とを有する奇数回文演算部とを有していることを特徴とする。 In the palindrome structure detection system in the genome sequence of the present invention, the palindrome calculation unit includes non-addition bases sandwiched by n bases from both ends within the analysis range (contains palindrome bases included in the palindrome structure). A non-palindromic base) that detects the palindrome structure as an even number, the even palindromic operation unit having the first addition unit and the second addition unit, and n bases from both ends within the analysis range And an odd palindromic operation unit having a first addition unit and a second addition unit that detect a palindrome structure with an odd number of non-addition bases sandwiched between the first and second addition units.

本発明のゲノム配列における回文構造検出システムは、両端からｎ個の塩基にて挟まれる非加算塩基数を、塩基対単位にて増加させた前記解析範囲毎の回文演算部の検出結果を重ね合わせ、回文構造における対をなす塩基数を求めることを特徴とする。 In the palindrome structure detection system in the genome sequence of the present invention, the detection result of the palindrome calculation unit for each analysis range in which the number of non-addition bases sandwiched between n bases from both ends is increased in base pair units. It is characterized in that the number of bases forming a pair in superposition and palindrome structure is obtained.

本発明のゲノム配列における回文構造検出システムは、両端からｎ個の塩基にて挟まれる非加算塩基数を、塩基対単位にて増加させた前記解析範囲の回文演算部毎に、ゲノム配列を１塩基ずつシフトした検出結果を記憶する記憶部を有し、ゲノム配列にて、回文構造が検出され、かつ中央が同一の前記解析範囲を重ね合わせ、検出された回文構造として出力することを特徴とする。 The palindrome structure detection system in the genome sequence of the present invention provides a genome sequence for each palindrome operation unit in the analysis range in which the number of non-addition bases sandwiched between n bases from both ends is increased in units of base pairs. Has a storage unit for storing the detection results obtained by shifting the base by one base, the palindrome structure is detected in the genome sequence, and the analysis range having the same center is overlapped and output as the detected palindrome structure It is characterized by that.

本発明のゲノム配列における回文構造検出システムは、前記偶数回文演算部及び奇数回文演算部各々が２入力のｎ個の第１の加算部と、該加算部毎に設けられた絶対値部と、ｎ入力の第２の加算部から構成されていることを特徴とする。
本発明のゲノム配列における回文構造検出システムは、回文構造を構成する塩基配列に対応して制限酵素が記憶されているデータベースと、前記回文構造が検出された際、検出された前記回文構造の塩基配列に対応した前記制限酵素を検索する検索部とをさらに有することを特徴とする。 The palindrome structure detection system in the genome sequence according to the present invention includes the even palindromic operation unit and the odd palindromic operation unit each having two input n first addition units and an absolute value provided for each addition unit. And an n-input second adder.
The palindrome structure detection system in the genome sequence of the present invention includes a database storing restriction enzymes corresponding to the base sequences constituting the palindrome structure, and the palindrome structure detected when the palindrome structure is detected. And a search unit that searches for the restriction enzyme corresponding to the base sequence of the sentence structure.

以上説明したように、本発明によれば、ステムループ構造を形成する可能性のある回文構造の有無を検出する際、ステムループ構造を形成する際に、各塩基対を形成する相補的な関係にある塩基に対して同一の数字で異なる符号（＋／−）を付して回文構造の検出処理を行うため、塩基ゲノム配列における大量の塩基データの解析が簡単にかつ高速に行え、またゲノム配列における回文構造の解析結果が、視覚的にも重要な部位がすぐに判別可能な結果として得られる。 As described above, according to the present invention, when detecting the presence or absence of a palindrome structure that may form a stem-loop structure, when forming the stem-loop structure, complementary bases forming each base pair are formed. Since the palindromic structure is detected by attaching the same number and different sign (+/-) to the bases in relation, analysis of a large amount of base data in the base genome sequence can be performed easily and at high speed. Moreover, the analysis result of palindrome structure in the genome sequence is obtained as a result that a visually important part can be immediately discriminated.

本発明のゲノム配列における回文構造検出システムは、ＤＮＡやＲＮＡ等におけるゲノム配列にて塩基対（塩基Ｇ及び塩基Ｃ、または塩基Ａ及び塩基Ｔ（ＲＮＡにては塩基Ｕ））を形成する塩基各々に同一の数字をあてがい、それぞれに対して極性の異なる符号を付し、例えば、本実施形態において、塩基Ｇに「＋１」を塩基Ｃに「−１」を付し、また塩基Ａに「＋２」を、塩基Ｔに「−２」を付して、ゲノム配列における回文構造を検出する回文構造検出システムである。 The palindromic structure detection system in the genomic sequence of the present invention is a base that forms a base pair (base G and base C, or base A and base T (base U in RNA)) in the genomic sequence of DNA or RNA. The same number is assigned to each, and a symbol having a different polarity is assigned to each. For example, in this embodiment, “+1” is added to the base G, “−1” is added to the base C, and “ The palindrome structure detection system detects the palindrome structure in the genome sequence by attaching “−2” to the base T and “−2”.

すなわち、本発明においては、各塩基に符号付きの数字をあてがい、塩基対を形成する塩基間に相補的な特徴を持たせることにより、塩基Ａ、Ｔ（Ｕ）、Ｃ、Ｇの単なる文字の羅列を視覚的に理解しやすくしている。
上述した特徴として、塩基対としては塩基Ａに対しては必ず塩基Ｔ（Ｕ）が組み合わされ、塩基Ｇに対しては必ず塩基Ｃが組み合わされる相補の関係にある。
本願実施形態においては、上述したように、この塩基対（相補的塩基対）における相補間の関係を＋（プラス）と−（マイナス）との符号関係（極性の異なる符号関係）で表している。
また、プリン塩基である塩基Ａと塩基Ｇとにおいては、塩基Ｇの方の分子量が大きいため、塩基Ｇを「＋２」、塩基Ａを「＋１」と表現する。またピリミジン塩基は塩基Ｃを「−２」、塩基Ｔを「−１」と表現する。 That is, in the present invention, by assigning a number with a sign to each base and giving a complementary feature between the bases forming the base pair, the simple characters of the bases A, T (U), C, and G The list is easy to understand visually.
As a feature described above, the base pair is in a complementary relationship in which the base T is always combined with the base A and the base G is always combined with the base C.
In the present embodiment, as described above, the relationship between the complements in this base pair (complementary base pair) is represented by a sign relationship between + (plus) and-(minus) (a sign relationship with different polarity). .
In addition, in the base A and the base G that are purine bases, the molecular weight of the base G is larger, so the base G is expressed as “+2” and the base A is expressed as “+1”. The pyrimidine base represents the base C as “−2” and the base T as “−1”.

これにて相補の関係と、塩基対Ａ−Ｔ系、塩基対Ｃ−Ｇ系の分類、及び分子量の大きさをこれで表している。
塩基Ｇ分子量１５１ → ＋２２進では符号つき“０１０”
塩基Ａ分子量１３５ → ＋１２進では符号つき“００１”
塩基Ｔ分子量１２６ → −１２進では符号つき“１０１”
塩基Ｃ分子量１１１ → −２２進では符号つき“１１０”
上述したように、各塩を基符号ビットとＡ−Ｔ及びＧ−Ｃの分類ビットにより表している。
これを、後述する第１の加算部において、Ａ（＋１）＋Ｔ（−１）＝０、Ｇ（＋２）＋Ｃ（−２）＝０の関係を用いた計算をすることによってステムループの構造を抽出する。 This indicates the complementary relationship, the classification of the base pair AT system, the base pair CG system, and the size of the molecular weight.
Base G Molecular weight 151 → +2 "010" with a sign in binary
Base A Molecular weight 135 → +1 Signed “001” in binary
Base T Molecular weight 126 → −1 Signed “101” in binary
Base C Molecular weight 111 → -2 Signed "110" in binary
As described above, each salt is represented by a base code bit and AT and GC classification bits.
In the first adder described later, this is calculated using the relationship of A (+1) + T (−1) = 0, G (+2) + C (−2) = 0, so that the structure of the stem loop is obtained. Extract.

そして、本発明のゲノム配列における回文構造検出システムは、入力されるゲノム配列における順次各塩基を、上述した対応する数字及び符号の付された数字に置き換える塩基／数字変換部と、この置き換えられた符号付きの数字から回文構造を検出する回文演算部が、上述したゲノム配列において、予め設定された塩基数からなる解析範囲内の両端から、この解析範囲内の中央方向に塩基を一つずつずらして、解析範囲内の各両端からｎ個ずつの塩基を両端からの位置に対応して組み合わせて、予め設定されたｎ個の塩基対を生成する塩基対生成部とを有し、塩基対生成部から出力される塩基対それぞれの塩基に対応する数字をそれぞれ独立して加算するｎ個の第１の加算部と、該第１の加算部それぞれの出力に設けられ、加算結果の絶対値を算出する絶対値部と、解析範囲内の各加算結果の絶対値を加算する第２の加算部とを有している。 Then, the palindromic structure detection system in the genome sequence of the present invention is replaced with the base / number conversion unit that sequentially replaces each base in the input genome sequence with the corresponding number and the number with the sign. The palindromic operation unit that detects the palindrome structure from the signed numbers adds a base from the both ends in the analysis range consisting of a preset number of bases in the above-described genome sequence in the center direction in the analysis range. A base pair generator that generates n base pairs that are set in advance by combining n bases from each end within the analysis range corresponding to the positions from both ends, N number of first addition units for independently adding numbers corresponding to the bases of the base pairs output from the base pair generation unit, and outputs of the first addition units, Absolute An absolute value unit for calculating a, and a second adder for adding the absolute values of the addition result in the analysis range.

ｔ−ＲＮＡやｍ−ＲＮＡの末端には、一般的に、図１に示すように、ステムループ構造が顕著に表れる。
図１（ａ）の塩基を示すＡ，Ｔ，Ｃ，Ｇの文字のランダムな羅列にては、ステムループ構造を形成する可能性のある回文構造を検出することが困難である。
しかしながら、
”Ａ，Ｃ，Ｇ，Ｃ，Ｇ，Ｔ”
となっている回文構造を、上述した符号付きの数字に変換することにより、
”＋１（１番目），−２，＋２，−２，＋２，−１（６番目）”
と符号付きの数字の配列として記述することができる。 In general, a stem-loop structure appears remarkably at the end of t-RNA or m-RNA as shown in FIG.
In the random enumeration of characters A, T, C, and G indicating the base in FIG. 1A, it is difficult to detect a palindrome structure that may form a stem-loop structure.
However,
"A, C, G, C, G, T"
By converting the palindrome structure to the above-mentioned signed number,
"+1 (first), -2, +2, -2, +2, -1 (sixth)"
And an array of signed numbers.

この配列において、
１番目の「＋１」と６番目の「−１」
２番目の「−２」と５番目の「＋２」
３番目の「＋２」と４番目の「−２」
とを塩基対として、それぞれの塩基対ごとの塩基に対応する符号付き数字を足した場合、いずれの塩基対の塩基に対応した符号付き数字の加算結果が「０」となることがわかる。
つまりＤＮＡ配列を順々に読んでいき、現在の位置Ａｎとその手前のＡｎ−ｍをターゲットとしたときに、
Ａ_ｎ＋Ａ_ｎ−ｍ＝０
Ａ_ｎ−１＋Ａ_{ｎ−ｍ＋１}＝０
Ａ_ｎ−２＋Ａ_{ｎ−ｍ＋２}＝０
が多く成り立つところに回文構造があることがわかる。ここで、ｍ（整数）は中央部に非回文塩基を含んだ解析範囲内の塩基の数であり、非回文塩基を挟んで対向する位置の塩基を塩基対として、この塩基に対応する符号付き数字を塩基対毎に加算していく。 In this array,
1st "+1" and 6th "-1"
2nd "-2" and 5th "+2"
3rd "+2" and 4th "-2"
As a base pair, it is understood that the addition result of the signed number corresponding to the base of any base pair becomes “0” when adding the signed number corresponding to the base of each base pair.
In other words, when the DNA sequence is read in sequence and the current position An and the previous An-m are targeted,
A _n + A _n−m = 0
A _n−1 + A _{n−m + 1} = 0
A _n−2 + A _{n−m + 2} = 0
It can be seen that there are palindrome structures where Here, m (integer) is the number of bases in the analysis range including a non-palindromic base in the center, and corresponds to this base with the base at the opposite position across the non-pain base as a base pair. Add signed numbers for each base pair.

例えば、
ｍ＝５のとき
Ａ_ｎ＋Ａ_ｎ−５＝０Ａ_ｎ−１＋Ａ_ｎ−４＝０Ａ_ｎ−２＋Ａ_ｎ−３＝０
ｍ＝６のとき
Ａ_ｎ＋Ａ_ｎ−６＝０Ａ_ｎ−１＋Ａ_ｎ−５＝０Ａ_ｎ−２＋Ａ_ｎ−４＝０
ｍ＝７のとき
Ａ_ｎ＋Ａ_ｎ−７＝０Ａ_ｎ−１＋Ａ_ｎ−６＝０Ａ_ｎ−２＋Ａ_ｎ−５＝０
Ａ_ｎ−３＋Ａ_ｎ−４＝０
ｍ＝８のとき
Ａ_ｎ＋Ａ_ｎ−８＝０Ａ_ｎ−１＋Ａ_ｎ−７＝０Ａ_ｎ−２＋Ａ_ｎ−６＝０
Ａ_ｎ−３＋Ａ_ｎ−５＝０となる。 For example,
When m = 5
A _n + A _n−5 = 0 A _n−1 + A _n−4 = 0 A _n−2 + A _n−3 = 0
When m = 6
A _n + A _n-6 = 0 A _n-1 + A _n-5 = 0 A _n-2 + A _n-4 = 0
When m = 7
A _n + A _n-7 = 0 A _n-1 + A _n-6 = 0 A _n-2 + A _n-5 = 0
A _n-3 + A _n-4 = 0
When m = 8
A _n + A _n-8 = 0 A _n-1 + A _n-7 = 0 A _n-2 + A _n-6 = 0
A _n−3 + A _n−5 = 0.

＜第１の実施形態＞
以下に、図２を参照して本第１の実施形態によるゲノム配列における回文構造検出システムの説明を行う。図２は、本第１の実施形態によるゲノム配列における回文構造検出システムの構成例を示すブロック図である。
本実施形態については、回文構造の解析範囲を、回文３（＝ｒ）塩基、非回文ｑ塩基、回文３（＝ｒ）塩基からなる図２（ａ）の構造として、塩基配列に対して解析範囲を順次１塩基ずつシフトさせつつ回文構造の検出を行う。
塩基／数字変換部１は、時系列に１塩基ずつシフトして入力されるゲノム配列における各塩基を、対応する符号の付された数字に置き換え、すなわち塩基Ｇを「＋２」、塩基Ａを「＋１」、塩基Ｃを「−２」、塩基Ｔを「−１」へ変換して出力する。
例えば、シフトレジスタ１５０は、上記塩基／数字変換部１から時系列に入力される符号の付された数字を、記塩基／数字変換部１が数字を出力するタイミングに応じて、１つずつ順次シフトさせ、シリアルに入力された数字をパラレルに出力する。このシフトレジスタ１５０は、後述する塩基対生成部２における解析範囲内の塩基配列数に対応する数字分だけシフトできる構成とする。後述するＡ_ｎ…Ａ_ｎ−ｊ…は、シフトレジスタ１５０のパラレル出力（時系列にシフトされる塩基配列）、すなわちシフトレジスタ１５０を構成する各レジスタが出力する塩基の核酸分類数（符号付き数字）を示している。 <First Embodiment>
The palindrome structure detection system in the genome sequence according to the first embodiment will be described below with reference to FIG. FIG. 2 is a block diagram showing a configuration example of the palindrome structure detection system in the genome sequence according to the first embodiment.
In the present embodiment, the analysis range of the palindrome structure is set as the structure of FIG. 2A composed of palindrome 3 (= r) base, non palindrome q base, palindrome 3 (= r) base, and the base sequence The palindrome structure is detected while sequentially shifting the analysis range by one base at a time.
The base / number conversion unit 1 replaces each base in the genome sequence inputted by shifting one base at a time in time series with a number with a corresponding code, that is, base G is “+2” and base A is “ +1 ", base C is converted to" -2 ", base T is converted to" -1 "and output.
For example, the shift register 150 sequentially adds the numbers with the signs inputted in time series from the base / number conversion unit 1 one by one in accordance with the timing at which the base / number conversion unit 1 outputs the numbers. Shift and output serially input numbers in parallel. The shift register 150 is configured to be able to shift by a number corresponding to the number of base sequences within the analysis range in the base pair generation unit 2 described later. A _n ... A _n−j to be described later is a parallel output of the shift register 150 (base sequence shifted in time series), that is, the number of base nucleic acid classifications (signed numbers) output from each register constituting the shift register 150. ).

塩基対生成部２は、シフトレジスタ１５０からパラレルに入力されるゲノム配列において、解析範囲内の塩基配列の両端から中央方向に塩基を一つずつずらして、該解析範囲内の各両端からｒ個ずつの塩基を両端からの位置に対応して（非回文塩基を挟んで対称位置にある塩基を）組み合わせて、予め設定された塩基対を生成して出力する。
すなわち、塩基対生成部２は、図２（ａ）における非回文塩基（非加算塩基）を挟んで、それぞれ対向する位置（非回文塩基を挟んで対称位置）にある塩基の組み合わせを塩基対として、対応する後述の回文演算部へ出力する。
以下に、図２（ｂ）のブロック図を用いて、本願実施形態におけるゲノム配列における回文構造検出システムの構成例を説明する。 The base pair generation unit 2 shifts one base from the both ends of the base sequence in the analysis range one by one in the center direction in the genome sequence input in parallel from the shift register 150, and r r from each end in the analysis range. Each base is combined corresponding to the positions from both ends (bases at symmetrical positions with a non-palinity base in between) to generate and output a preset base pair.
That is, the base pair generation unit 2 bases a combination of bases at positions opposite to each other (a symmetric position with a non-pain base) across the non-pain base (non-addition base) in FIG. As a pair, the data is output to a corresponding palindromic operation unit described later.
Hereinafter, a configuration example of the palindrome structure detection system in the genome sequence according to the present embodiment will be described with reference to the block diagram of FIG.

回文演算部６１は、偶数回文演算部であり、解析範囲として、｛回文塩基、非回文塩基、回文塩基｝＝｛３，０，３｝に対応する構成であり、３つの第１の加算部３_６１と、３つの絶対値部４_６１と１つの第２の加算部５_６１から構成されている。
第１の加算部３_６１各々は、Ａ_ｎ−５＋Ａ_ｎと、Ａ_ｎ−４＋Ａ_ｎ−１と、Ａ_ｎ−３＋Ａ_ｎ−２とのそれぞれの加算を行い、対応する絶対値部４_６１へ各々出力する。
絶対値部４_６１各々は、対応する第１の加算部３_６１から出力される演算結果の絶対値Ｓ１_ｎ，Ｓ２_ｎ，Ｓ３_ｎを出力する。
第２の加算部５_６１は、絶対値部４_６１各々からの絶対値Ｓ１_ｎ，Ｓ２_ｎ，Ｓ３_ｎを加算し、加算結果Ｐ１_ｎを出力する。この加算結果Ｐ１_ｎが「０」となれば、解析範囲における回文塩基同士が回文構造をしていることが検出される。 The palindromic operation unit 61 is an even palindromic operation unit, and has a configuration corresponding to {pastolic base, non-palindromic base, palindromic base} = {3, 0, 3} as an analysis range. The first adding unit 3 ₆₁ includes three absolute value units 4 ₆₁ and one second adding unit 5 ₆₁ .
Each of the first addition units 3 ₆₁ performs addition of A _n−5 + A _n , A _n−4 + A _n−1 , and A _n−3 + A _n−2, and the corresponding absolute value unit 4 ₆₁ to each output.
Each of the absolute value units 4 ₆₁ outputs the absolute values S1 _n , S2 _n , S3 _n of the calculation results output from the corresponding first adder 3 ₆₁ .
The second adder 5 ₆₁ adds the absolute values S1 _n , S2 _n , S3 _n from each of the absolute value units 4 ₆₁ and outputs the addition result P1 _n . If the addition result P1 _n is “0”, it is detected that palindromic bases in the analysis range have palindromic structures.

回文演算部６２は、奇数回文演算部であり、解析範囲として、｛回文塩基、非回文塩基、回文塩基｝＝｛３，１，３｝に対応する構成であり、３つの第１の加算部３_６２と、３つの絶対値部４_６２と１つの第２の加算部５_６２から構成されている。以下この非回文塩基の数が奇数の回文演算部は奇数回文演算部とし、非回文塩基の数が偶数の回文演算部は偶数回文演算部とする。
第１の加算部３_６２各々は、Ａ_ｎ−６＋Ａ_ｎと、Ａ_ｎ−５＋Ａ_ｎ−１と、Ａ_ｎ−４＋Ａ_ｎ−２とのそれぞれの加算を行い、対応する絶対値部４_６２へ各々出力する。
絶対値部４_６２各々は、対応する第１の加算部３_６２から出力される演算結果の絶対値Ｓ１_ｎ−１，Ｓ２_ｎ−１，Ｓ３_ｎ−１を出力する。
第２の加算部５_６２は、絶対値部４_６２各々からの絶対値Ｓ１_ｎ−１，Ｓ２_ｎ−１，Ｓ３_ｎ−１を加算し、加算結果Ｐ１_ｎ−１を出力する。この加算結果Ｐ１_ｎ−１が「０」となれば、解析範囲における回文塩基同士が回文構造をしていることが検出される。 The palindromic operation unit 62 is an odd palindromic operation unit, and has a configuration corresponding to {pastolic base, non-palindromic base, palindromic base} = {3, 1, 3} as an analysis range. The first addition unit 3 ₆₂ includes three absolute value units 4 ₆₂ and one second addition unit 5 ₆₂ . Hereinafter, a palindromic operation unit with an odd number of non-palinity bases is an odd palindrome operation unit, and a palindrome operation unit with an even number of non-palinity bases is an even palindrome operation unit.
Each of the first addition units 3 ₆₂ performs addition of A _n−6 + A _n , A _n−5 + A _n−1 , and A _n−4 + A _n−2, and the corresponding absolute value unit 4 Each output to ₆₂ .
Each of the absolute value units 4 ₆₂ outputs the absolute values S1 _n−1 , S2 _n−1 , and S3 _n−1 of the operation results output from the corresponding first adder 3 ₆₂ .
The second addition unit _{5 62} adds the absolute value _{_{S1 n-1, S2 n-}} 1, S3 n-1 from the absolute value unit _{4 62} respectively, and outputs the addition result _{P1 n-1.} If the addition result P1 _n-1 is “0”, it is detected that palindromic bases in the analysis range have a palindromic structure.

回文演算部６３は、偶数回文演算部であり、解析範囲として、｛回文塩基、非回文塩基、回文塩基｝＝｛３，２，３｝に対応する構成であり、４つの第１の加算部３_６３と、４つの絶対値部４_６３と、２つの第２の加算部５_６３から構成されている。
第１の加算部３_６３各々は、Ａ_ｎ−７＋Ａ_ｎと、Ａ_ｎ−６＋Ａ_ｎ−１と、Ａ_ｎ−５＋Ａ_ｎ−２と、Ａ_ｎ−４＋Ａ_ｎ−３とのそれぞれの加算を行い、対応する絶対値部４_６３へ出力する。
絶対値部４_６３各々は、対応する第１の加算部３_６３から出力される演算結果の絶対値Ｓ１_ｎ−２，Ｓ２_ｎ−２，Ｓ３_ｎ−２，Ｓ４_ｎ−２を出力する。
第２の加算部５_６３各々は、絶対値部４_６３各々からの絶対値Ｓ１_ｎ−２，Ｓ２_ｎ−２，Ｓ３_ｎ−２を、絶対値Ｓ２_ｎ−２，Ｓ３_ｎ−２，Ｓ４_ｎ−２をそれぞれ加算し、それぞれ加算結果Ｐ１_ｎ−２、加算結果Ｐ２_ｎ−２を出力する。 The palindromic operation unit 63 is an even palindromic operation unit, and has a configuration corresponding to {pastolic base, non-palindromic base, palindromic base} = {3, 2, 3} as an analysis range. The first adding unit 3 ₆₃ includes four absolute value units 4 ₆₃ and two second adding units 5 ₆₃ .
Each of the first adding units 3 ₆₃ includes A _n−7 + A _n , A _n−6 + A _n−1 , A _n−5 + A _n−2 , and A _n−4 + A _n−3 . Addition is performed and output to the corresponding absolute value portion ₄₆₃ .
Each of the absolute value units 4 ₆₃ outputs the absolute values S1 _n−2 , S2 _n−2 , S3 _n−2 , and S4 _n−2 of the calculation result output from the corresponding first adder 3 ₆₃ .
Each of the second addition units 5 ₆₃ converts the absolute values S1 _n−2 , S2 _n−2 , and S3 _n−2 from the absolute value units 4 ₆₃ into absolute values S2 _n−2 , S3 _n−2 , and S4 _{n. -2} are added, and the addition result P1 _n-2 and the addition result P2 _n-2 are output.

この加算結果Ｐ１_ｎ−２が「０」となれば、解析範囲｛回文塩基、非回文塩基、回文塩基｝＝｛３，２，３｝における回文塩基同士が回文構造をしていることが検出される。また、加算結果Ｐ２_ｎ−２が「０」となれば、解析範囲｛回文塩基、非回文塩基、回文塩基｝＝｛３，０，３｝における回文塩基同士が回文構造をしていることが検出される。
したがって、加算結果Ｐ１_ｎ−２及びＰ２_ｎ−２双方が「０」である場合、解析範囲｛回文塩基、非回文塩基、回文塩基｝＝｛３，２，３｝における解析を行わなかった非回文塩基もＡ_ｎ−４＋Ａ_ｎ−３の加算結果として「０」となることが検出されるため、最終的に解析範囲を８塩基として、解析範囲内全ての塩基が回文塩基構造であることが検出される。
上述したように、回文演算部６３は、偶数にて最小の非回文塩基数０に対応する第２の加算部と、偶数にて最小の非回文塩基数０に対して、塩基対単位、すなわち塩基数にて増加させた非回文塩基数２の解析範囲とに対応する第２の加算部とを有している。 If this addition result P1 _n−2 is “0”, the palindromic bases in the analysis range {palindromic base, non-palmographic base, palindromic base} = {3, 2, 3} have a palindromic structure. Is detected. If the addition result P2 _n−2 is “0”, the palindromic bases in the analysis range {pastogram base, non-pascal base, palindrome base} = {3, 0, 3} have the palindrome structure. Is detected.
Therefore, when both of the addition results P1 _n-2 and P2 _n-2 are “0”, the analysis is performed in the analysis range {palindromic base, non-palindromic base, palindromic base} = {3, 2, 3}. Since it was detected that the non-palindromic base that did not exist also becomes “0” as the addition result of A _n−4 + A _n−3 , the analysis range is finally set to 8 bases, and all the bases within the analysis range are palindrome. A base structure is detected.
As described above, the palindromic operation unit 63 performs base pairing with respect to the second addition unit corresponding to the minimum non-palindromic base number 0 even in the even number and the minimum non-palindromic base number 0 in the even number. And a second addition unit corresponding to the analysis range of the non-pain base number 2 increased by the unit, that is, the number of bases.

回文演算部６４は、奇数回文演算部であり、解析範囲として、｛回文塩基、非回文塩基、回文塩基｝＝｛３，３，３｝に対応する構成であり、４つの第１の加算部３_６４と、４つの絶対値部４_６４と、２つの第２の加算部５_６４から構成されている。
第１の加算部３_６４各々は、Ａ_ｎ−８＋Ａ_ｎと、Ａ_ｎ−７＋Ａ_ｎ−１と、Ａ_ｎ−６＋Ａ_ｎ−２と、Ａ_ｎ−５＋Ａ_ｎ−３とのそれぞれの加算を行い、対応する絶対値部４_６４へ出力する。
絶対値部４_６４各々は、対応する第１の加算部３_６４から出力される演算結果の絶対値Ｓ１_ｎ−３，Ｓ２_ｎ−３，Ｓ３_ｎ−３，Ｓ４_ｎ−３を出力する。
第２の加算部５_６４各々は、絶対値部４_６４各々からの絶対値Ｓ１_ｎ−３，Ｓ２_ｎ−３，Ｓ３_ｎ−３を、絶対値Ｓ２_ｎ−３，Ｓ３_ｎ−３，Ｓ４_ｎ−３をそれぞれ加算し、それぞれ加算結果Ｐ１_ｎ−３、加算結果Ｐ２_ｎ−３を出力する。 The palindromic operation unit 64 is an odd palindromic operation unit, and has a configuration corresponding to {pastolic base, non-palindromic base, palindromic base} = {3, 3, 3} as an analysis range. The first adder unit 3 ₆₄ includes four absolute value units 4 ₆₄ and two second adder units 5 ₆₄ .
Each of the first addition units 3 ₆₄ includes A _n−8 + A _n , A _n−7 + A _n−1 , A _n−6 + A _n−2 , and A _n−5 + A _n−3 . Addition is performed and output to the corresponding absolute value portion ₄₆₄ .
Each of the absolute value units 4 ₆₄ outputs the absolute values S1 _n−3 , S2 _n−3 , S3 _n−3 , and S4 _n−3 of the operation result output from the corresponding first adder 3 ₆₄ .
Each of the second addition units 5 ₆₄ converts the absolute values S1 _n−3 , S2 _n−3 , and S3 _n−3 from the absolute value units 4 ₆₄ into the absolute values S2 _n−3 , S3 _n−3 , and S4 _{n. -3} is added, and the addition result P1 _n-3 and the addition result P2 _n-3 are output.

この加算結果Ｐ１_ｎ−３が「０」となれば、解析範囲｛回文塩基、非回文塩基、回文塩基｝＝｛３，３，３｝における回文塩基同士が回文構造をしていることが検出される。また、加算結果Ｐ２_ｎ−３が「０」となれば、解析範囲｛回文塩基、非回文塩基、回文塩基｝＝｛３，１，３｝における回文塩基同士が回文構造をしていることが検出される。
したがって、加算結果Ｐ１_ｎ−３及びＰ２_ｎ−３双方が「０」である場合、解析範囲｛回文塩基、非回文塩基、回文塩基｝＝｛３，３，３｝における解析を行わなかった非回文塩基もＡ_ｎ−５＋Ａ_ｎ−３の加算結果として「０」となることが検出されるため、最終的に解析範囲を９塩基として、中央の塩基Ａ_ｎ−４を除く８つの塩基が回文塩基構造であることが検出される。
上述したように、回文演算部６４は、奇数にて最小の非回文塩基数１に対応する第２の加算部と、奇数にて最小の非回文塩基数１に対して、塩基対単位、すなわち塩基数にて増加させた非回文塩基数３の解析範囲とに対応する第２の加算部とを有している。
また、上述した奇数回文演算部と偶数回文演算部との結果を組み合わせることにより、ゲノム配列における回文構造を解析することが可能となる。 If this addition result P1 _n-3 is “0”, the palindromic bases in the analysis range {palindromic base, non-pastronomic base, palindromic base} = {3, 3, 3} have a palindromic structure. Is detected. Also, if the addition result P2 _n-3 is “0”, the palindromic bases in the analysis range {palindromic base, non-palatomic base, palindromic base} = {3, 1, 3} have a palindromic structure. Is detected.
Therefore, when both of the addition results P1 _n-3 and P2 _n-3 are “0”, the analysis is performed in the analysis range {palindromic base, non-palindromic base, palindromic base} = {3, 3, 3}. Since it was detected that the non-palindromic base that did not exist becomes “0” as the addition result of A _n−5 + A _n−3 , the analysis range is finally set to 9 bases and the central base _An−4 is excluded. Eight bases are detected to have a palindromic base structure.
As described above, the palindromic operation unit 64 performs base pairing with respect to the second addition unit corresponding to the smallest non-palindromic base number 1 at the odd number and the smallest non-palindromic base number 1 at the odd number. A second adding unit corresponding to the analysis range of the non-pain base number 3 increased in units, that is, the number of bases.
Moreover, it becomes possible to analyze the palindrome structure in the genome sequence by combining the results of the odd palindromic operation unit and the even palindromic operation unit described above.

また、図３には、図２（ｂ）における構成にて、解析範囲｛回文塩基、非回文塩基、回文塩基｝＝｛３，４，３｝、解析範囲｛回文塩基、非回文塩基、回文塩基｝＝｛３，５，３｝、解析範囲｛回文塩基、非回文塩基、回文塩基｝＝｛３，６，３｝、解析範囲｛回文塩基、非回文塩基、回文塩基｝＝｛３，７，３｝を解析するものであり、実質的に解析範囲として塩基数１０、１１，１２，１３の回文構造を解析する。 Also, in FIG. 3, in the configuration in FIG. 2 (b), the analysis range {palindromic base, non-palumal base, palindromic base} = {3, 4, 3}, analysis range {palindromic base, non Palindromic base, palindromic base} = {3,5,3}, analysis range {palindromic base, non-pastronomic base, palindromic base} = {3, 6, 3}, analysis range {palindromic base, non The palindromic base, palindromic base} = {3, 7, 3} is analyzed, and the palindrome structure having the base numbers of 10, 11, 12, 13 is substantially analyzed as the analysis range.

すなわち、回文演算部６５は、偶数回文演算部であり、解析範囲として、｛回文塩基、非回文塩基、回文塩基｝＝｛３，４，３｝に対応する構成であり、５つの第１の加算部３_６５と、５つの絶対値部４_６５と、３つの第２の加算部５_６５から構成されている。
第１の加算部３_６５各々は、Ａ_ｎ−９＋Ａ_ｎと、Ａ_ｎ−８＋Ａ_ｎ−１と、Ａ_ｎ−７＋Ａ_ｎ−２と、Ａ_ｎ−６＋Ａ_ｎ−３、Ａ_ｎ−５＋Ａ_ｎ−４とのそれぞれの加算を行い、対応する絶対値部４_６５へ出力する。
絶対値部４_６５各々は、対応する第１の加算部３_６５から出力される演算結果の絶対値Ｓ１_ｎ−４，Ｓ２_ｎ−４，Ｓ３_ｎ−４，Ｓ４_ｎ−４，Ｓ５_ｎ−４を出力する。
第２の加算部５_６５各々は、絶対値４_６５各々からの絶対値Ｓ１_ｎ−４，Ｓ２_ｎ−４，Ｓ３_ｎ−４を、絶対値Ｓ２_ｎ−４，Ｓ３_ｎ−４，Ｓ４_ｎ−４を、絶対値Ｓ３_ｎ−４，Ｓ４_ｎ−４，Ｓ５_ｎ−４をそれぞれ加算し、それぞれ加算結果Ｐ１_ｎ−４、加算結果Ｐ２_ｎ−４、加算結果Ｐ３_ｎ−４を出力する。 That is, the palindromic operation unit 65 is an even palindromic operation unit, and has a configuration corresponding to {palindromic base, non-palatomic base, palindromic base} = {3,4,3} as an analysis range, The first adder unit ₃₆₅ includes five first adder units ₃₆₅ , five absolute value units 4 ₆₅ , and three second adder units 5 ₆₅ .
Each of the first addition units 3 ₆₅ includes A _n−9 + A _n , A _n−8 + A _n−1 , A _n−7 + A _n−2 , A _n−6 + A _n−3 , A _{n−. 5} + A _n−4 is added and output to the corresponding absolute value portion ₄₆₅ .
The absolute value unit _{4 65} each, corresponding first adding unit ₃ is the operation result output from the ₆₅ absolute value _{_{_{S1 n-4, S2 n-}}} 4, S3 n-4, S4 n-4, S5 n-4 Is output.
Each of the second addition units 5 ₆₅ converts the absolute values S1 _n−4 , S2 _n−4 and S3 _n−4 from the absolute values 4 ₆₅ into absolute values S2 _n−4 , S3 _n−4 and S4 _{n−. 4} are added to absolute values S3 _n-4 , S4 _n-4 and S5 _n-4 , respectively, and an addition result P1 _n-4 , an addition result P2 _n-4 and an addition result P3 _n-4 are output.

この加算結果Ｐ１_ｎ−４が「０」となれば、解析範囲｛回文塩基、非回文塩基、回文塩基｝＝｛３，４，３｝における回文塩基同士が回文構造をしていることが検出される。また、加算結果Ｐ２_ｎ−４が「０」となれば、解析範囲｛回文塩基、非回文塩基、回文塩基｝＝｛３，２，３｝における回文塩基同士が回文構造をしていることが検出される。また、加算結果Ｐ３_ｎ−４が「０」となれば、解析範囲｛回文塩基、非回文塩基、回文塩基｝＝｛３，０，３｝における回文塩基同士が回文構造をしていることが検出される。
したがって、加算結果Ｐ１_ｎ−４、Ｐ２_ｎ−４及びＰ３_ｎ−４の全てが「０」である場合、解析範囲｛回文塩基、非回文塩基、回文塩基｝＝｛３，４，３｝における解析を行わなかった非回文塩基も、Ａ_ｎ−５＋Ａ_ｎ−４、Ａ_ｎ−６＋Ａ_ｎ−３各々の加算結果が「０」となることが検出されるため、最終的に解析範囲を１０塩基として、解析範囲内全ての塩基が回文塩基構造であることが検出される。
上述したように、回文演算部６５は、偶数にて最小の非回文塩基数０に対応する第２の加算部と、偶数にて最小の非回文塩基数０に対して、塩基対単位、すなわち塩基数にて増加させた非回文塩基数２の解析範囲と、さらに塩基数にて増加させた非回文塩基数４の解析範囲とに対応する第２の加算部とを有している。 If this addition result P1 _n-4 is “0”, the palindromic bases in the analysis range {pastigial base, non-pastronomic base, palindromic base} = {3,4,3} have a palindromic structure. Is detected. Also, if the addition result P2 _n-4 is “0”, the palindromic bases in the analysis range {pastedial base, non-palatomic base, palindromic base} = {3, 2, 3} have a palindromic structure. Is detected. Further, if the addition result P3 _n-4 is “0”, the palindromic bases in the analysis range {pastedial base, non-palatomic base, palindromic base} = {3, 0, 3} have a palindromic structure. Is detected.
Therefore, when all of the addition results P1 _n-4 , P2 _n-4, and P3 _n-4 are “0”, the analysis range {palindromic base, non-palindromic base, palindromic base} = {3,4, 3}, it is detected that the addition result of each of A _n−5 + A _n−4 and A _n−6 + A _n−3 is “0”. When the analysis range is 10 bases, it is detected that all bases within the analysis range have a palindromic base structure.
As described above, the palindromic operation unit 65 performs base pairing on the second addition unit corresponding to the smallest non-palindromic base number 0 even in the even number and the non-palindromic base number 0 smallest in the even number. There is a second addition unit corresponding to the analysis range of the non-pain base number 2 increased in units, that is, the number of bases, and the analysis range of the non-pain base number 4 increased in number of bases. doing.

回文演算部６６は、奇数回文演算部であり、解析範囲として、｛回文塩基、非回文塩基、回文塩基｝＝｛３，５，３｝に対応する構成であり、５つの第１の加算部３_６６と、５つの絶対値部４_６６と、３つの第２の加算部５_６６から構成されている。
第１の加算部３_６６各々は、Ａ_ｎ−１０＋Ａ_ｎと、Ａ_ｎ−９＋Ａ_ｎ−１と、Ａ_ｎ−８＋Ａ_ｎ−２と、Ａ_ｎ−７＋Ａ_ｎ−３と、Ａ_ｎ−６＋Ａ_ｎ−４とのそれぞれの加算を行い、対応する絶対値部４_６６へ出力する。
絶対値部４_６６各々は、対応する第１の加算部３_６６から出力される演算結果の絶対値Ｓ１_ｎ−５，Ｓ２_ｎ−５，Ｓ３_ｎ−５，Ｓ４_ｎ−５，Ｓ５_ｎ−５を出力する。
第２の加算部５_６６各々は、絶対値４_６６各々からの絶対値Ｓ１_ｎ−５，Ｓ２_ｎ−５，Ｓ３_ｎ−５を、絶対値Ｓ２_ｎ−５，Ｓ３_ｎ−５，Ｓ４_ｎ−５を、絶対値Ｓ３_ｎ−５，Ｓ４_ｎ−５，Ｓ５_ｎ−５をそれぞれ加算し、それぞれ加算結果Ｐ１_ｎ−５、加算結果Ｐ２_ｎ−５、加算結果Ｐ３_ｎ−５を出力する。 The palindromic operation unit 66 is an odd palindromic operation unit, and has a configuration corresponding to {pastolic base, non-palindromic base, palindromic base} = {3, 5, 3} as an analysis range. The first adding unit 3 ₆₆ includes five absolute value units 4 ₆₆ and three second adding units 5 ₆₆ .
Each of the first addition units 3 ₆₆ includes A _n−10 + A _n , A _n−9 + A _n−1 , A _n−8 + A _n−2 , A _n−7 + A _n−3 , _An Each of _-6 + A _n-4 is added and output to the corresponding absolute value portion ₄₆₆ .
Each of the absolute value units 4 ₆₆ is the absolute value S1 _n-5 , S2 _n-5 , S3 _n-5 , S4 _n-5 , S5 _{n-5 of the} operation result output from the corresponding first adder 3 _66. Is output.
Each of the second adders 5 ₆₆ converts the absolute values S1 _n-5 , S2 _n-5 , S3 _n-5 from the absolute values 4 ₆₆ into the absolute values S2 _n-5 , S3 _n-5 , S4 _{n- 5} , absolute values S3 _n-5 , S4 _n-5 , S5 _n-5 are added, respectively, and an addition result P1 _n-5 , an addition result P2 _n-5 , and an addition result P3 _n-5 are output.

この加算結果Ｐ１_ｎ−５が「０」となれば、解析範囲｛回文塩基、非回文塩基、回文塩基｝＝｛３，５，３｝における回文塩基同士が回文構造をしていることが検出される。また、加算結果Ｐ２_ｎ−５が「０」となれば、解析範囲｛回文塩基、非回文塩基、回文塩基｝＝｛３，３，３｝における回文塩基同士が回文構造をしていることが検出される。また、加算結果Ｐ３_ｎ−５が「０」となれば、解析範囲｛回文塩基、非回文塩基、回文塩基｝＝｛３，１，３｝における回文塩基同士が回文構造をしていることが検出される。
したがって、加算結果Ｐ１_ｎ−５、Ｐ２_ｎ−５及びＰ３_ｎ−５の全てが「０」である場合、解析範囲｛回文塩基、非回文塩基、回文塩基｝＝｛３，５，３｝における解析を行わなかった非回文塩基もＡ_ｎ−６＋Ａ_ｎ−４及びＡ_ｎ−７＋Ａ_ｎ−３の双方の加算結果が「０」となることが検出されるため、最終的に解析範囲を１１塩基として、中央の塩基Ａ_ｎ−５を除く１０個の塩基が回文塩基構造であることが検出される。
さらに、偶数回文演算部である回文演算部６７及び、奇数回分演算部である回数演算部６８等を増加させてゆくことにより、どのような塩基数を有する長さの回文構造でも検出を行うことができる。 If this addition result P1 _n-5 is “0”, the palindromic bases in the analysis range {pastigial base, non-palatomic base, palindromic base} = {3, 5, 3} have a palindromic structure. Is detected. Further, if the addition result P2 _n-5 is “0”, the palindromic bases in the analysis range {palindromic base, non-pastronomic base, palindromic base} = {3, 3, 3} have the palindrome structure. Is detected. Also, if the addition result P3 _n-5 is “0”, the palindromic bases in the analysis range {palindromic base, non palindromic base, palindromic base} = {3, 1, 3} have a palindromic structure. Is detected.
Therefore, when all of the addition results P1 _n-5 , P2 _n-5 and P3 _n-5 are “0”, the analysis range {palindromic base, non-palindromic base, palindromic base} = {3, 5, 3}, it is detected that the addition result of both A _n−6 + A _n−4 and A _n−7 + A _n−3 is “0”. The analysis range is 11 bases, and it is detected that 10 bases except the central base _An-5 have a palindromic base structure.
Furthermore, by increasing the palindromic operation unit 67 that is an even palindromic operation unit and the number operation unit 68 that is an odd-numbered operation unit, a palindrome structure having any number of bases can be detected. It can be performed.

＜第２の実施形態＞
次に、図４を用いて本発明の第２の実施形態によるゲノム配列における回文構造検出システムの説明を行う。図４は、本第２の実施形態によるゲノム配列における塩基数が偶数の解析範囲における回文構造を検出する動作を示すブロック図である。図４の第２の実施形態において、図２の第１の実施形態と同様の構成については同一の符号を付してその説明を省略する。
第１の実施形態においては、解析範囲が小さい場合は高速に回文構造の解析が行えるが、解析範囲が広くなるほど、図３に示すように加算器の数が大きくなり計算量が増加していくことが予想される。
しかしながら、図５に示すように、ゲノム配列において塩基Ａｎまで読み終わった時点にて、ｔ＝ｎとして図２における結果を見たとき、ｔ＝ｎ−１においても同様の解析範囲（同一の塩基数の解析範囲）に対する検出処理を行っていることが判る。 <Second Embodiment>
Next, the palindrome structure detection system in the genome sequence according to the second embodiment of the present invention will be described with reference to FIG. FIG. 4 is a block diagram showing an operation of detecting a palindrome structure in an analysis range with an even number of bases in the genome sequence according to the second embodiment. In the second embodiment of FIG. 4, the same components as those of the first embodiment of FIG.
In the first embodiment, when the analysis range is small, the palindrome structure can be analyzed at high speed. However, as the analysis range becomes wider, the number of adders increases as shown in FIG. It is expected to go.
However, as shown in FIG. 5, when the results in FIG. 2 are viewed with t = n at the time of reading to the base An in the genome sequence, the same analysis range (same bases) is obtained at t = n−1. It can be seen that the detection process is being performed for the numerical analysis range.

すなわち、ゲノム配列において回文を検出する位置が１塩基ずれているのみであり、ゲノム配列における塩基を読み込む時間がずれているだけで、加算処理としては同様の処理を行っている。
したがって、ゲノム配列の時間推移を考慮すると、図２の第１の実施形態における加算構造を、図４に示す第２の実施形態による加算構造にて構成することができる（第１の実施形態と同様に、回文構造が３塩基以上のものについて）。また、当然さらに大きな塩基数の回文構造も同じ加算の構造で表すことができる。 In other words, the same processing is performed as the addition processing only by the position where the palindrome is detected in the genome sequence is shifted by one base and the time for reading the base in the genome sequence is shifted.
Therefore, in consideration of the temporal transition of the genome sequence, the addition structure in the first embodiment of FIG. 2 can be configured with the addition structure according to the second embodiment shown in FIG. Similarly, for palindromic structures with 3 or more bases). Of course, a palindrome structure having a larger number of bases can be expressed by the same addition structure.

図４の構成において、回文演算部７０は、偶数回文演算部であり、解析範囲として、｛回文塩基、非回文塩基、回文塩基｝＝｛３，０，３｝に対応する構成であり、３つの第１の加算部３_７０と、３つの絶対値部４_７０と、１つの第２の加算部５_７０から構成されている。
第１の加算部３_７０各々は、Ａ_ｎ−５＋Ａ_ｎと、Ａ_ｎ−４＋Ａ_ｎ−１と、Ａ_ｎ−３＋Ａ_ｎ−２とのそれぞれの加算を行い、対応する絶対値部４_７０へ出力する。
絶対値部４_７０各々は、対応する第１の加算部３_７０から出力される演算結果の絶対値Ｓ１_ｎ，Ｓ２_ｎ，Ｓ３_ｎを出力する。
第２の加算部５_７０各々は、絶対値部４_７０各々からの絶対値Ｓ１_ｎ，Ｓ２_ｎ，Ｓ３_ｎを加算し、それぞれ加算結果Ｐ１_ｎを出力する。 In the configuration of FIG. 4, the palindromic operation unit 70 is an even palindromic operation unit, and corresponds to {pastolic base, non-palindromic base, palindromic base} = {3, 0, 3} as an analysis range. The configuration includes three first adder units ₃₇₀ , three absolute value units ₄₇₀ , and one second adder unit ₅₇₀ .
Each of the first addition units 3 ₇₀ performs addition of A _n−5 + A _n , A _n−4 + A _n−1 , and A _n−3 + A _n−2, and the corresponding absolute value unit 4 Output to ₇₀ .
The absolute value unit _{4 70} each of which outputs the absolute value _S1 _{_n,} S2 _n, S3 n of the operation result output from the corresponding first adder _{3 70.}
Second adding unit _{5 70} respectively adds the absolute value _S1 _{_n,} S2 _n, S3 n from the absolute value unit _{4 70} respectively, and outputs each sum P1 _n.

回文演算部７１は、奇数回文演算部であり、解析範囲として、｛回文塩基、非回文塩基、回文塩基｝＝｛３，１，３｝に対応する構成であり、３つの第１の加算部３_７１と、３つの絶対値部４_７１と、１つの第２の加算部５_７１から構成されている。
第１の加算部３_７１各々は、Ａ_ｎ−６＋Ａ_ｎと、Ａ_ｎ−５＋Ａ_ｎ−１と、Ａ_ｎ−４＋Ａ_ｎ−２とのそれぞれの加算を行い、対応する絶対値部４_７１へ出力する。
絶対値部４_７１各々は、対応する第１の加算部３_７１から出力される演算結果の絶対値Ｓ１_ｎ−１，Ｓ２_ｎ−１，Ｓ３_ｎ−１を出力する。
第２の加算部５_７１各々は、絶対値部４_７１各々からの絶対値Ｓ１_ｎ−１，Ｓ２_ｎ−１，Ｓ３_ｎ−１を加算し、それぞれ加算結果Ｐ１_ｎ−１を出力する。 The palindromic operation unit 71 is an odd palindromic operation unit, and has a configuration corresponding to {pastolic base, non-palindromic base, palindromic base} = {3, 1, 3} as an analysis range. The first adding unit 3 ₇₁ includes three absolute value units 4 ₇₁ and one second adding unit 5 ₇₁ .
Each of the first addition units 3 ₇₁ performs addition of A _n−6 + A _n , A _n−5 + A _n−1 , and A _n−4 + A _n−2, and the corresponding absolute value unit 4 _{To 71} .
Each of the absolute value units 4 ₇₁ outputs the absolute values S1 _n−1 , S2 _n−1 , S3 _n−1 of the calculation results output from the corresponding first adder 3 ₇₁ .
Each of the second addition units 5 ₇₁ adds the absolute values S1 _n−1 , S2 _n−1 , S3 _n−1 from each of the absolute value units 4 ₇₁ and outputs an addition result P1 _n−1 .

回文演算部７２は、偶数回文演算部であり、解析範囲として、｛回文塩基、非回文塩基、回文塩基｝＝｛３，２，３｝に対応する構成であり、３つの第１の加算部３_７２と、３つの絶対値部４_７２と、１つの第２の加算部５_７２から構成されている。
第１の加算部３_７２各々は、Ａ_ｎ−７＋Ａ_ｎと、Ａ_ｎ−６＋Ａ_ｎ−１と、Ａ_ｎ−５＋Ａ_ｎ−２とのそれぞれの加算を行い、対応する絶対値部４_７２へ出力する。
絶対値部４_７２各々は、対応する第１の加算部３_７２から出力される演算結果の絶対値Ｓ１_ｎ−２，Ｓ２_ｎ−２，Ｓ３_ｎ−２を出力する。
第２の加算部５_７２各々は、絶対値部４_７２各々からの絶対値Ｓ１_ｎ−２，Ｓ２_ｎ−２，Ｓ３_ｎ−２を加算し、それぞれ加算結果Ｐ１_ｎ−２を出力する。 The palindromic operation unit 72 is an even palindromic operation unit, and has a configuration corresponding to {pastolic base, non-pastronomic base, palindromic base} = {3, 2, 3} as an analysis range. The first addition unit 3 ₇₂ , the three absolute value units 4 _72, and the second addition unit 5 ₇₂ are configured.
Each of the first addition units 3 ₇₂ adds each of A _n−7 + A _n , A _n−6 + A _n−1 , and A _n−5 + A _n−2, and the corresponding absolute value unit 4 ₇₂ .
Each of the absolute value units 4 ₇₂ outputs the absolute values S1 _n−2 , S2 _n−2 , and S3 _n−2 of the calculation results output from the corresponding first adder 3 ₇₂ .
Each of the second addition units 5 ₇₂ adds the absolute values S1 _n−2 , S2 _n−2 , and S3 _n−2 from each of the absolute value units 4 ₇₂ and outputs an addition result P1 _n−2 respectively.

回文演算部７３は、奇数回文演算部であり、解析範囲として、｛回文塩基、非回文塩基、回文塩基｝＝｛３，３，３｝に対応する構成であり、３つの第１の加算部３_７３と、３つの絶対値部４_７３と、１つの第２の加算部５_７３から構成されている。
第１の加算部３_７３各々は、Ａ_ｎ−８＋Ａ_ｎと、Ａ_ｎ−７＋Ａ_ｎ−１と、Ａ_ｎ−６＋Ａ_ｎ−２とのそれぞれの加算を行い、対応する絶対値部４_７３へ出力する。
絶対値部４_７３各々は、対応する第１の加算部３_７３から出力される演算結果の絶対値Ｓ１_ｎ−３，Ｓ２_ｎ−３，Ｓ３_ｎ−３を出力する。
第２の加算部５_７３各々は、絶対値部４_７３各々からの絶対値Ｓ１_ｎ−３，Ｓ２_ｎ−３，Ｓ３_ｎ−３を加算し、それぞれ加算結果Ｐ１_ｎ−３を出力する。 The palindromic operation unit 73 is an odd palindromic operation unit, and has a configuration corresponding to {pastolic base, non-palindromic base, palindromic base} = {3, 3, 3} as an analysis range. The first adding unit 3 ₇₃ includes three absolute value units 4 ₇₃ and one second adding unit 5 ₇₃ .
Each of the first addition units 3 ₇₃ performs addition of A _n−8 + A _n , A _n−7 + A _n−1 , and A _n−6 + A _n−2, and the corresponding absolute value unit 4 _{To 73} .
Each of the absolute value units 4 ₇₃ outputs the absolute values S1 _n−3 , S2 _n−3 , and S3 _n−3 of the calculation result output from the corresponding first adder 3 ₇₃ .
Each of the second addition units 5 ₇₃ adds the absolute values S1 _n-3 , S2 _n-3 , S3 _n-3 from each of the absolute value units 4 ₇₃ and outputs an addition result P1 _n-3 .

また、本願発明においては、図６に示すように、さらに大きな解析範囲である各回文演算部７４（偶数回文演算部）、７５（奇数回文演算部）、７６（偶数回文演算部）及び７７（奇数回文演算部）においても、図７に示す３つの第１の加算部３、３つの絶対値部４及び１つの第２の加算部５にて構成することができる。
上述した図７において、第１の加算器３個と、各第１の加算器の計算結果の絶対値を取る絶対値部と、この絶対値部の出力する第２の加算部からなるユニットを回文計算ユニットとすると、例えば解析範囲（塩基数ｍ）が６塩基や７塩基における回文構造は、３塩基づつの対構造を取り出すことになり、回文演算ユニットは１つ、順に８塩基や９塩基では回文演算ユニット２つとなり、ｎ塩基ではｒ＝Ｆ（（ｎ−４）／２）となるｒ個の回文演算ユニットにて演算することができる。ここで、上記Ｆ（）は、（）内の計算結果を小数点切り捨てた整数値を出力する関数である。 In the present invention, as shown in FIG. 6, each palindromic operation unit 74 (even number palindrome operation unit), 75 (odd palindrome operation unit), and 76 (even number palindrome operation unit) which are larger analysis ranges. And 77 (odd palindromic operation unit) can also be constituted by three first addition units 3, three absolute value units 4 and one second addition unit 5 shown in FIG.
In FIG. 7 described above, a unit consisting of three first adders, an absolute value portion that takes the absolute value of the calculation result of each first adder, and a second adder that is output from the absolute value portion is provided. Assuming a palindromic calculation unit, for example, a palindrome structure with an analysis range (number of bases m) of 6 bases or 7 bases is to extract a pair structure of 3 bases. Or, 9 bases can provide two palindromic operation units, and n bases can be calculated by r palindromic operation units such that r = F ((n−4) / 2). Here, F () is a function that outputs an integer value obtained by rounding down the calculation result in ().

ただし、上記回文演算ユニットだけでは３塩基づつの回文構造を表しているだけであり、４塩基づつ以上の回文構造を検出することはできない。
この４塩基づつ以上の回文構造を検出するためには、ｔ＝ｎ，ｔ＝ｎ−１，ｔ＝ｎ−２，…のときのもので、同時に回文演算ユニットの値が「０」になっていることが必要となる。
すなわち、回文演算部７０の｛回文塩基、非回文塩基、回文塩基｝＝｛３，０，３｝と、回文演算部７２の｛回文塩基、非回文塩基、回文塩基｝＝｛３，２，３｝とにおいて、図８に示すように、ｔ＝ｎの時点において回文演算部７２が「０」であり、かつその直前、すなわち１塩基前のｔ＝ｎ−１の時点において回文演算部７０が「０」である場合、回文演算部７０の出力する回文構造｛３，０，３｝と、回文演算部７２の回文構造｛３，２，３｝との双方を組み合わせることにより、第１の実施形態と同様に、｛回文塩基、非回文塩基、回文塩基｝＝｛４，０，４｝の４塩基対を構成する回文構造が存在することが検出される。 However, the palindrome unit only represents a palindrome structure of 3 bases, and a palindrome structure of 4 bases or more cannot be detected.
In order to detect the palindrome structure of 4 bases or more, it is the case when t = n, t = n−1, t = n−2,. It is necessary to become.
That is, {pastolic base, non-pastronomic base, palindromic base} = {3, 0, 3} of palindromic operation unit 70 and {pascalial base, non-pascal base, palindrome of palindromic operation unit 72 As shown in FIG. 8, when the base} = {3, 2, 3}, the palindromic operation unit 72 is “0” at the time t = n, and immediately before that, that is, t = n one base before. When the palindrome calculation unit 70 is “0” at the time of −1, the palindrome structure {3, 0, 3} output from the palindrome calculation unit 70 and the palindrome structure {3, 2 and 3} are combined to form four base pairs of {palindromic base, non-palindromic base, palindromic base} = {4, 0, 4}, as in the first embodiment. The presence of a palindrome structure is detected.

次に、図９，図１０及び図１１を参照して第２の実施形態によるゲノム配列における回文構造検出システムの動作を説明する。図９は回文構造システムの動作を示す回文構造パターン生成の全体を示す概念図である。図１０は図９における領域Ａの拡大図であり、図１１は図９における領域Ｂの拡大図である。この図９は奇数回文演算部により演算された塩基数が奇数の解析範囲における回文構造を検出した結果を示す回文構造パターン生成に関する概念図である。
以下の説明における回文構造パターン生成を説明する図９及び図１４は、図示しない表示部が回文構造検出システムの検出結果を解析結果図として表示するものである。
最も左の列の欄がゲノム（ＤＮＡ）配列における塩基の配列の先頭からの順番を示す位置番号（塩基／数字変換部１から出力される塩基配列の先頭から何番目の塩基から塩基配列が開始されているかを示す番号）が記載され、次の２つの欄にそれぞれの番号に対応した核酸塩基名および部分的に有意なコドン名が記載されている。核酸分類数（符号付き数字）の列の欄は、４列存在しており、いずれの列の欄も同様であり、塩基／数字変換部１により左の欄の各塩基が変換された符号付き数字（核酸分類数）が記載されている。ここで、各核酸分類数（符号付き数字）の列は、それぞれ検出する回文構造の種類に対応しており、回文構造と検出された塩基配列の領域の核酸分類数を囲む、あるいは色を変えるなどのマーキングを行い視覚的にいずれの領域が回文構造であるかを示すために用いられる。
すなわち、図９の左すみの列から、Ａ、Ｔ、Ｇ、Ｃの塩基名と、この塩基名Ａ、Ｔ、Ｇ、Ｃに対応した＋１、−１、＋２、−２の符号付き数字が示されている。 Next, the operation of the palindrome structure detection system in the genome sequence according to the second embodiment will be described with reference to FIGS. 9, 10 and 11. FIG. FIG. 9 is a conceptual diagram showing the entire palindrome structure pattern generation showing the operation of the palindrome structure system. 10 is an enlarged view of region A in FIG. 9, and FIG. 11 is an enlarged view of region B in FIG. FIG. 9 is a conceptual diagram relating to palindrome structure pattern generation showing the result of detecting the palindrome structure in the analysis range where the number of bases calculated by the odd palindrome calculation unit is odd.
9 and 14 for explaining the palindrome structure pattern generation in the following description, the display unit (not shown) displays the detection result of the palindrome structure detection system as an analysis result diagram.
The leftmost column is the position number indicating the order from the top of the base sequence in the genome (DNA) sequence (the base sequence starts from the base number of the base sequence output from the base / number converter 1) No. indicating whether or not a nucleobase name and a partially significant codon name corresponding to each number are described in the next two columns. There are four columns in the column of the number of nucleic acid classifications (signed numbers), and all the columns are the same, and each base in the left column is converted by the base / number conversion unit 1 with a sign. Numbers (number of nucleic acid classifications) are described. Here, each column of the number of nucleic acid classifications (signed numbers) corresponds to the type of palindrome structure to be detected, and surrounds the number of nucleic acid classifications in the palindrome structure and the detected base sequence region, or the color It is used to visually indicate which region has a palindrome structure by marking such as changing.
That is, from the column on the left side of FIG. 9, the base names of A, T, G, and C and the signed numbers of +1, −1, +2, and −2 corresponding to the base names A, T, G, and C are shown. It is shown.

次の回文構造｛３，１，３｝の下部にあるＳ１，Ｓ２及びＳ３の欄には、回文演算部７１における各第１の加算部３_７１各々が加算し、絶対値部４_７１がこの加算結果を絶対値化した絶対値Ｓ１_ｎ−１，Ｓ２_ｎ−１，Ｓ３_ｎ−１各々の出力が示されている。
また、回文構造｛３，１，３｝の下部にあるＰ１の欄には、第２の加算部５_７１が絶対値Ｓ１_ｎ−１，Ｓ２_ｎ−１，Ｓ３_ｎ−１各々を加算したＰ１_ｎ−１の結果が記載されている。
同様に、次の回文構造｛３，３，３｝の下部にあるＳ１，Ｓ２及びＳ３の欄には、回文演算部７３における各第１の加算部３_７３各々が加算し、絶対値部４_７３がこの加算結果を絶対値化した絶対値Ｓ１_ｎ−３，Ｓ２_ｎ−３，Ｓ３_ｎ−３各々の出力が示されている。
また、回文構造｛３，３，３｝の下部にあるＰ１の欄には、第２の加算部５_７３が絶対値Ｓ１_ｎ−３，Ｓ２_ｎ−３，Ｓ３_ｎ−３各々を加算したＰ１_ｎ−３の結果が記載されている。 Each of the first addition units 3 ₇₁ in the palindromic operation unit 71 adds to the columns of S1, S2 and S3 below the next palindrome structure {3, 1, 3}, and the absolute value unit 4 ₇₁ The outputs of the absolute values S1 _n−1 , S2 _n−1 , S3 _{n−1 obtained} by converting the addition result into absolute values are shown.
Further, the P1 section of the bottom of the palindrome {3,1,3}, the second adding unit _{5 71} obtained by adding the absolute value _{_{S1 n-1, S2 n-}} 1, S3 n-1 each Results for P1 _n-1 are listed.
Similarly, the column of S1, S2 and S3 at the bottom of the next palindrome {3,3,3}, the first adding unit _{3 73,} each adding each of the palindromic calculation unit 73, the absolute value part _{4 73} is the sum of the absolute value by the absolute value _{_{S1 n-3, S2 n-}} 3, S3 n-3 outputs of each are shown.
In addition, in the column of P1 below the palindrome structure {3, 3, 3}, the second addition unit ₅₇₃ adds the absolute values S1 _n−3 , S2 _n−3 , and S3 _n−3 respectively. Results for P1 _n-3 are listed.

そして、次の回文構造｛３，５，３｝の下部にあるＳ１，Ｓ２及びＳ３の欄には、回文演算部７５における各第１の加算部３_７５各々が加算し、絶対値部４_７５がこの加算結果を絶対値化した絶対値Ｓ１_ｎ−５，Ｓ２_ｎ−５，Ｓ３_ｎ−５各々の出力が示されている。
また、回文構造｛３，５，３｝の下部にあるＰ１の欄には、第２の加算部５_７５が絶対値Ｓ１_ｎ−５，Ｓ２_ｎ−５，Ｓ３_ｎ−５各々を加算したＰ１_ｎ−５の結果が記載されている。
同様に、次の回文構造｛３，７，３｝の下部にあるＳ１，Ｓ２及びＳ３の欄には、回文演算部７７における各第１の加算部３_７７各々が加算し、絶対値部４_７７がこの加算結果を絶対値化した絶対値Ｓ１_ｎ−７，Ｓ２_ｎ−７，Ｓ３_ｎ−７各々の出力が示されている。
また、回文構造｛３，７，３｝の下部にあるＰ１の欄には、第２の加算部５_７７が絶対値Ｓ１_ｎ−７，Ｓ２_ｎ−７，Ｓ３_ｎ−７各々を加算したＰ１_ｎ−７の結果が記載されている。 Then, the column of S1, S2 and S3 at the bottom of the next palindrome {3,5,3}, the first adding unit _{3 75,} each adding each of the palindromic calculation unit 75, the absolute value unit Reference numeral 4 ₇₅ shows outputs of absolute values S1 _n-5 , S2 _n-5 , and S3 _{n-5 obtained} by converting the addition result into absolute values.
Further, in the column of P1 at the bottom of the palindrome {3,5,3}, the second adding unit _{5 75} obtained by adding the absolute value _{_{S1 n-5, S2 n-}} 5, S3 n-5 each Results for P1 _n-5 are listed.
Similarly, each of the first addition units 3 ₇₇ in the palindromic operation unit 77 adds to the columns of S1, S2, and S3 below the next palindrome structure {3, 7, 3}, and the absolute value part _{4 77} is the sum of the absolute value by the absolute value _{_{S1 n-7, S2 n-}} 7, S3 n-7 outputs of each are shown.
Further, in the column of P1 at the bottom of the palindrome {3,7,3}, the second adding unit _{5 77} obtained by adding the absolute value _{_{S1 n-7, S2 n-}} 7, S3 n-7 each Results for P1 _n-7 are listed.

以降同様に、回文構造｛３，９，３｝の解析を行う回文演算部７９、回文構造｛３，１１，３｝の解析を行う回文演算部７１１、回文構造｛３，１３，３｝の解析を行う回文演算部７１３、回文構造｛３，１５，３｝の解析を行う回文演算部７１５の計算結果が記載されている。
また、右端の各解析範囲のｑの値が記載されて解析範囲毎に対応した列は、塩基配列を１つずつずらして各解析結果Ｐ１が揃うように、配列し直したものである。
ｑ（表においてはｑ＝１５）を左から
ｑ０＝ｑｍａｘ−２・０＝１５−２・０＝１５
ｑ１＝ｑｍａｘ−２・１＝１５−２・１＝１３
ｑ２＝ｑｍａｘ−２・２＝１５−２・２＝１１
ｑ３＝ｑｍａｘ−２・３＝１５−２・３＝９
ｑ４＝ｑｍａｘ−２・４＝１５−２・４＝７
：
：
ｑｎ＝ｑｍａｘ−２・ｎ＝１５−２・ｎ
と定義して、「斜め線より回文構造把握」という文字列の下部の領域に、ｎ＝０及びｎ＝４に「０」が存在するため、
ｎ＝０：現在の場所を中心に回文構造｛３，１５，３｝
ｎ＝４：現在の場所の４個前を中心に回文構造｛３，７，３｝
という構造であることが分かる。
また、ｎ＝４：回文構造｛３，７，３｝、ｎ＝５：回文構造｛３，５，３｝のように、連続した奇数あるいは偶数に「０」が存在する場合、手前４個目の中心に回文構造｛４，５，４｝の回文構造が検出される。
この結果から、ＤＮＡがどのような回文構造を有するかを容易に検出することができる。
図９の例においては、４２番目のパターンにて、｛回文塩基、非回文塩基、回文塩基｝＝｛３，１５，３｝に対応する回文演算部７１５の解析範囲が回文構造を有する検出結果、すなわちＰ１_ｎ−１５＝０となっている。また、これより４塩基前の３８番目のパターンにて、｛回文塩基、非回文塩基、回文塩基｝＝｛３，７，３｝に対応する回文演算部７７の解析範囲が回文構造を有する検出結果、すなわちＰ１_ｎ−７＝０となっている。
図視しない回文パターン合成部は、１塩基ずつシフトして演算される結果において、回文構造が検出された解析範囲の検出結果を合わせることにより、回文構造が検出された最も塩基数の多い解析範囲の回文構造を検出することができる。
例えば、図９の例において、３８番目のパターンにてＰ１_ｎ−７＝０であり、４２番目のパターンにてＰ１_ｎ−１５＝０であることから、上記回文パターン合成部は、解析範囲が２１塩基にて図１２（ａ）に示すような３塩基対が２つある回文構造を検出する。 Thereafter, similarly, the palindromic operation unit 79 for analyzing the palindrome structure {3, 9, 3}, the palindromic operation unit 711 for analyzing the palindrome structure {3, 11, 3}, the palindrome structure {3, 13 and 3}, and the results of the palindromic operation unit 715 that analyzes the palindrome structure {3, 15, 3} are described.
In addition, the column corresponding to each analysis range in which the q value of each analysis range at the right end is described is rearranged so that each analysis result P1 is aligned by shifting the base sequence one by one.
q (q = 15 in the table) from the left q0 = qmax−2 · 0 = 15−2 · 0 = 15
q1 = qmax−2 · 1 = 15−2 · 1 = 13
q2 = qmax−2 · 2 = 15−2 · 2 = 11
q3 = qmax−2 · 3 = 15−2 · 3 = 9
q4 = qmax−2 · 4 = 15−2 · 4 = 7
:
:
qn = qmax-2 · n = 15-2 · n
Since “= 0” exists in n = 0 and n = 4 in the lower area of the character string “understand palindromic structure from diagonal lines”,
n = 0: palindrome structure around the current location {3, 15, 3}
n = 4: Palindrome structure {3, 7, 3} around the current location
It can be seen that this structure.
Also, when “0” exists in consecutive odd numbers or even numbers, such as n = 4: palindrome structure {3, 7, 3}, n = 5: palindrome structure {3, 5, 3}, The palindrome structure of the palindrome structure {4, 5, 4} is detected at the fourth center.
From this result, it is possible to easily detect what palindromic structure the DNA has.
In the example of FIG. 9, in the 42nd pattern, the analysis range of the palindromic operation unit 715 corresponding to {palindromic base, non palindromic base, palindromic base} = {3, 15, 3} is the palindrome. The detection result having the structure, that is, P1 _n-15 = 0. In addition, the analysis range of the palindromic operation unit 77 corresponding to {palindromic base, non-pastronomic base, palindromic base} = {3, 7, 3} in the 38th pattern 4 bases before this is The detection result having the sentence structure, that is, P1 _n-7 = 0.
The palindrome pattern synthesis unit not shown in the figure combines the detection results of the analysis range in which the palindrome structure is detected in the result calculated by shifting one base at a time. It is possible to detect palindrome structures in many analysis ranges.
For example, in the example of FIG. 9, P1 _n−7 = 0 in the 38th pattern and P1 _n−15 = 0 in the 42nd pattern. Detects a palindrome structure with two 3 base pairs as shown in FIG.

また、図９の例においては、５０番目のパターンにて、｛回文塩基、非回文塩基、回文塩基｝＝｛３，７，３｝に対応する回文演算部７７の解析範囲が回文構造を有する検出結果、すなわちＰ１_ｎ−７＝０となっている。また、これより１塩基前の４９番目のパターンにて、｛回文塩基、非回文塩基、回文塩基｝＝｛３，５，３｝に対応する回文演算部７５の解析範囲が回文構造を有する検出結果、すなわちＰ１_ｎ−５＝０となっている。
図視しない回文パターン合成部は、上述したように、１塩基ずつシフトして演算される結果において、回文構造が検出された解析範囲の検出結果を合わせることにより、回文構造が検出された最も塩基数の多い解析範囲の回文構造を検出することができる。
例えば、図９の例において、４９番目のパターンにてＰ１_ｎ−５＝０であり、５０番目のパターンにてＰ１_ｎ−７＝０であることから、上記回文パターン合成部は、解析範囲が１３塩基にて図１２（ｂ）に示すような４塩基対が１つある回文構造を検出する。 In the example of FIG. 9, the analysis range of the palindromic operation unit 77 corresponding to {palindromic base, non palindromic base, palindromic base} = {3, 7, 3} in the 50th pattern is The detection result having the palindrome structure, that is, P1 _n-7 = 0. In addition, in the 49th pattern one base before this, the analysis range of the palindromic operation unit 75 corresponding to {palindromic base, non-palatomic base, palindromic base} = {3, 5, 3} is The detection result having the sentence structure, that is, P1 _n-5 = 0.
As described above, the palindrome pattern synthesis unit not shown in the figure detects the palindrome structure by combining the detection results of the analysis range in which the palindrome structure is detected in the result calculated by shifting one base at a time. The palindrome structure in the analysis range with the largest number of bases can be detected.
For example, in the example of FIG. 9, P1 _n-5 = 0 in the 49th pattern and P1 _n-7 = 0 in the 50th pattern. Detects a palindrome structure with 13 bases and one 4-base pair as shown in FIG.

図９における斜め線Ｚは１塩基前と合成できるか否かを検出するため、上記回文パターン合成部が各解析範囲において、ゲノム配列における塩基の番号の位置にてＰ１＝０であるか否かを検出する走査順を示したものである。そして、回文パターン合成部は、この斜め線Ｚを走査する検出処理において、各回文演算部の演算結果にて、各々２つ解析範囲の少ない回文演算部の１塩基前の演算結果を合成処理することにより、中央部分の塩基の番号が一致する解析範囲を合成することができ、上述したように、回文構造の検出された最も大きな解析範囲における塩基の回文構造を検出して出力する。
上述したように、図９，１０，１１において、塩基名を、ステムループ構造にて塩基対を形成する塩基対に同一の数字を付し、相補的な関係にある塩基同士にて極性の異なる符号を付すことにより、利用者が符号付きの数字のパターンを見ることにより、塩基名とその相補的な関係が判らなくとも、ビジュアル（視覚）的に回文パターンが確認でき、回文構造を検出する演算が容易となることが判る。 In order to detect whether or not the diagonal line Z in FIG. 9 can be synthesized with the previous base, whether or not P1 = 0 at the position of the base number in the genome sequence in each analysis range by the palindromic pattern synthesis unit. This shows the scanning order for detecting these. Then, in the detection process of scanning the diagonal line Z, the palindrome pattern synthesis unit synthesizes the calculation results of the palindrome calculation unit one base before the analysis result of each of the palindrome calculation units with two smaller analysis ranges. By processing, it is possible to synthesize the analysis range where the base number of the central part matches, and as described above, detect and output the palindrome structure of the base in the largest analysis range where the palindrome structure is detected To do.
As described above, in FIGS. 9, 10, and 11, base names are given the same numbers to the base pairs that form base pairs in the stem-loop structure, and the bases that are in a complementary relationship have different polarities. By attaching a sign, the user can see the palindrome pattern visually and visually, without seeing the base name and its complementary relationship, by looking at the signed number pattern. It turns out that the operation to detect becomes easy.

図９は、奇数回文演算部による奇数の塩基数の解析範囲における解析について述べたが、偶数の塩基数の解析範囲における解析も同様に行うことができる。
次に、図１４，図１５及び図１６を参照して第２の実施形態によるゲノム配列における回文構造検出システムの動作を説明する。図１４は回文構造システムの塩基数が偶数の解析範囲における回文構造を検出する動作を示す回文構造パターン生成の全体を示す概念図である。図１５は図１４における領域Ａの拡大図であり、図１６は図１４における領域Ｂの拡大図である。この図１４は偶数回文演算部により演算された偶数の塩基数の解析範囲における回文構造を検出した結果を示す回文構造パターン生成に関する概念図である。
上記図１４は、図９と同様に、最も左の欄がゲノム配列における塩基の順番を示し、次の２つの欄にそれぞれの番号に対応した核酸塩基名および部分的に有意なコドン名が記載されている。核酸分類数（符号付き数字）の欄は、いずれの列も同様であり、塩基／数字変換部１により左の欄の各塩基が変換された符号付き数字（核酸分類数）が記載されている。すなわち、図１４の左すみから、ＡＴＧＣの塩基名と、この塩基名ＡＴＧＣに対応した＋１、−１、＋２、−２の符号付き数字が示されている。 Although FIG. 9 describes the analysis in the analysis range of the odd base number by the odd palindromic operation unit, the analysis in the analysis range of the even base number can be performed in the same manner.
Next, the operation of the palindrome structure detection system in the genome sequence according to the second embodiment will be described with reference to FIGS. 14, 15 and 16. FIG. 14 is a conceptual diagram showing the entire palindrome structure pattern generation showing the operation of detecting the palindrome structure in the analysis range where the number of bases of the palindrome structure system is an even number. 15 is an enlarged view of region A in FIG. 14, and FIG. 16 is an enlarged view of region B in FIG. FIG. 14 is a conceptual diagram relating to palindrome structure pattern generation showing the result of detecting the palindrome structure in the analysis range of the even number of bases calculated by the even palindrome calculation unit.
In FIG. 14, as in FIG. 9, the leftmost column indicates the order of bases in the genome sequence, and the next two columns describe the nucleobase names corresponding to the respective numbers and partially significant codon names. Has been. The column of the number of nucleic acid classification (number with a sign) is the same in any column, and the number with a sign (number of nucleic acid classification) in which each base in the left column is converted by the base / number conversion unit 1 is described. . That is, from the left corner of FIG. 14, the base name of ATGC and the numbers with signs of +1, −1, +2, and −2 corresponding to this base name ATGC are shown.

次の回文構造｛３，０，３｝の下部にあるＳ１，Ｓ２及びＳ３の欄には、回文演算部７０における各第１の加算部３_７０各々が加算し、絶対値部４_７０がこの加算結果を絶対値化した絶対値Ｓ１_ｎ，Ｓ２_ｎ，Ｓ３_ｎ各々の出力が示されている。
また、回文構造｛３，０，３｝の下部にあるＰ１の欄には、第２の加算部５_７０が絶対値Ｓ１_ｎ，Ｓ２_ｎ，Ｓ３_ｎ各々を加算したＰ１_ｎの結果が記載されている。
同様に、次の回文構造｛３，２，３｝の下部にあるＳ１，Ｓ２及びＳ３の欄には、回文演算部７２における各第１の加算部３_７２各々が加算し、絶対値部４_７２がこの加算結果を絶対値化した絶対値Ｓ１_ｎ−２，Ｓ２_ｎ−２，Ｓ３_ｎ−２各々の出力が示されている。
また、回文構造｛３，２，３｝の下部にあるＰ１の欄には、第２の加算部５_７２が絶対値Ｓ１_ｎ−２，Ｓ２_ｎ−２，Ｓ３_ｎ−２各々を加算したＰ１_ｎ−２の結果が記載されている。 In the column of the next palindrome S1 at the bottom of the structure {3,0,3}, S2 and S3, first adding unit _{3 70,} each adding each of the palindromic calculation unit 70, the absolute value unit _{4 70} The outputs of the absolute values S1 _n , S2 _n and S3 _{n obtained} by converting the addition result into absolute values are shown.
Further, in the column of P1 at the bottom of the palindrome {3,0,3}, the second adding unit _{5 70} absolute value _S1 _{_n,} S2 _n, S3 n each obtained by adding P1 _n results described Has been.
Similarly, the column of a certain S1, S2 and S3 at the bottom of the next palindrome {3,2,3}, the first adding unit _{3 72,} each adding each of the palindromic calculation unit 72, the absolute value part _{4 72} is the sum of the absolute value by the absolute value _{_{S1 n-2, S2 n-}} 2, S3 n-2 output of each is shown.
Further, in the column of P1 at the bottom of the palindrome {3,2,3}, the second adding unit _{5 72} adds the absolute value _{_{S1 n-2, S2 n-}} 2, S3 n-2 , respectively Results for P1 _n-2 are listed.

そして、次の回文構造｛３，４，３｝の下部にあるＳ１，Ｓ２及びＳ３の欄には、回文演算部７４における各第１の加算部３_７４各々が加算し、絶対値部４_７４がこの加算結果を絶対値化した絶対値Ｓ１_ｎ−４，Ｓ２_ｎ−４，Ｓ３_ｎ−４各々の出力が示されている。
また、回文構造｛３，４，３｝の下部にあるＰ１の欄には、第２の加算部５_７４が絶対値Ｓ１_ｎ−４，Ｓ２_ｎ−４，Ｓ３_ｎ−４各々を加算したＰ１_ｎ−４の結果が記載されている。
同様に、次の回文構造｛３，６，３｝の下部にあるＳ１，Ｓ２及びＳ３の欄には、回文演算部７６における各第１の加算部３_７６各々が加算し、絶対値部４_７６がこの加算結果を絶対値化した絶対値Ｓ１_ｎ−６，Ｓ２_ｎ−６，Ｓ３_ｎ−６各々の出力が示されている。
また、回文構造｛３，６，３｝の下部にあるＰ１の欄には、第２の加算部５_７６が絶対値Ｓ１_ｎ−６，Ｓ２_ｎ−６，Ｓ３_ｎ−６各々を加算したＰ１_ｎ−６の結果が記載されている。 Then, in the column of a S1, S2 and S3 at the bottom of the next palindrome {3,4,3}, the first adding unit _{3 74,} each adding each of the palindromic calculation unit 74, the absolute value unit 4 ₇₄ is the sum of the absolute value by the absolute value _{_{S1 n-4, S2 n-}} 4, S3 n-4 outputs of each are shown.
Further, the P1 section of the bottom of the palindrome {3,4,3}, the second adding unit _{5 74} adds the absolute value _{_{S1 n-4, S2 n-}} 4, S3 n-4 , respectively P1 _n-4 results are listed.
Similarly, the column of S1, S2 and S3 at the bottom of the next palindrome {3,6,3}, the first adding unit _{3 76,} each adding each of the palindromic calculation unit 76, the absolute value part _{4 76} is the sum of the absolute value by the absolute value _{_{S1 n-6, S2 n-}} 6, S3 n-6 output of each is shown.
Further, the P1 section of the bottom of the palindrome {3,6,3}, the second adding unit _{5 76} adds the absolute value _{_{S1 n-6, S2 n-}} 6, S3 n-6 , respectively Results for P1 _n-6 are listed.

以降同様に、回文構造｛３，８，３｝の解析を行う回文演算部７８、回文構造｛３，１０，３｝の解析を行う回文演算部７１０、回文構造｛３，１２，３｝の解析を行う回文演算部７１２、回文構造｛３，１４，３｝の解析を行う回文演算部７１４の計算結果が記載されている。
図１４の例においては、３９番目のパターンにて、｛回文塩基、非回文塩基、回文塩基｝＝｛３，４，３｝に対応する回文演算部７４の解析範囲が回文構造を有する検出結果、すなわちＰ１_ｎ−４＝０となっている。 Thereafter, similarly, the palindromic operation unit 78 that analyzes the palindrome structure {3, 8, 3}, the palindrome operation unit 710 that analyzes the palindrome structure {3, 10, 3}, the palindrome structure {3, 12 and 3}, and the results of the palindrome operation unit 714 that analyzes the palindrome structure {3, 14, 3} are described.
In the example of FIG. 14, in the 39th pattern, the analysis range of the palindromic operation unit 74 corresponding to {palindromic base, non palindromic base, palindromic base} = {3,4,3} is the palindrome. The detection result having the structure, that is, P1 _n-4 = 0.

また、５０番目のパターンにて、｛回文塩基、非回文塩基、回文塩基｝＝｛３，８，３｝に対応する回文演算部７８の解析範囲が回文構造を有する検出結果、すなわちＰ１ｎ−８＝０となっている。
図視しない回文パターン合成部は、すでに述べた解析範囲が奇数の塩基数である場合と同様に、１塩基ずつシフトして演算される結果において、回文構造が検出された解析範囲の検出結果を合わせることにより、回文構造が検出された最も塩基数の多い解析範囲の回文構造を検出することができる。
例えば、図１４の例において、３９番目のパターンにてＰ１_ｎ−４＝０であることから、上記回文パターン合成部は、解析範囲が１０塩基にて図１３（ａ）に示すような３塩基対が１つある回文構造を検出する。
また、５０番目のパターンにて、Ｐ１_ｎ−８＝０であることから、上記回文パターン合成部は、解析範囲が１４塩基にて図１３（ｂ）に示すような３塩基対が１つある回文構造を検出する。 In addition, in the 50th pattern, a detection result in which the analysis range of the palindromic operation unit 78 corresponding to {palindromic base, non palindromic base, palindromic base} = {3, 8, 3} has a palindromic structure That is, P1n-8 = 0.
The palindrome pattern synthesizer (not shown) detects the analysis range in which the palindrome structure is detected in the result calculated by shifting one base at a time, as in the case where the analysis range already described is an odd number of bases. By combining the results, the palindrome structure in the analysis range with the largest number of bases in which the palindrome structure is detected can be detected.
For example, in the example of FIG. 14, since P1 _n-4 = 0 in the 39th pattern, the palindrome pattern synthesis unit has an analysis range of 10 bases as shown in FIG. A palindrome structure with one base pair is detected.
In addition, since P1 _n-8 = 0 in the 50th pattern, the palindromic pattern synthesis unit has an analysis range of 14 bases and one 3 base pair as shown in FIG. 13B. Detect a palindrome structure.

上述したように、図９，１０，１１と同様に、図１４，１５，１６において塩基名を、ステムループ構造にて塩基対を形成する塩基対に同一の数字を付し、相補的な関係にある塩基同士にて極性の異なる符号を付すことにより、ビジュアル的に回文パターンが確認でき、回文構造を検出する演算が容易となることが判る。
また、図１４における斜め線Ｚは、図９と同様に、１塩基前と合成できるか否かを検出するため、上記回文パターン合成部が各解析範囲において、ゲノム配列における塩基の番号の位置にてＰ１＝０であるか否かを検出する走査順を示したものである。そして、回文パターン合成部は、この斜め線Ｚを走査する検出処理において、各回文演算部の演算結果にて、各々２つ解析範囲の少ない回文演算部の１塩基前の演算結果を合成処理することにより、中央部分の塩基の番号が一致する解析範囲を合成することができ、上述したように、回文構造の検出された最も大きな解析範囲における塩基の回文構造を検出して出力する。 As described above, in the same manner as in FIGS. 9, 10, and 11, the base names are the same in FIGS. 14, 15, and 16, and the same numbers are assigned to the base pairs that form the base pairs in the stem-loop structure. It can be seen that by adding signs having different polarities between the bases in FIG. 4, the palindrome pattern can be visually confirmed, and the operation for detecting the palindrome structure becomes easy.
In addition, the diagonal line Z in FIG. 14 indicates whether or not the palindromic pattern synthesizer can analyze the base number position in the genome sequence in each analysis range in order to detect whether or not it can be synthesized with the previous base as in FIG. The scanning order for detecting whether or not P1 = 0 is shown. Then, in the detection process of scanning the diagonal line Z, the palindrome pattern synthesis unit synthesizes the calculation results of the palindrome calculation unit one base before the analysis result of each of the palindrome calculation units with two smaller analysis ranges. By processing, it is possible to synthesize the analysis range where the base number of the central part matches, and as described above, detect and output the palindrome structure of the base in the largest analysis range where the palindrome structure is detected To do.

そして、回文パターン合成部は、上述した偶数及び奇数の解析範囲における組み合わせにより、ゲノム配列における回文構造を検出して出力する。
なお、回文パターン合成部は、ゲノム配列における塩基の順番を示す番号により、回文構造が検出された解析範囲の存在する位置を検出することができるため、回文構造の位置を容易に出力することができる。
すなわち、回文パターン合成部は、ゲノム配列における回文構造の解析が進行する方向において、解析範囲の最後の塩基に対応する番号にて、その解析範囲に回文構造の有無を検出して、その番号に第２の加算回路の結果Ｐ１を順次記入していくことにより、各解析範囲に対応してその回文構造の有無の結果を記入することとなるため、いずれの非回文塩基の塩基数を有する解析範囲が、ゲノム配列のどの位置に存在するかを容易に確認することができる。 Then, the palindrome pattern synthesis unit detects and outputs the palindrome structure in the genome sequence by the combination in the even and odd analysis ranges described above.
The palindrome pattern synthesis unit can easily detect the position of the palindrome structure by detecting the position of the analysis range where the palindrome structure is detected by the number indicating the base order in the genome sequence. can do.
That is, the palindrome pattern synthesis unit detects the presence or absence of the palindrome structure in the analysis range with the number corresponding to the last base of the analysis range in the direction in which the analysis of the palindrome structure in the genome sequence proceeds, By sequentially entering the result P1 of the second adder circuit in the number, the result of the presence or absence of the palindrome structure corresponding to each analysis range is entered. It can be easily confirmed at which position in the genome sequence the analysis range having the number of bases is present.

＜回文構造検出システムのハードウェア化＞
上述したような構成は、パーソナルコンピュータにおける表計算ソフトが有する演算機能を上述した各機能として実現することができるが、ＤＮＡのゲノム配列が巨大な塩基配列であるため、大量の塩基配列のデータを高速に解析するためには、簡易な回路を用いてハードウェアにて解析することが考えられる。
このため、２つの塩基に対応する符号付き数字を加算する第１の加算回路と、その加算結果を絶対値化する絶対値部とを１つの演算ユニットとして図１７に示す回路（１６塩基の塩基配列単位にＤＮＡの配列における回文構造を検出、例えば、第１の実施形態に対応して、３つの塩基対を有する回文構造｛３，０，３｝〜回文構造｛３，１０，３｝の回文構造を検出する構成の回路）により構成する。 <Hardware construction of palindrome structure detection system>
The configuration as described above can realize the calculation function of the spreadsheet software in the personal computer as each function described above. However, since the DNA genome sequence is a huge base sequence, a large amount of base sequence data can be obtained. In order to analyze at high speed, it is conceivable to perform analysis with hardware using a simple circuit.
For this reason, the circuit shown in FIG. 17 (16 base bases) includes a first addition circuit that adds signed numbers corresponding to two bases and an absolute value part that converts the addition result into an absolute value as one arithmetic unit. The palindrome structure in the DNA sequence is detected in the sequence unit. For example, corresponding to the first embodiment, the palindrome structure {3, 0, 3} having three base pairs to the palindrome structure {3, 10, 3} circuit configured to detect the palindrome structure.

ここで、符号付き数字は２進数のビット単位の演算となるため、塩基Ａ，Ｇ，Ｃ，Ｔ（Ｕ）それぞれを、２進数で示すデータとして［符号，最上位ビット，最下位ビット］で表す。ここで、例えば、符号は、「＋」の場合「０」とし、「−」の場合「１」とする。
したがって、塩基Ａは「＋１」であるため、［符号，最上位ビット（２１），最下位ビット（２０）］＝［０，０，１］と表され、塩基Ｔ（Ｕ）は「−１」であるため、［符号，最上位ビット，最下位ビット］＝［１，０，１］と表され、塩基Ｇは「＋２」であるため、［符号ビット，最上位ビット，最下位ビット］＝［０，１，０］と表され、塩基Ｔ（Ｕ）は「−２」であるため、［符号ビット，最上位ビット，最下位ビット］＝［１，１，０］と表される。
この塩基毎の２進数へのデータ変換は、上述した塩基／数字変換部１が行う。 Here, since the number with a sign is a binary bit unit operation, each of the bases A, G, C, and T (U) is represented by a binary number as [sign, most significant bit, least significant bit]. Represent. Here, for example, the sign is “0” for “+” and “1” for “−”.
Therefore, since the base A is “+1”, it is expressed as [sign, most significant bit (21), least significant bit (20)] = [0, 0, 1], and the base T (U) is “−1”. Therefore, [sign, most significant bit, least significant bit] = [1, 0, 1] and base G is “+2”, so [sign bit, most significant bit, least significant bit] = [0, 1, 0], and the base T (U) is “−2”, so [sign bit, most significant bit, least significant bit] = [1, 1, 0]. .
Data conversion into binary numbers for each base is performed by the base / number conversion unit 1 described above.

したがって、図１７の回路図に示すように、上記演算ユニットは、２つの塩基各々に対応する符号付き数字を示すビット列である［符号ビット，最上位ビット，最下位ビット］におけるそれぞれのビットの演算を行うため、符号の演算を行うＥＸＮＯＲ１０１と、最上位ビット（２ビット目）の演算を行うＥＸＯＲ１０２と、最下位ビット（１ビット目）の演算を行うＥＸＯＲ１０３と、ＥＸＮＯＲ１０１，ＥＸＯＲ１０２及びＥＸＯＲ１０３の演算結果で、演算した塩基同士が相補的であるか否かを演算するＯＲ１０４とから構成されている。 Therefore, as shown in the circuit diagram of FIG. 17, the arithmetic unit calculates each bit in [sign bit, most significant bit, least significant bit] which is a bit string indicating a signed number corresponding to each of the two bases. Therefore, EXNOR101 for calculating the sign, EXOR102 for calculating the most significant bit (second bit), EXOR103 for calculating the least significant bit (first bit), and the results of EXNOR101, EXOR102, and EXOR103 The OR 104 calculates whether or not the calculated bases are complementary.

これにより、例えば、図１７において、塩基Ａ_ｎ−５に対応する符号付き数字と、塩基Ａｎに対応する符号付き数字とを加算した結果の絶対値Ｓが「０」か「１」として得られる。
例として、塩基Ａ_ｎ−５が「−２（塩基Ｃ）」であり、２進数で示した符号付き数値が［符号ビット，最上位ビット，最下位ビット］＝［１，１，０］とし、塩基Ａｎが「＋２（塩基Ｇ）」であると、２進数で示した符号付き数値が［符号ビット，最上位ビット，最下位ビット］＝［０，１，０］とされる。
この場合、最下位ビットが「０」と「０」とで同一のため、ＥＸＯＲ１０３は「０」を出力し、最上位ビットが「１」と「１」とで同一のため、ＥＸＯＲ１０２は「０」を出力し、符号ビットが「１」と「０」とで異なるため、ＥＸＮＯＲ１０１は「０」を出力する。
そして、ＯＲ１０４は、入力されるデータが全て「０」であるため、結果としての絶対値Ｓを「０」として出力する。これにより、塩基Ａ_ｎ−５と塩基Ａ_ｎとがステムループ構造にて塩基対を形成する相補的な関係にある塩基同士であることが検出されることとなる。 Thereby, for example, in FIG. 17, the absolute value S as a result of adding the signed number corresponding to the base An _-5 and the signed number corresponding to the base An is obtained as “0” or “1”. .
As an example, the base _An-5 is “−2 (base C)”, and the signed numeric value expressed in binary number is [sign bit, most significant bit, least significant bit] = [1, 1, 0]. When the base An is “+2 (base G)”, the signed numerical value represented by the binary number is [sign bit, most significant bit, least significant bit] = [0, 1, 0].
In this case, since the least significant bit is the same between “0” and “0”, the EXOR 103 outputs “0”, and since the most significant bit is the same between “1” and “1”, the EXOR 102 is “0”. , And the sign bit differs between “1” and “0”, so the EXNOR 101 outputs “0”.
Then, since all the input data is “0”, the OR 104 outputs the resulting absolute value S as “0”. Thereby, it is detected that the base _An-5 and the base _An are bases in a complementary relationship forming a base pair in the stem loop structure.

また、塩基Ａ_ｎ−５が「＋２（塩基Ｇ）」であり、２進数で示した符号付き数値が［符号ビット，最上位ビット，最下位ビット］＝［０，１，０］とし、塩基Ａ_ｎが同様に「＋２（塩基Ｇ）」であると、２進数で示した符号付き数値が［符号ビット，最上位ビット，最下位ビット］＝［０，１，０］とされる。
この場合、最下位ビットが「０」と「０」とで同一のため、ＥＸＯＲ１０３は「０」を出力し、最上位ビットが「１」と「１」とで同一のため、ＥＸＯＲ１０２は「０」を出力し、符号ビットが「０」と「０」とで同一のため、ＥＸＮＯＲ１０１は「１」を出力する。
そして、ＯＲ１０４は、入力されるデータに「１」が含まれるため、結果としての絶対値Ｓを「１」として出力する。これにより、塩基Ａ_ｎ−５と塩基Ａ_ｎとがステムループ構造にて塩基対を形成しない相補的な関係に無い塩基同士であることが検出されることとなる。 In addition, base _An-5 is “+2 (base G)”, and a signed numerical value represented by a binary number is [sign bit, most significant bit, least significant bit] = [0, 1, 0] If it is similarly is a _n "+2 (base G)", signed numerical values shown in binary is a [sign bit, the most significant bit, the least significant bit] = [0,1,0].
In this case, since the least significant bit is the same between “0” and “0”, the EXOR 103 outputs “0”, and since the most significant bit is the same between “1” and “1”, the EXOR 102 is “0”. Since the sign bit is the same between “0” and “0”, the EXNOR 101 outputs “1”.
Then, since “1” is included in the input data, the OR 104 outputs the resulting absolute value S as “1”. As a result, it is detected that the base _An-5 and the base _An are bases that are not in a complementary relationship that does not form a base pair in the stem loop structure.

また、塩基Ａ_ｎ−５が「＋１（塩基Ａ）」であり、２進数で示した符号付き数値が［符号ビット，最上位ビット，最下位ビット］＝［０，０，１］とし、塩基Ａ_ｎが「−２（塩基Ｃ）」であると、２進数で示した符号付き数値が［符号ビット，最上位ビット，最下位ビット］＝［１，１，０］とされる。
この場合、最下位ビットが「１」と「０」とで異なるため、ＥＸＯＲ１０３は「１」を出力し、最上位ビットが「０」と「１」とで異なるため、ＥＸＯＲ１０２は「１」を出力し、符号ビットが「０」と「１」とで異なるため、ＥＸＮＯＲ１０１は「０」を出力する。
そして、ＯＲ１０４は、入力されるデータに「１」が含まれるため、結果としての絶対値Ｓを「１」として出力する。これにより、塩基Ａ_ｎ−５と塩基Ａ_ｎとがステムループ構造にて塩基対を形成しない相補的な関係に無い塩基同士であることが検出されることとなる。 In addition, base A _n-5 is “+1 (base A)”, and a signed numerical value represented by a binary number is [sign bit, most significant bit, least significant bit] = [0, 0, 1] _{If An} is “−2 (base C)”, the signed numerical value represented by a binary number is [sign bit, most significant bit, least significant bit] = [1, 1, 0].
In this case, since the least significant bit differs between “1” and “0”, EXOR103 outputs “1”, and since the most significant bit differs between “0” and “1”, EXOR102 changes “1”. Since the sign bit is different between “0” and “1”, the EXNOR 101 outputs “0”.
Then, since “1” is included in the input data, the OR 104 outputs the resulting absolute value S as “1”. As a result, it is detected that the base _An-5 and the base _An are bases that are not in a complementary relationship that does not form a base pair in the stem loop structure.

そして、図４における回文計算ユニットを、上記演算ユニットを３個と、第２の加算部に対応するＯＲ２００により、図１８に示すように構成する。これは３塩基以上並んだ回文構造を把握する回文計算ユニットである。
図１８は図４における偶数回文演算部である回文演算部７０，７２，７４，７６に対応する構成を示すものである。
回文演算部７０，７２，７４，７６各々には、それぞれの解析範囲に対応する回文構造｛３，０，３｝、回文構造｛３，２，３｝、回文構造｛３，４，３｝、回文構造｛３，６，３｝のゲノム配列における塩基に対応した符号付き数字のデータが入力されている。 The palindromic calculation unit in FIG. 4 is configured as shown in FIG. 18 with three arithmetic units and an OR 200 corresponding to the second addition unit. This is a palindrome calculation unit that grasps the palindrome structure in which three or more bases are arranged.
FIG. 18 shows a configuration corresponding to the palindromic operation units 70, 72, 74, and 76 that are even palindromic operation units in FIG.
In each palindromic operation unit 70, 72, 74, 76, the palindrome structure {3, 0, 3}, palindrome structure {3, 2, 3}, palindrome structure {3, corresponding to the respective analysis range 4, 3} and signed numeric data corresponding to the bases in the palindromic structure {3, 6, 3} genome sequence are input.

これにより、各回文演算部（７０，７２，７４，７６）においては、解析範囲における解析対象の塩基対同士が全て相補的である場合に「０」を出力し、いずれかが相補的でない場合に「１」を出力することとなる。
すなわち、上記回文演算部を構成する３つの演算ユニット１００の全てが「０」を出力している場合、ＯＲ２００は「０」を出力し、上記回文演算部を構成する３つの演算ユニット１００のいずれかが「１」を出力している場合、ＯＲ２００は「１」を出力する。
上述においては、回文計算ユニットを偶数回文演算部として説明したが、奇数回文演算部においても同様の構成である。 As a result, each palindromic operation unit (70, 72, 74, 76) outputs "0" when the base pairs to be analyzed in the analysis range are all complementary, and any of them is not complementary “1” is output to the terminal.
That is, when all of the three arithmetic units 100 constituting the palindromic operation unit output “0”, the OR 200 outputs “0”, and the three arithmetic units 100 constituting the palindromic operation unit. OR 200 outputs “1” when any of the above outputs “1”.
In the above description, the palindrome calculation unit has been described as an even palindrome operation unit, but the odd palindrome operation unit has the same configuration.

そして、第２の実施形態において説明したように、この回文計算ユニットを使って計算させた結果の時間、すなわちゲノム配列における塩基の順番番号をずらしてレジスタに書き込むことにより、各解析範囲における回文構造を把握することができる。
図１９の回路ブロックの構成例を用いて、ハードウェアにて実現した第２の実施形態の回文構造検出システムの説明を行う。この図１９の回路ブロックは非回文の塩基数が偶数の場合の回文構造検出の部分を示している。図１９の回路ブロックと同様の構成にて非回文の塩基数が奇数の場合の回文構造検出の部分を構成し、非回文の塩基数が偶数の場合の回文構造検出の部分と合わせることにより、本実施形態による回文構造検出システムが構成されることとなる。 Then, as described in the second embodiment, the time of the result calculated using this palindromic calculation unit, that is, the sequence number of the base in the genome sequence is shifted and written to the register, so that the rotation in each analysis range is performed. The sentence structure can be grasped.
The palindrome structure detection system according to the second embodiment realized by hardware will be described using the configuration example of the circuit block of FIG. The circuit block of FIG. 19 shows a palindromic structure detection portion when the number of non-palindromic bases is an even number. The configuration of the palindromic structure detection when the number of non-palindromic bases is an odd number in the same configuration as the circuit block of FIG. By combining them, the palindrome structure detection system according to the present embodiment is configured.

図１９において、データ入力シフトレジスタは、塩基／数字変換部１がゲノム配列が塩基毎に対応する符号付き数字のデータ［符号ビット，最上位ビット，最下位ビット］を３ビット単位にて、設定された周期Ｔにより、順次方向Ｑへ転送する。このとき、データ入力シフトレジスタの各レジスタは、それぞれ転送する符号付きデータを回文計算ユニットへ出力する。
回文計算ユニット（加算結果Ｐ１を計算する単位）群は塩基対生成部２と各解析範囲に対応する回文演算部とから構成されており、上記周期Ｔ毎に入力される上記符号付きデータを基に、解析範囲に対応した回文演算部毎にその解析結果である加算結果Ｐ１をシフトレジスタ群を介して回文レジスタに対して出力する。
シフトレジスタ群は、各解析範囲に対応した回文演算部毎の出力と回文レジスタの入力との間に設けられたシフトレジスタから構成されている。
上記シフトレジスタ群における各シフトレジスタは、１ビットの加算結果Ｐ１を上記周期Ｔ毎にデータ入力シフトレジスタのデータ転送のタイミングに同期させ、解析範囲の塩基数が２つ少なくなる毎に、回文レジスタにおける解析タイミングを合わせるため、段数が１つずつ増加して（タイミングを遅延させるため）構成されている。
図１９の例としては、解析範囲の塩基数が１６，１４，１２，１０，８，６となるに従い、回文計算ユニット群から回文レジスタへの間のシフトレジスタにおけるレジスタ数が０，１，２，３，４，５と増加している。 In FIG. 19, in the data input shift register, the base / number conversion unit 1 sets signed numeric data [sign bit, most significant bit, least significant bit] corresponding to the genome sequence for each base in units of 3 bits. The data is sequentially transferred in the direction Q by the cycle T. At this time, each register of the data input shift register outputs signed data to be transferred to the palindromic computation unit.
A palindromic calculation unit (unit for calculating the addition result P1) is composed of a base pair generation unit 2 and a palindromic operation unit corresponding to each analysis range, and the signed data input at each cycle T. Based on the above, for each palindromic operation unit corresponding to the analysis range, the addition result P1 which is the analysis result is output to the palindrome register via the shift register group.
The shift register group includes shift registers provided between the output of each palindromic operation unit corresponding to each analysis range and the input of the palindrome register.
Each shift register in the shift register group synchronizes the 1-bit addition result P1 with the data transfer timing of the data input shift register every cycle T, and each time the number of bases in the analysis range decreases by two, the palindrome In order to match the analysis timing in the register, the number of stages is increased by one (to delay the timing).
In the example of FIG. 19, as the number of bases in the analysis range becomes 16, 14, 12, 10, 8, and 6, the number of registers in the shift register between the palindromic calculation unit group and the palindrome register is 0,1. , 2, 3, 4 and 5.

上述したシフトレジスタ群の構成により、各回文演算部の演算結果にて、各々２つ解析範囲の少ない回文演算部の１塩基前の演算結果各々を合成処理するために必要な、各解析範囲の加算結果Ｐ１を回文レジスタに同時に出力することができ、図示しない回文パターン合成部がこの回文レジスタを参照することにより、出力中央部分の塩基の番号が一致する解析範囲を順次合成することができ、すでに第２の実施形態の説明にて述べたように、回文構造の検出された最も大きな解析範囲における塩基の回文構造を検出して出力する。
例えば、１００塩基を解析範囲とすると、すでに述べたように、１００塩基回文構造を検出することができる。
この場合、回文構造｛回文，非回文，回文｝＝｛３，９４，３｝の解析範囲を用いて回文構造を検出することとなるため、データ入力シフトレジスタとしては、３ビット単位（符号ビット、最上位ビット及び最下位ビット）にて並列にビット転送を行う１００段のシフトレジスタが必要である。
また、シフトレジスタ群として、以下の（１）式にて求められる数のレジスタが必要となる。 With the above-described configuration of the shift register group, each analysis range necessary for synthesizing each of the calculation results one base before the palindrome calculation unit with a small analysis range in each calculation result of each palindrome calculation unit Result P1 can be simultaneously output to the palindrome register, and an unillustrated palindrome pattern synthesizer refers to this palindrome register to sequentially synthesize analysis ranges having the same base number in the output center portion. As described in the explanation of the second embodiment, the palindrome structure of the base in the largest analysis range in which the palindrome structure is detected is detected and output.
For example, if the analysis range is 100 bases, as described above, a 100 base palindrome structure can be detected.
In this case, the palindrome structure is detected using the analysis range of the palindrome structure {pastront, non palindrome, palindrome} = {3, 94, 3}. A 100-stage shift register that performs bit transfer in parallel in bit units (sign bit, most significant bit, and least significant bit) is required.
Further, as the shift register group, the number of registers obtained by the following equation (1) is required.

解析範囲として最大の値を有する回文演算部、すなわち｛回文，非回文，回文｝＝｛３，９４，３｝には、回文レジスタに対して加算結果Ｐ１_ｎ−９４を転送する（ゲノム配列における塩基の順番をずらす）シフトレジスタが必要なく、回文演算部が４８個であるため、図１７における方向Ｑにおけるレジスタ数は４８（＝（９４／２）＋１）となり、｛回文，非回文，回文｝＝｛３，０，３｝の解析範囲に対応して、回文レジスタと回文演算部との間に介挿されたシフトレジスタ群における最長の段数のシフトレジタが４７段のため、上記（１）式からシフトレジスタ群におけるレジスタ数を計算することができる。
したがって、１００個の塩基における回文構造を検出する回路においては、１１２８個（＝４８×４７）のレジスタが必要となる。
また、回文レジスタは、４８個の回文演算部に対応しているため、すなわち４８個の加算結果（Ｐ１_ｎ，Ｐ１_ｎ−２，…）を記憶する必要があるため、４８個のレジスタが必要となる。 The addition result P1 _n-94 is transferred to the palindrome register in the palindrome operation unit having the maximum value as the analysis range, that is, {pastoral, non palindrome, palindrome} = {3, ₉₄ , 3}. No shift register is required (shifting the base sequence in the genome sequence), and there are 48 palindromic operation units, so the number of registers in direction Q in FIG. 17 is 48 (= (94/2) +1), { Corresponding to the analysis range of palindrome, non palindrome, palindrome} = {3, 0, 3}, the longest number of stages in the shift register group inserted between the palindrome register and the palindrome operation unit Since the shift register has 47 stages, the number of registers in the shift register group can be calculated from the above equation (1).
Therefore, in a circuit for detecting a palindrome structure with 100 bases, 1128 (= 48 × 47) registers are required.
Further, since the palindrome register corresponds to 48 palindromic operation units, that is, it is necessary to store 48 addition results (P1 _n , P1 _n-2 ,...), 48 registers Is required.

そして、偶数回文演算部と奇数回文演算部とに対応させるため、上述したレジスタ数及び回文演算部の数の２倍を必要とするが、現在のＬＳＩの集積度を考慮すると、容易に１チップ化することができる。
また、回文計算ユニットの計算が、上記周期Ｔの間にて、すなわち１クロック周期内にて計算できるとすると、回文計算ユニット下のデータ入力シフトレジスタにおいてデータをシフトさせるために必要な１００クロックと、シフトレジスタ群における最大長のシフトレジスタ（４７段）に対する４７クロックの１４７クロック分で回文構造を把握することができる。
そして、ヒトのゲノム配列における塩基数を３０億とすると、１００塩基までの回文構造を知るためには、３０億＋１４７クロックにて処理できる。
ここで、３０億に比し１４７がほぼ「０」に等しい小さい数と判断して、１００ＭＨｚで動作させるとすると、
３．０×１０９／（１００×１０６）＝３０（ｓ）
にて処理できることがわかる。 In order to correspond to the even palindromic operation unit and the odd palindromic operation unit, twice the number of registers and the number of palindromic operation units described above are required. Can be made into one chip.
If the palindromic calculation unit can calculate the period T, that is, within one clock period, the data input shift register under the palindromic calculation unit needs to shift data 100. The palindrome structure can be grasped by 147 clocks of 47 clocks for the maximum length shift register (47 stages) in the shift register group.
Then, if the number of bases in the human genome sequence is 3 billion, in order to know the palindrome structure up to 100 bases, it can be processed in 3 billion +147 clocks.
Here, it is assumed that 147 is a small number substantially equal to “0” compared to 3 billion, and is operated at 100 MHz.
3.0 × 109 / (100 × 106) = 30 (s)
It can be seen that can be processed by.

＜第３の実施形態＞
また、近年に開発された組み替えＤＮＡ実験技術または遺伝子工学という技術において多用される極めて注目すべき物質として、制限酵素というものが天然に存在することが見いだされている。この制限酵素は、自身以外の生物のＤＮＡを認識し、特定の塩基配列における任意の場所あるいは特定の場所にて切断して分解する性質を有している（例えば、特開平１０−２６２６９９号公報参照）。また、この制限酵素の多くは細菌中に存在し、この細菌に侵入する外来のＤＮＡを排除するための機能として存在する。これらの制限酵素それぞれは、ＤＮＡにおける一種類の短い塩基配列を認識する。この認識される塩基配列の長さは通常４から６塩基対であり、切断部位の塩基配列は回文構造（パリンドローム構造）を示している。この制限酵素を用いることにより、分子生物学者は外来のＤＮＡを、目的とするＤＮＡ、例えばプラスミドに挿入できる形とするため、プラスミドを開環させる。
また、制限酵素は、ＤＮＡ分子から必要とする断片を切り取るための手段として用いられている。
プラスミドと制限酵素の発見により、ＤＮＡをクローン化し、望みのＤＮＡ断片の正確なコピーを得ることが容易となった。そのため、制限酵素を用いてその断片をＤＮＡから単離し、それをプラスミドに挿入して、混成プラスミドを宿主細菌に導入することにより、ＤＮＡ断片のコピーが容易に生成できる。 <Third Embodiment>
Further, it has been found that a restriction enzyme is naturally present as a very remarkable substance frequently used in a technique called recombinant DNA experimental technique or genetic engineering developed in recent years. This restriction enzyme has the property of recognizing the DNA of a living organism other than itself and cleaving it at any location in a specific base sequence or at a specific location (for example, JP-A-10-262699). reference). Many of these restriction enzymes are present in bacteria and exist as a function for eliminating foreign DNA that invades the bacteria. Each of these restriction enzymes recognizes one type of short base sequence in DNA. The length of this recognized base sequence is usually 4 to 6 base pairs, and the base sequence of the cleavage site shows a palindromic structure (palindromic structure). By using this restriction enzyme, the molecular biologist opens the plasmid to make the foreign DNA into a form that can be inserted into the target DNA, eg, a plasmid.
Restriction enzymes are used as a means for cutting out necessary fragments from DNA molecules.
The discovery of plasmids and restriction enzymes has made it easier to clone DNA and obtain an exact copy of the desired DNA fragment. Therefore, a copy of the DNA fragment can be easily generated by isolating the fragment from DNA using restriction enzymes, inserting it into a plasmid, and introducing the hybrid plasmid into a host bacterium.

ここで、ＤＮＡ分子において、各制限酵素の切断部位の配置を示す図を制限酵素切断地図と言う。
また、従来から行われている上記制限酵素切断地図の作成手順を以下に示す。
まず、制限酵素切断地図を作成するＤＮＡを、対象生物の組織から全ゲノムＤＮＡとして抽出し、物理的な手法により細かく断片化し、ＢＡＣ（bacterial artificial chromosome）やＹＡＣ（yeast artificial chromosome）などのベクターにクローニングしてゲノムＤＮＡライブラリを作成する。
次に、上記ゲノムＤＮＡライブラリに含まれるクローンの何種類かを制限酵素によって切断し、切断部位のクローン上の位置を決定して、クローンにおける制限酵素切断地図を作成する。そして、各クローン間の制限酵素切断地図を比較し、その切断パターンの解析を行い、隣接するクローンを抽出する（フィンガープリント法）。
複数のクローンについて隣接関係を特定し、ある程度の長さの領域をカバーするクローンのグループをコンティグと呼ぶ。この作業と平行して、遺伝子地図を参照して、得られた上述のコンティグを、遺伝子地図の染色体上に位置づける。 Here, a diagram showing the arrangement of the cleavage sites of each restriction enzyme in a DNA molecule is called a restriction enzyme cleavage map.
Moreover, the procedure for preparing the restriction enzyme cleavage map that has been conventionally performed is shown below.
First, the DNA that creates the restriction enzyme cleavage map is extracted as whole genome DNA from the tissue of the target organism, fragmented by physical methods, and used as a vector such as BAC (bacterial artificial chromosome) or YAC (yeast artificial chromosome). Cloning to create a genomic DNA library.
Next, several types of clones contained in the genomic DNA library are cleaved with a restriction enzyme, the position of the cleavage site on the clone is determined, and a restriction enzyme cleavage map in the clone is prepared. Then, restriction enzyme cleavage maps between the clones are compared, the cleavage pattern is analyzed, and adjacent clones are extracted (fingerprint method).
A group of clones that specify an adjacent relationship for a plurality of clones and cover an area of a certain length is called a contig. In parallel with this operation, the above-mentioned contig obtained is positioned on the chromosome of the genetic map with reference to the genetic map.

この位置づけを行う際、位置づけするマーカーのうち、塩基配列の配列情報を有するものを利用し、その塩基配列を基にＰＣＲ（Polymerase Chain Reaction）プライマーまたはハイブリダイゼーションプローブを作成して、ゲノムＤＮＡライブラリの中のクローンから、そのマーカーが塩基配列を有するクローンを特定する。あるコンティグに含まれるクローンの一つがこのようにして染色体上に位置づけられれば、そのコンティグそのものの染色体上の位置も明らかとなる。以上の作業を繰り返すことにより、染色体のほぼ全域にコンティグとしてのクローンを位置づけることができれば、物理地図としての制限酵素切断地図を完成させることができる。 When performing this positioning, use a marker that has sequence information of the base sequence among the markers to be positioned, create a PCR (Polymerase Chain Reaction) primer or hybridization probe based on the base sequence, A clone having the nucleotide sequence of the marker is identified from the clones in the middle. If one of the clones contained in a certain contig is positioned on the chromosome in this way, the position of the contig itself on the chromosome is also revealed. If a clone as a contig can be located almost throughout the chromosome by repeating the above operations, a restriction enzyme cleavage map as a physical map can be completed.

しかしながら、現在、高速にゲノム情報がサンガー法などにより、ＤＮＡから読み出されることから、上述した実験を繰り返して制限酵素切断地図を生成することは非常に手間がかかることになる。
ここで、ＤＮＡにおける切断部位のほとんどが４、５、６または８塩基対の回文構造であることから、第１の実施形態におけるゲノム配列における回文構造検出システムを用いることより、高速に回文構造の塩基対を検索し、容易に制限酵素の切断部位を示す制限酵素切断地図を生成することができることになる。以下、制限酵素切断地図を生成するための回文構造検出システムの説明を行う。 However, since genome information is read from DNA at high speed by the Sanger method or the like at present, it is very time-consuming to generate a restriction enzyme cleavage map by repeating the above-described experiment.
Here, since most of the cleavage sites in DNA are palindromic structures of 4, 5, 6 or 8 base pairs, the palindromic structure detection system in the genomic sequence in the first embodiment can be used to perform faster round-trips. By searching for the base pair of the sentence structure, a restriction enzyme cleavage map showing the restriction enzyme cleavage site can be easily generated. Hereinafter, a palindrome structure detection system for generating a restriction enzyme cleavage map will be described.

以下に、図２２を参照して本発明の第３の実施形態によるゲノム配列における回文構造検出システムの説明を行う。図２２は、本実施形態によるゲノム配列における回文構造検出システムの構成例を示すブロック図である。
本実施形態については、回文構造の解析範囲を、非回文ｑ（＝０）塩基、すなわち回文構造中に非回文塩基がなく、回文２（＝ｒ）塩基、回文３（＝ｒ）塩基からなる構造として、解析範囲の中央に対して対照な位置の塩基配列を塩基対として、解析範囲の塩基配列に対して１塩基ずつシフトさせつつ回文構造の検出を行う。 Hereinafter, a palindrome structure detection system in a genome sequence according to the third embodiment of the present invention will be described with reference to FIG. FIG. 22 is a block diagram illustrating a configuration example of the palindrome structure detection system in the genome sequence according to the present embodiment.
In the present embodiment, the analysis range of the palindrome structure is set to a non-palindrome q (= 0) base, that is, there is no non-palindrome base in the palindrome structure, palindrome 2 (= r) base, palindrome 3 ( = R) As a structure composed of bases, the palindrome structure is detected while shifting the base sequence of the analysis range one base at a time, using the base sequence at the contrast position with respect to the center of the analysis range as a base pair.

塩基／数字変換部１は、第１の実施形態と同様に、時系列に１塩基ずつシフトして入力されるゲノム配列における各塩基を、対応する符号の付された数字に置き換え、すなわち塩基Ｇを「＋２」、塩基Ａを「＋１」、塩基Ｃを「−２」、塩基Ｔを「−１」へ変換して出力する。
例えば、シフトレジスタ１５０は、上記塩基／数字変換部１から時系列に入力される符号の付された数字を、記塩基／数字変換部１が数字を出力するタイミングに応じて、１つずつ順次シフトさせ、シリアルに入力された数字をパラレルに出力する。このシフトレジスタ１５０は、後述する塩基対生成部２における解析範囲内の塩基配列数に対応する数字分だけシフトできる構成とする。 As in the first embodiment, the base / number conversion unit 1 replaces each base in the genome sequence inputted by shifting one base at a time in time series with a number with a corresponding code, that is, base G Is converted to “+2”, base A is “+1”, base C is converted to “−2”, and base T is converted to “−1” and output.
For example, the shift register 150 sequentially adds the numbers with the signs inputted in time series from the base / number conversion unit 1 one by one in accordance with the timing at which the base / number conversion unit 1 outputs the numbers. Shift and output serially input numbers in parallel. The shift register 150 is configured to be able to shift by a number corresponding to the number of base sequences within the analysis range in the base pair generation unit 2 described later.

塩基対生成部２は、シフトレジスタ１５０からパラレルに入力されるゲノム配列において、解析範囲内の塩基配列の両端から中央方向に塩基を一つずつずらして、該解析範囲内の各両端からｒ個ずつの塩基を両端からの位置に対応して（非回文塩基を挟んで対称位置にある塩基を）組み合わせて、予め設定された塩基対を生成する。
すなわち、塩基対生成部２は、解析範囲の中央部を挟んで、それぞれ対向する位置（非回文塩基を挟んで対称位置）にある塩基の組み合わせを塩基対として、対応する後述の回文演算部へ出力する。 The base pair generation unit 2 shifts one base from the both ends of the base sequence in the analysis range one by one in the center direction in the genome sequence input in parallel from the shift register 150, and r r from each end in the analysis range. Each base is combined corresponding to the position from both ends (bases at symmetrical positions with a non-palinity base in between) to generate a preset base pair.
In other words, the base pair generation unit 2 uses a combination of bases at opposite positions (symmetric positions with a non-palinity base in between) across the center of the analysis range as base pairs, and the corresponding palindromic computation described later. Output to the section.

回文演算部２０１は、偶数回文演算部であり、解析範囲として、｛回文塩基、非回文塩基、回文塩基｝＝｛２，０，２｝に対応する構成であり、２つの第１の加算部３_６１と、３つの絶対値部４_６１と１つの第２の加算部５_６１から構成されている。
第１の加算部３_６１各々は、Ａ_ｎ−３＋Ａ_ｎと、Ａ_ｎ−２＋Ａ_ｎ−１とのそれぞれの加算を行い、それぞれの加算結果を対応する絶対値部４_６１へ各々出力する。
絶対値部４_６１各々は、対応する第１の加算部３_６１から出力される演算結果の絶対値Ｓ１_ｎ，Ｓ２_ｎを出力する。
第２の加算部５６１は、絶対値４_６１各々からの絶対値Ｓ１_ｎ，Ｓ２_ｎを加算し、加算結果Ｐ１_ｎを出力する。この加算結果Ｐ１_ｎが「０」となれば、解析範囲における回文塩基同士が回文構造をしていることが検出される。 The palindromic operation unit 201 is an even palindromic operation unit, and has a configuration corresponding to {pastolic base, non-pastronomic base, palindromic base} = {2, 0, 2} as an analysis range. The first adding unit 3 ₆₁ includes three absolute value units 4 ₆₁ and one second adding unit 5 ₆₁ .
Each of the first addition units 3 ₆₁ performs the addition of A _n−3 + A _n and A _n−2 + A _n−1, and outputs each addition result to the corresponding absolute value unit 4 ₆₁ . .
Each of the absolute value units 4 ₆₁ outputs the absolute values S1 _n and S2 _n of the calculation results output from the corresponding first adder 3 ₆₁ .
The second addition unit 561 adds the absolute value _S1 n, S2 _n from the absolute value _{4 61} respectively, and outputs the addition result P1 _n. If the addition result P1 _n is “0”, it is detected that palindromic bases in the analysis range have palindromic structures.

回文演算部２０２は、偶数回文演算部であり、解析範囲として、｛回文塩基、非回文塩基、回文塩基｝＝｛３，０，３｝に対応する構成であり、３つの第１の加算部３_６２と、３つの絶対値部４_６２と１つの第２の加算部５_６２から構成されている。
第１の加算部３_６２各々は、Ａ_ｎ−５＋Ａ_ｎと、Ａ_ｎ−４＋Ａ_ｎ−１と、Ａ_ｎ−３＋Ａ_ｎ−２とのそれぞれの加算を行い、対応する絶対値部４_６２へ各々出力する。
絶対値部４_６２各々は、対応する第１の加算部３_６２から出力される演算結果の絶対値Ｓ１_ｎ−１，Ｓ２_ｎ−１，Ｓ３_ｎ−１を出力する。
第２の加算部５_６２は、絶対値４_６２各々からの絶対値Ｓ１_ｎ−１，Ｓ２_ｎ−１，Ｓ３_ｎ−１を加算し、加算結果Ｐ１_ｎ−１を出力する。この加算結果Ｐ１_ｎ−１が「０」となれば、解析範囲における回文塩基同士が回文構造をしていることが検出される。 The palindromic operation unit 202 is an even palindromic operation unit, and has a configuration corresponding to {pastolic base, non-palindromic base, palindromic base} = {3, 0, 3} as an analysis range. The first addition unit 3 ₆₂ includes three absolute value units 4 ₆₂ and one second addition unit 5 ₆₂ .
Each of the first addition units 3 ₆₂ performs addition of A _n−5 + A _n , A _n−4 + A _n−1 , and A _n−3 + A _n−2, and the corresponding absolute value unit 4 Each output to ₆₂ .
Each of the absolute value units 4 ₆₂ outputs the absolute values S1 _n−1 , S2 _n−1 , and S3 _n−1 of the operation results output from the corresponding first adder 3 ₆₂ .
The second addition unit _{5 62} adds the absolute value _{_{S1 n-1, S2 n-}} 1, S3 n-1 from the absolute value _{4 62} respectively, and outputs the addition result _{P1 n-1.} If the addition result P1 _n-1 is “0”, it is detected that palindromic bases in the analysis range have a palindromic structure.

データベース５００には、回文構造の塩基配列における塩基の種類及び塩基の並び順とに対応して、その回文構造を切断部位とする制限酵素を表す情報が記憶されている。
検索部５０１は、塩基／数字変換部１が塩基を番号に変換して出力する周期単位に、Ｐ１_ｎが「０」で出力された場合、シフトレジスタ１５０が出力している回文構造｛２，０，２｝のＡ_ｎ，Ａ_ｎ−１，Ａ_ｎ−２，Ａ_ｎ−３の塩基配列に対応する制限酵素をデータベース５００から検索し、Ｐ１_ｎ−１が「０」で出力された場合、シフトレジスタ１５０が出力している回文構造｛３，０，３｝のＡ_ｎ，Ａ_ｎ−１，Ａ_ｎ−２，Ａ_ｎ−３，Ａ_ｎ−４，Ａ_ｎ−５の塩基配列に対応する制限酵素をデータベース５００から検索し、検索されるとＤＮＡの塩基配列の番号とともに出力する。
ここで、検索部５０１には、シフトレジスタ１５０のＡ_ｎ〜Ａ_ｎ−５各々から出力される塩基の情報（回文構造の塩基配列における塩基の種類及び塩基の並び順）と、加算結果Ｐ１_ｎ及びＰ１_ｎ−１と、シフトレジスタ１５０のＡ_ｎから出力される塩基のＤＮＡにおける配列位置を示す位置番号（塩基／数字変換部１から出力される塩基配列の先頭から何番目の塩基から塩基配列が開始されているかを示す番号）とが入力される。 The database 500 stores information representing restriction enzymes having the palindrome structure as a cleavage site, corresponding to the type of base and the order of bases in the base sequence of the palindrome structure.
The search unit 501 outputs the palindrome structure {2} output by the shift register 150 when P1 _n is output as “0” in the cycle unit that the base / number conversion unit 1 converts the base into a number and outputs it. , 0, 2}, a restriction enzyme corresponding to the base sequence of _An , _An-1 , _An-2 , _An-3 was searched from the database 500, and P1 _n-1 was output as "0" In this case, the bases of A _n , A _n−1 , A _n−2 , A _n−3 , A _n−4 , and A _n−5 of the palindrome structure {3, 0, 3} output from the shift register 150. A restriction enzyme corresponding to the sequence is searched from the database 500. When the search is performed, the restriction enzyme is output together with the DNA base sequence number.
Here, the search unit 501 includes a base which is output from the A _n ~A _{_n-5} each shift register 150 information (the type and order of bases of bases in the base sequence of the palindromic), the addition result P1 and _n and P1 _n-1, the base from the beginning of the position number (base / numeric nucleotide sequence that is output from the conversion unit 1 showing the arrangement position in the DNA bases output from a _n of the shift register 150 from what number bases A number indicating whether the sequence has been started).

次に、図２３を参照して第３の実施形態によるゲノム配列における回文構造検出システムの動作を説明する。図２３は回文構造システムの動作を示す回文構造パターン生成、及び検出された回文構造の塩基配列により、その塩基配列を切断部位とする制限酵素の検索及びその回文構造のＤＮＡにおける位置を特定し、制限酵素切断地図の生成を説明する概念図である。
以下の説明における回文構造パターン生成を説明する図２３は、図示しない表示部が制限酵素切断地図生成機能を有する回文構造検出システムの検出結果を解析結果図として表示するものである。 Next, the operation of the palindrome structure detection system in the genome sequence according to the third embodiment will be described with reference to FIG. FIG. 23 shows the palindrome structure pattern generation showing the operation of the palindrome structure system, the search for restriction enzymes having the base sequence as a cleavage site based on the detected base sequence of the palindrome structure, and the position of the palindrome structure in DNA FIG. 2 is a conceptual diagram for explaining the generation of a restriction enzyme cleavage map.
FIG. 23 for explaining the palindrome structure pattern generation in the following description displays the detection result of the palindrome structure detection system in which the display unit (not shown) has a restriction enzyme cut map generation function as an analysis result diagram.

この図２３において、最も左の列の欄がゲノム配列における塩基配列の先頭からの順番を示し、次の列の欄にそれぞれの番号に対応した核酸塩基名が記載されている。ここで、核酸分類数（符号付き数字）の欄の列は２列あって、いずれの列も同様の記載であり（この列は１列としても良い）、塩基／数字変換部１により隣接する左の列の欄の各塩基が変換された符号付き数字（核酸分類数）が記載されている。すなわち、図９と同様に、左すみの列の欄から、Ａ、Ｔ、Ｇ、Ｃ各々の塩基名と、この塩基名Ａ、Ｔ、Ｇ、Ｃに対応した＋１、−１、＋２、−２の符号付き数字（核酸分類数）が示されている。ここで、各核酸分類数（符号付き数字）の列は、第１の実施形態と同様に、それぞれ検出する回文構造の種類に対応しており、回文構造と検出された塩基配列の領域の核酸分類数を囲む、あるいは色を変えるなどのマーキングを行い視覚的にいずれの領域が回文構造であるかを示すために用いられる。 In FIG. 23, the leftmost column column indicates the order from the top of the base sequence in the genome sequence, and the nucleobase name corresponding to each number is described in the next column column. Here, there are two columns in the column of the number of nucleic acid classifications (signed numbers), and both columns have the same description (this column may be one column) and are adjacent by the base / number conversion unit 1. A signed number (number of nucleic acid classifications) in which each base in the column of the left column is converted is described. That is, as in FIG. 9, from the column of the left corner column, base names of A, T, G, and C, and +1, −1, +2, −, corresponding to the base names A, T, G, and C, A signed number of 2 (number of nucleic acid classifications) is shown. Here, each nucleic acid classification number (number with a sign) corresponds to the type of palindrome structure to be detected as in the first embodiment, and the palindrome structure and the region of the detected base sequence It is used to visually mark which region has a palindrome structure by marking such as changing the number of nucleic acid classifications or changing the color.

次の回文構造｛３，０，３｝の下部には、絶対値Ｓ１_ｎ−１，Ｓ_ｎ−１及びＳ３_ｎ−１の欄があり、回文演算部２０１における各第１の加算部３_５１各々がシフトレジスタ１５０から出力される上記符号付き数字を加算し、絶対値部４_６１がこの加算結果を絶対値化した絶対値Ｓ１_ｎ−１，Ｓ２_ｎ−１，Ｓ３_ｎ−１各々の出力が示され、また、加算結果Ｐ１_ｎ−１の欄があり、絶対値Ｓ１_ｎ−１（｜Ａ_ｎ＋Ａ_ｎ−５｜），Ｓ２_ｎ−１（｜Ａ_ｎ−１＋Ａ_ｎ−４｜），Ｓ３_ｎ−１（｜Ａ_ｎ−２＋Ａ_ｎ−３｜）が加算された加算結果Ｐ１_ｎ−１の数値が示されている。
次の回文構造｛２，０，２｝の下部には、絶対値Ｓ１_ｎ及びＳ_ｎの欄があり、回文演算部２０２における各第１の加算部３_６２各々がシフトレジスタ１５０から出力される上記符号付き数字を加算し、絶対値部４_６２がこの加算結果を絶対値化した絶対値Ｓ１_ｎ，Ｓ２_ｎ各々の出力が示され、また、加算結果Ｐ１_ｎの欄があり、絶対値Ｓ１_ｎ（｜Ａ_ｎ＋Ａ_ｎ−３｜），Ｓ２_ｎ（｜Ａ_ｎ−１＋Ａ_ｎ−２｜）が加算された加算結果Ｐ１_ｎの数値が示されている。 Below the palindrome structure {3, 0, 3}, there are columns of absolute values S1 _n−1 , S _n−1, and S3 _n−1. 3 ₅₁ each by adding the signed numeric output from the shift register 150, the absolute value unit _{4 61} absolute value _S1 _n-1 obtained by the absolute value of this _{_{sum, S2 n-1, S3 n}} -1 each And an addition result P1 _n-1 column, and absolute values S1 _n-1 (| A _n + A _n-5 |), S2 _n-1 (| A _n-1 + A _n-4 |), S3 _n−1 (| A _n−2 + A _n−3 |) are added, and the numerical value of the addition result P1 _n−1 is shown.
At the bottom of the next palindrome {2,0,2}, has the column of the absolute value S1 _n and _{S n,} outputs the first adding section _{3 62} Each of the palindrome calculation unit 202 from the shift register 150 by adding the signed numbers is, the absolute value unit 4 ₆₂ output of the absolute value S1 _n, S2 _n each has an absolute value of this sum is shown, also has columns of the addition result P1 _n, absolute A numerical value of the addition result P1 _n obtained by adding the values S1 _n (| A _n + A _n−3 |) and S2 _n (| A _n−1 + A _n−2 |) is shown.

そして、検索部５０１は、回文演算部２０１から出力される加算結果Ｐ１_ｎ、あるいは回文演算部２０２から出力される加算結果Ｐ１_ｎ−１が「０」であることを検出すると、いずれの回文構造が検出されたかをいずれが「０」であることを検出することにより判定し、対応する構成（回文構造｛２，０，２｝あるいは回文構造｛３，０，３｝）の回文構造として検出された塩基配列に対応する制限酵素を、それぞれの回文構造の塩基配列における塩基の種類及び塩基の並び順によりデータベース５００から検索する。
例えば、位置番号１４から始まる塩基配列｛−１，−１，１，１｝＝｛Ｔ，Ｔ，Ａ，Ａ｝が回文構造であるため、回文演算部２０１は、加算結果Ｐ１_ｎを「０」として出力する。
ここで、
Ｓ１_ｎ＝｜Ａ_ｎ＋Ａ_ｎ−３｜＝０、Ｓ２_ｎ＝｜Ａ_ｎ−１＋Ａ_ｎ−２｜＝０
であり、
Ｐ１_ｎ＝Ｓ１_ｎ＋Ｓ２_ｎ＝０＋０＝０
となる。 When the search unit 501 detects that the addition result P1 _n output from the palindromic calculation unit 201 or the addition result P1 _n−1 output from the palindromic calculation unit 202 is “0”, Whether a palindrome structure is detected is determined by detecting which one is “0”, and the corresponding configuration (palindrome structure {2, 0, 2} or palindrome structure {3, 0, 3}) The restriction enzyme corresponding to the base sequence detected as the palindrome structure is searched from the database 500 by the base type and the base arrangement order in the base sequence of each palindrome structure.
For example, since the base sequence {-1, -1, 1, 1} = {T, T, A, A} starting from the position number 14 has a palindrome structure, the palindromic operation unit 201 sets the addition result P1 _n to Output as “0”.
here,
_{_{_{_{S1 n = | A n + A}}}} n-3 | = 0, S2 n = | A n-1 + A n-2 | = 0
And
P1 _n = S1 _n + S2 _n = 0 + 0 = 0
It becomes.

加算結果Ｐ１_ｎが入力されることにより、検索部５０１は、回文演算部２０１から入力された加算結果Ｐ１_ｎが「０」であるため、回文構造｛２，０，２｝である塩基配列｛−１，−１，１，１｝に対応する制限酵素を、データベース５００において検索する。
このとき、検索部５０１は、塩基配列｛−１，−１，１，１｝に対応する制限酵素がデータベース５００に記憶されていないことを検出し、図２３に示すように、回文構造｛２，０，２｝に対応する核酸分類数（符号付き数字）の欄の列において、塩基配列｛−１，−１，１，１｝の領域部分をマークし、図示しない表示部に対して「特になし」を表示して、この塩基配列｛−１，−１，１，１｝に対応する制限酵素が無いことを通知する。 When the addition result P1 _n is input, the search unit 501 has the base of the palindrome structure {2, 0, 2} because the addition result P1 _n input from the palindromic operation unit 201 is “0”. A restriction enzyme corresponding to the sequence {-1, -1, 1, 1} is searched in the database 500.
At this time, the search unit 501 detects that the restriction enzyme corresponding to the base sequence {-1, -1, 1, 1} is not stored in the database 500, and as shown in FIG. In the column of the column of the number of nucleic acid classification (signed number) corresponding to 2, 0, 2}, mark the region portion of the base sequence {-1, -1, 1, 1} and “None” is displayed to notify that there is no restriction enzyme corresponding to this base sequence {-1, -1, 1, 1}.

一方、位置番号１５から始まる塩基配列｛−２，−１，−１，１，１，２｝＝｛Ｃ，Ｔ，Ｔ，Ａ，Ａ，Ｇ｝が回文構造であるため、回文演算部２０２は、加算結果Ｐ１_ｎ−１を「０」として出力する。
ここで、
Ｓ１_ｎ−１＝｜Ａ_ｎ＋Ａ_ｎ−５｜＝０、Ｓ２_ｎ−１＝｜Ａ_ｎ−１＋Ａ_ｎ−４｜＝０、Ｓ３_ｎ−１＝｜Ａ_ｎ−２＋Ａ_ｎ−３｜＝０
であり、
Ｐ１_ｎ−１＝Ｓ１_ｎ−１＋Ｓ２_ｎ−１＋Ｓ３_ｎ−１＝０＋０＋０＝０
となる。 On the other hand, since the base sequence {-2, -1, -1,1,1,2} = {C, T, T, A, A, G} starting from position number 15 has a palindrome structure, palindromic operations The unit 202 outputs the addition result P1 _n−1 as “0”.
here,
S1 _n-1 = | A _n + A _n-5 | = 0, S2 _n-1 = | A _n-1 + A _n-4 | = 0, S3 _n-1 = | A _n-2 + A _n-3 | = 0
And
P1 _n-1 = S1 _n-1 + S2 _n-1 + S3 _n-1 = 0 + 0 + 0 = 0
It becomes.

加算結果Ｐ１_ｎ−１が入力されることにより、検索部５０１は、回文演算部２０１から入力される加算結果Ｐ１_ｎ−１が「０」であるため、回文構造｛３，０，３｝である塩基配列｛−２，−１，−１，１，１，２｝に対応する制限酵素を、データベース５００において検索する。
このとき、検索部５０１は、塩基配列｛−２，−１，−１，１，１，２｝に対応する制限酵素ＥcoRIがデータベース５００に記憶されていることを検出し、図２３に示すように、図示しない表示部に対して、回文構造｛３，０，３｝に対応する核酸分類数（符号付き数字）の欄の列において、塩基配列｛−２，−１，−１，１，１，２｝の領域部分をマークするとともに、回文構造の塩基配列の先頭の位置番号の行に対応した位置（例えば、Ｐ１_ｎ−１が「０」と表示された右隣の欄）に、検出された制限酵素名「ＥcoRI」を表示し、この塩基配列｛−２，−１，−１，１，１，２｝に対応する制限酵素が「ＥcoRI」であること、および制限酵素の切断部位のＤＮＡの塩基配列における回文構造の位置を通知する。また、上記位置番号を制限酵素名とともに表示部に表示させるようにしても良い。 When the addition result P1 _n−1 is input, the search unit 501 has the palindrome structure {3, 0, 3 because the addition result P1 _n−1 input from the palindromic operation unit 201 is “0”. }, The database 500 is searched for a restriction enzyme corresponding to the base sequence {-2, -1, -1, 1, 1, 2}.
At this time, the search unit 501 detects that the restriction enzyme EcoRI corresponding to the base sequence {-2, -1, -1,1,1,2} is stored in the database 500, as shown in FIG. In addition, in the column of the column of the number of nucleic acid classifications (signed numbers) corresponding to the palindromic structure {3, 0, 3} with respect to the display unit (not shown), the base sequence {-2, -1, -1, 1 , 1, 2}, and the position corresponding to the first position number line of the palindromic base sequence (for example, the right-hand column where P1 _n-1 is displayed as “0”) The name of the detected restriction enzyme “EcoRI” is displayed, the restriction enzyme corresponding to this base sequence {−2, −1, −1,1,1,2} is “EcoRI”, and the restriction enzyme The position of the palindrome structure in the base sequence of the DNA at the cleavage site is notified. The position number may be displayed on the display unit together with the restriction enzyme name.

また、回文演算部２０１は位置番号２４において、加算結果Ｐ１_ｎを「０」として出力する。
そして、検索部５０１は、回文演算部２０１から入力された加算結果Ｐ１_ｎが「０」であるため、回文構造｛２，０，２｝である塩基配列｛−２，−２，２，２｝＝｛Ｃ，Ｃ，Ｇ，Ｇ｝に対応する制限酵素を、データベース５００において検索する。
このとき、検索部５０１は、塩基配列｛−２，−２，２，２｝に対応する制限酵素ＰalIがデータベース５００に記憶されていることを検出し、図２３に示すように、図示しない表示部に対して、回文構造｛２，０，２｝に対応する核酸分類数（符号付き数字）の欄の列において、塩基配列｛−２，−２，２，２｝の領域部分をマークするとともに、回文構造の塩基配列の先頭の位置番号の行に対応した位置（例えば、Ｐ１_ｎが「０」と表示された右隣の欄）に、検出された制限酵素名「ＰalI」を表示し、この塩基配列｛−２，−２，２，２｝に対応する制限酵素が「ＰalI」であること、および制限酵素の切断部位のＤＮＡの塩基配列における回文構造の位置を通知する。また、上記位置番号を制限酵素名とともに表示部に表示させるようにしても良い。 The palindromic operation unit 201 outputs the addition result P1 _n as “0” at the position number 24.
Then, since the addition result P1 _n input from the palindromic operation unit 201 is “0”, the search unit 501 has the base sequence {−2, −2, 2 which is the palindromic structure {2, 0, 2}. , 2} = {C, C, G, G}, the restriction enzyme corresponding to the database 500 is searched.
At this time, the search unit 501 detects that the restriction enzyme PalI corresponding to the base sequence {-2, -2, 2, 2} is stored in the database 500, and as shown in FIG. Mark the region part of the base sequence {-2, -2, 2, 2} in the column of the number of nucleic acid classifications (signed numbers) corresponding to the palindrome structure {2, 0, 2} In addition, the detected restriction enzyme name “PalI” is placed at the position corresponding to the line of the first position number of the palindromic structure base sequence (for example, the right adjacent column where P1 _n is displayed as “0”). The restriction enzyme corresponding to this base sequence {-2, -2, 2, 2} is "PalI", and the palindromic structure position in the DNA base sequence of the restriction enzyme cleavage site is notified. . The position number may be displayed on the display unit together with the restriction enzyme name.

上述したように、本実施形態の回文構造検出システムにおいては、第１の実施形態と同様にＤＮＡの塩基配列から回文構造を有する塩基配列の領域を検出し、かつ検出された回文構造の塩基配列における塩基の種類及び塩基の並び順に対応した制限酵素を、データベース５００から検出し、ＤＮＡの塩基配列における先頭からの位置を示す位置番号とともに出力することにより、解析対象のＤＮＡの塩基配列に対する制限酵素切断地図を生成する機能を有している。
本実施形態においては、回文構造｛２，０，２｝及び回文構造｛３，０，３｝の２種類の回文構造の型の検出を行う構成として説明した。
制限酵素の切断部位の塩基配列としての回文構造において、回文構造｛３，０，３｝の型の塩基配列が全体の約７割を占めており、回文構造｛２，０，２｝の型の塩基配列が全体の約１．５割以上であることから、本実施形態の回文構造検出システムで全体の８．５割以上の制限酵素の切断部位に対応した制限酵素切断地図を生成することができる。 As described above, in the palindrome structure detection system of the present embodiment, the region of the base sequence having the palindrome structure is detected from the DNA base sequence as in the first embodiment, and the detected palindrome structure The restriction enzyme corresponding to the base type and the base sequence in the base sequence is detected from the database 500 and output together with the position number indicating the position from the beginning in the base sequence of the DNA, whereby the base sequence of the DNA to be analyzed It has a function of generating a restriction enzyme cleavage map for.
In the present embodiment, a description has been given of a configuration that detects two types of palindrome structures, palindrome structure {2, 0, 2} and palindrome structure {3, 0, 3}.
In the palindrome structure as the base sequence of the restriction enzyme cleavage site, the palindrome structure {3, 0, 3} type occupies about 70% of the whole, and the palindrome structure {2, 0, 2 } Type base sequence is about 1.5% or more of the whole, so that the restriction enzyme cleavage map corresponding to the whole of 8.5% or more restriction enzyme cleavage sites in the palindromic structure detection system of this embodiment. Can be generated.

また、制限酵素切断地図において、より切断部位を詳細とする場合、第１の実施形態の各回文演算部を用いて、より多種類の回文構造を検出する構成を加えることにより、より詳細な制限酵素の切断部位を示す制限酵素切断地図を生成することができる。
本実施形態の回文構造検出システムによれば、ＤＮＡにおける制限酵素による切断部位のほとんどが４塩基対（回文構造｛２，０，２｝）、５塩基対（回文構造｛２，１，２｝）、６塩基対（回文構造｛３，０，３｝）または８塩基対（回文構造｛４，０，４｝）の配列を有する回文構造であることから、第１の実施形態におけるゲノム配列における回文構造検出システムを用いることより、高速に回文構造の塩基対を検索し、容易に制限酵素の切断部位を示す制限酵素切断地図を生成することができる。 Further, in the restriction enzyme cleavage map, when the cleavage site is more detailed, by adding a configuration for detecting more types of palindrome structures using each palindrome calculation unit of the first embodiment, A restriction enzyme cleavage map showing restriction enzyme cleavage sites can be generated.
According to the palindrome structure detection system of the present embodiment, most of the cleavage sites by restriction enzymes in DNA are 4 base pairs (palindrome structure {2, 0, 2}), 5 base pairs (palindrome structure {2, 1). , 2}), 6 base pairs (palindrome structure {3, 0, 3}) or 8 base pairs (palindrome structure {4, 0, 4}). By using the palindrome structure detection system in the genome sequence in the embodiment, the base pair of the palindrome structure can be searched at high speed, and the restriction enzyme cleavage map showing the restriction enzyme cleavage site can be easily generated.

なお、図２、図４及び図２２における回文構造検出システムの機能を実現するための各部それぞれの動作を記述したプログラムをコンピュータ読み取り可能な記録媒体に記録して、この記録媒体に記録されたプログラムをコンピュータシステムに読み込ませ、実行することにより回文構造検出の各処理（制限酵素切断地図の作成を含む）を行ってもよい。なお、ここでいう「コンピュータシステム」とは、ＯＳや周辺機器等のハードウェアを含むものとする。また、「コンピュータシステム」は、ホームページ提供環境（あるいは表示環境）を備えたＷＷＷシステムも含むものとする。また、「コンピュータ読み取り可能な記録媒体」とは、フレキシブルディスク、光磁気ディスク、ＲＯＭ、ＣＤ−ＲＯＭ等の可搬媒体、コンピュータシステムに内蔵されるハードディスク等の記憶装置のことをいう。さらに「コンピュータ読み取り可能な記録媒体」とは、インターネット等のネットワークや電話回線等の通信回線を介してプログラムが送信された場合のサーバやクライアントとなるコンピュータシステム内部の揮発性メモリ（ＲＡＭ）のように、一定時間プログラムを保持しているものも含むものとする。 The program describing the operation of each part for realizing the function of the palindromic structure detection system in FIG. 2, FIG. 4 and FIG. 22 was recorded on a computer-readable recording medium and recorded on this recording medium. Each process of palindromic structure detection (including creation of a restriction enzyme cleavage map) may be performed by causing the computer system to read and execute the program. Here, the “computer system” includes an OS and hardware such as peripheral devices. The “computer system” includes a WWW system having a homepage providing environment (or display environment). The “computer-readable recording medium” refers to a storage device such as a flexible medium, a magneto-optical disk, a portable medium such as a ROM and a CD-ROM, and a hard disk incorporated in a computer system. Further, the “computer-readable recording medium” refers to a volatile memory (RAM) in a computer system that becomes a server or a client when a program is transmitted via a network such as the Internet or a communication line such as a telephone line. In addition, those holding programs for a certain period of time are also included.

また、上記プログラムは、このプログラムを記憶装置等に格納したコンピュータシステムから、伝送媒体を介して、あるいは、伝送媒体中の伝送波により他のコンピュータシステムに伝送されてもよい。ここで、プログラムを伝送する「伝送媒体」は、インターネット等のネットワーク（通信網）や電話回線等の通信回線（通信線）のように情報を伝送する機能を有する媒体のことをいう。また、上記プログラムは、前述した機能の一部を実現するためのものであっても良い。さらに、前述した機能をコンピュータシステムにすでに記録されているプログラムとの組み合わせで実現できるもの、いわゆる差分ファイル（差分プログラム）であっても良い。 The program may be transmitted from a computer system storing the program in a storage device or the like to another computer system via a transmission medium or by a transmission wave in the transmission medium. Here, the “transmission medium” for transmitting the program refers to a medium having a function of transmitting information, such as a network (communication network) such as the Internet or a communication line (communication line) such as a telephone line. The program may be for realizing a part of the functions described above. Furthermore, what can implement | achieve the function mentioned above in combination with the program already recorded on the computer system, what is called a difference file (difference program) may be sufficient.

ステムループ構造及び相補的な塩基対の説明に用いる概念図である。It is a conceptual diagram used for description of a stem loop structure and a complementary base pair. 本発明の第１の実施形態における回文構造検出システムの構成例を示すブロック図である。It is a block diagram which shows the structural example of the palindrome structure detection system in the 1st Embodiment of this invention. 第１の実施形態における各解析範囲に対応した回文構造演算部の構成を説明する概念図である。It is a conceptual diagram explaining the structure of the palindrome structure calculating part corresponding to each analysis range in 1st Embodiment. 本発明の第２の実施形態における回文構造検出システムの構成例を示すブロック図である。It is a block diagram which shows the structural example of the palindrome structure detection system in the 2nd Embodiment of this invention. 回文構造検出処理における、解析範囲とゲノム配列での塩基の順番との対応関係を説明する概念図である。It is a conceptual diagram explaining the correspondence between the analysis range and the order of bases in the genome sequence in palindromic structure detection processing. 第２の実施形態における各解析範囲に対応した回文構造演算部の構成を説明する概念図である。It is a conceptual diagram explaining the structure of the palindrome structure calculating part corresponding to each analysis range in 2nd Embodiment. 図４における回文構造部（回文計算ユニット）の構成例を示すブロック図である。It is a block diagram which shows the structural example of the palindrome structure part (pain count calculation unit) in FIG. ２つの解析範囲を合成して、各回文構造部による回文の塩基数より数の多い塩基数の回文構造を検出する処理を説明する概念図である。It is a conceptual diagram explaining the process which synthesize | combines two analysis ranges and detects the palindrome structure of more base numbers than the base number of palindrome by each palindrome structure part. 第２の実施形態における回文構造システムにおける奇数回文構造部に対応する動作を示す回文構造パターン生成の全体を示す概念図である。It is a conceptual diagram which shows the whole palindrome structure pattern production | generation which shows the operation | movement corresponding to the odd palindrome structure part in the palindrome structure system in 2nd Embodiment. 図９における領域Ａの拡大図である。It is an enlarged view of the area | region A in FIG. 図９における領域Ｂの拡大図である。FIG. 10 is an enlarged view of a region B in FIG. 9. ゲノム配列において相補的な塩基同士にて塩基対が形成され、ステムループ構造が形成されることを説明する概念図である。It is a conceptual diagram explaining that a base pair is formed by complementary bases in a genome sequence, and a stem loop structure is formed. ゲノム配列において相補的な塩基同士にて塩基対が形成され、ステムループ構造が形成されることを説明する概念図である。It is a conceptual diagram explaining that a base pair is formed by complementary bases in a genome sequence, and a stem loop structure is formed. 第２の実施形態における回文構造システムにおける偶数回文構造部に対応する動作を示す回文構造パターン生成の全体を示す概念図である。It is a conceptual diagram which shows the whole palindrome structure pattern production | generation which shows the operation | movement corresponding to the even palindrome structure part in the palindrome structure system in 2nd Embodiment. 図１４における領域Ａの拡大図である。It is an enlarged view of the area | region A in FIG. 図１４における領域Ｂの拡大図である。It is an enlarged view of the area | region B in FIG. 第１の加算部とこの第１の加算部に対応して設けられる絶対値部との機能を有する演算ユニットの構成例を示すブロック図である。It is a block diagram which shows the structural example of the arithmetic unit which has a function of the 1st addition part and the absolute value part provided corresponding to this 1st addition part. 図１５の演算ユニットを用いて、偶数回文演算部の構成例を示すブロック図である。It is a block diagram which shows the structural example of an even number palindromic operation part using the arithmetic unit of FIG. 本発明の第２の実施形態による回文構造検出システムにおける偶数回文構造検出部に対応する部分の構成例を示すブロック図である。It is a block diagram which shows the structural example of the part corresponding to the even palindrome structure detection part in the palindrome structure detection system by the 2nd Embodiment of this invention. ゲノム配列におけるステムループ構造を生成する可能性のある回文構造について説明する概念図である。It is a conceptual diagram explaining the palindrome structure which may produce | generate the stem loop structure in a genome arrangement | sequence. ゲノム配列におけるステムループ構造を生成する可能性のある回文構造について説明する概念図である。It is a conceptual diagram explaining the palindrome structure which may produce | generate the stem loop structure in a genome arrangement | sequence. 第３の実施形態における各解析範囲に対応した回文構造演算部の構成を説明する概念図である。It is a conceptual diagram explaining the structure of the palindrome structure calculating part corresponding to each analysis range in 3rd Embodiment. 第３の実施形態の制限酵素切断地図作成機能が付加された回文構造システムにおける偶数回文構造部に対応する動作を示す回文構造パターン生成の全体を示す概念図である。It is a conceptual diagram which shows the whole palindrome structure pattern production | generation which shows the operation | movement corresponding to the even palindrome structure part in the palindrome structure system to which the restriction enzyme cut | disconnection map creation function of 3rd Embodiment was added.

Explanation of symbols

１…塩基／数字変換部２…塩基対生成部３_６１，３_６２，３_６３，３_６４，３_６５，３_６６，３_６７，３_６８…第１の加算部４_６１，４_６２，４_６３，４_６４，４_６５，４_６６，４_６７，４_６８…絶対値部５_６１，５_６２，５_６３，５_６４，５_６５，５_６６，５_６７，５_６８…第２の加算部６１，６２，６３，６４，６５，６６，６７，６８…回文演算部３_７０，３_７１，３_７２，３_７３，３_７４，３_７５，３_７６，３_７７…第１の加算部４_７０，４_７１，４_７２，４_７３，４_７４，４_７５，４_７６，４_７７…絶対値部５００…データベース５０１…検索部５_７０，５_７１，５_７２，５_７３，５_７４，５_７５，５_７６，５_７７…第２の加算部７０，７１，７２，７３，７４，７５，７６，７７，２０１，２０２…回文演算部１００…演算ユニット１５０…シフトレジスタ１０１…ＥＸＮＯＲ（排他的論理和の否定）１０２，１０３…ＥＸＯＲ（排他的論理和）１０４，２００…ＯＲ（論理和） DESCRIPTION OF SYMBOLS 1 ... Base / number conversion part 2 ... Base pair production | generation part 3 ₆₁ , 3 ₆₂ , 3 ₆₃ , 3 ₆₄ , 3 ₆₅ , 3 ₆₆ , 3 ₆₇ , 3 ₆₈ ... 1st addition part 4 ₆₁ , 4 ₆₂ , 4 ₆₃ , 4 ₆₄ , 4 ₆₅ , 4 ₆₆ , 4 ₆₇ , 4 ₆₈ ... Absolute value parts 5 ₆₁ , 5 ₆₂ , 5 ₆₃ , 5 ₆₄ , 5 ₆₅ , 5 ₆₆ , 5 ₆₇ , 5 ₆₈ . 62, 63, 64, ₆₅ , ₆₆ , ₆₇ , ₆₈ ... palindromic operation units 3 ₇₀ , 3 ₇₁ , 3 ₇₂ , 3 ₇₃ , 3 ₇₄ , 3 ₇₅ , 3 ₇₆ , 3 ₇₇ ... First addition unit 4 ₇₀ , 4 ₇₁ , 4 ₇₂ , 4 ₇₃ , 4 ₇₄ , 4 ₇₅ , 4 ₇₆ , 4 ₇₇ ... absolute value part 500 ... database 501 ... search part 5 ₇₀ , 5 ₇₁ , 5 ₇₂ , 5 ₇₃ , 5 ₇₄ , 5 ₇₅ , 5 _76, 5 _{77 ...} the second adding section 70, 71, 72, 3, 74, 75, 76, 77, 201, 202 ... palindromic operation unit 100 ... operation unit 150 ... shift register 101 ... EXNOR (negative exclusive OR) 102, 103 ... EXOR (exclusive OR) 104, 200 ... OR (logical sum)

Claims

Bases each forming a base pair with the genomic sequence Ategai the digits to a palindrome detection system for detecting a palindrome in the genome sequence,
For each base in the input genome sequence, assign the same number to the bases that form a complementary relationship in the genome sequence and attach a + sign to any one of the bases. , A base / number conversion unit that attaches a minus sign to the other base and replaces each of the bases with a number;
In the genome sequence, a base is shifted from the both ends in the analysis range one by one in the center direction, and n bases from each end in the analysis range are combined corresponding to the positions from both ends. a base pair generator for generating n base pairs;
N number of first addition units that independently add numbers corresponding to the bases of the base pairs output from the base pair generation unit, and outputs of the first addition units. the absolute value absolute value unit for calculating a, a palindromic calculation unit and a second adder for adding the absolute values of the addition result in the analysis range possess,
A palindrome structure detection system in a genome sequence , wherein when the second adder outputs 0, it is a detection signal that detects the presence of a palindrome structure with the n base pairs .

The palindrome structure detection system in the genome sequence according to claim 1, wherein a plurality of palindromic operation units are provided for each analysis range having different numbers of bases.

The palindromic structure detection system for genome sequences according to claim 1 or 2, wherein the analysis range in the genome sequence is sequentially shifted by one base at a time, and palindromic structure detection processing is performed for each shift.

The palindromic operation unit includes the first addition unit and the second addition unit that detect a palindrome structure with an even number of non-addition bases sandwiched between n bases from both ends within the analysis range. An even palindrome operation unit,
An odd palindromic operation unit having the first addition unit and the second addition unit for detecting a palindromic structure with an odd number of non-addition bases sandwiched between n bases from both ends within the analysis range; The palindrome structure detection system in the genome sequence according to any one of claims 1 to 3 , wherein

The number of non-addition bases sandwiched by n bases from both ends is overlapped with the detection result of the palindrome calculation unit for each analysis range increased in base pair units, and the number of bases forming a pair in the palindrome structure is calculated. The palindrome structure detection system in the genome sequence according to any one of claims 3 to 4, wherein the palindrome structure detection system is obtained.

A storage unit for storing detection results obtained by shifting the genome sequence by one base for each palindromic operation unit in the analysis range in which the number of non-addition bases sandwiched between n bases from both ends is increased in units of base pairs Have
At the genomic sequence, is detected palindrome, and center overlay the same of the analysis range, any one of the preceding claims 2, characterized in that the output as detected palindromic The palindromic structure detection system in the genome sequence described in 1.

Each of the even palindromic operation unit and the odd palindromic operation unit is composed of n first adders having two inputs, an absolute value unit provided for each adder, and a second adder having n inputs. The palindrome structure detection system in the genome sequence according to claim 4, wherein the palindrome structure is detected.

A database in which restriction enzymes are stored corresponding to the base sequences constituting the palindrome structure;
When the palindromic is detected, any one of claim 1, further comprising a retrieval unit for retrieving the restriction enzyme which corresponds to the base sequence of said detected palindrome claim 7 A palindromic structure detection system for genome sequences according to one item .