JP2022518304A

JP2022518304A - Flounder disease-resistant breeding gene chip and its applications

Info

Publication number: JP2022518304A
Application number: JP2020556756A
Authority: JP
Inventors: 松林陳; 茜周; 昇盧; 亜東陳; 洋劉; 文騰徐; 仰真李; 磊王; 英明楊; 娜王; 希紅李
Original assignee: 中国水産科学研究院黄海水産研究所
Priority date: 2019-12-17
Filing date: 2019-12-17
Publication date: 2022-03-15
Anticipated expiration: 2039-12-17
Also published as: JP7158496B2; WO2021119980A1; CN111278994B; CN111278994A

Abstract

本発明はヒラメの耐病性優良品種の選別に用いられる遺伝子チップを提供し、魚類優良品種の育成において遺伝子チップが欠乏するという問題を解決し、従来の育種技術の欠点を補完し、魚類の耐病性と高生産性を兼ね備える優れた優良品種育成のために、新たな分子育種方法を提供し、魚類育種技術の世代交代を実現させ、魚類育種産業の高速な発展を推進することを目的とする。本発明が提供するラメ耐病性に関連するＳＮＰローカスの遺伝子チップはヒラメ耐病性個体の選別に利用可能であり、かつ実際の選択正確性は理論値に近いため、ヒラメ耐病性優良品種の選択正確性を高め、育種期間を短縮させ、これにより、ヒラメ耐病性優良品種の選別のために遺伝子チップ技術を提供し、魚類耐病性優良品種の選別のための遺伝子チップ育種の新しい道を切り開くことができる。The present invention provides a gene chip used for selecting excellent disease-resistant varieties of flatfish, solves the problem of lack of gene chips in breeding excellent fish varieties, complements the shortcomings of conventional breeding techniques, and makes fish disease-resistant. The purpose is to provide a new molecular breeding method, realize a generational change in fish breeding technology, and promote the rapid development of the fish breeding industry in order to cultivate excellent excellent varieties that have both sex and high productivity. .. The gene chip of SNP locus related to flounder disease resistance provided by the present invention can be used for selection of flounder disease resistant individuals, and the actual selection accuracy is close to the theoretical value. By increasing sex and shortening the breeding period, it is possible to provide gene chip technology for the selection of excellent flounder disease resistant varieties and open up a new path for gene chip breeding for the selection of excellent fish disease resistant varieties. can.

Description

本発明は水産遺伝育種の技術分野に属し、具体的にヒラメ耐病性優良品種選別に用いられる遺伝子チップの製造方法および応用に関する。 The present invention belongs to the technical field of fishery genetic breeding, and specifically relates to a method and application for producing a gene chip used for selection of excellent flounder disease-resistant varieties.

水産養殖業は中国食品の重要なソースであり、魚類養殖業はまた水産養殖業の基幹産業でもあり、２０１５年に魚類養殖の生産量は２８４５７万トンで、水産養殖生産量全体の５７．６％である。養殖魚類はすでに中国タンパク質の重要な由来となった。 Aquaculture is an important source of Chinese food, and fish farming is also a key industry of fish farming. In 2015, fish farming produced 284.57 million tons, 57.6 of the total fish farming production. %. Farmed fish have already become an important source of Chinese protein.

しかし、魚類養殖業の急速な発展につれて、優良な品種が欠乏し、養殖種類の品質が劣化する。養殖規模が拡大し、集約化水準の向上および養殖環境の悪化は水産養殖病害の頻繁な発生を引き起こし、養殖製品の薬物残留の深刻化等問題も魚類養殖業の持続可能な発展を深刻に制約する。魚類のみにとって、高密度な養殖で形成した免疫抑制のため、養殖魚類の耐病性の低下を引き起こす。魚類の免疫耐病性メカニズムおよび耐病性の分子遺伝に対する研究はいまだに進まないため、分子レベルで魚類病害の予防案を提出しにくい。さらに、耐病性機能遺伝子と耐病性分子マーカーが欠乏し、耐病性優良品種の育成を行いにくく、そのため現在養殖生産は耐病性が低下する野生または人工繁殖した多世代の種苗のみに依存し、流行病が魚類養殖における頻繁な発生を引き起こす。不完全な統計によると、中国における魚類養殖業では毎年病害による直接的経済損失は１００億人民元にも達する。病害はすでに中国における魚類養殖業の持続可能な発展を制約するボトルネックとなった。 However, with the rapid development of the fish farming industry, there is a shortage of good varieties and the quality of the farmed varieties deteriorates. The scale of aquaculture has expanded, the level of aquaculture has improved, and the aquaculture environment has deteriorated, causing frequent outbreaks of aquaculture diseases. do. For fish only, the immunosuppression formed by high-density farming causes a decrease in disease resistance of farmed fish. Since research on the immune disease resistance mechanism of fish and the molecular genetics of disease resistance has not progressed yet, it is difficult to submit a preventive plan for fish diseases at the molecular level. In addition, disease-resistant functional genes and disease-resistant molecular markers are deficient, making it difficult to breed excellent disease-resistant varieties, and therefore aquaculture production currently depends only on wild or artificially propagated multi-generational seedlings with reduced disease resistance and is endemic. The disease causes frequent outbreaks in fish farming. According to incomplete statistics, the direct economic loss due to disease in the fish farming industry in China reaches RMB 10 billion every year. Diseases have already become a bottleneck that constrains the sustainable development of fish farming in China.

ヒラメは世界的な海水養殖魚類で、中国における海水養殖の主導的な魚類の一つでもある。但し、ヒラメ養殖業においても病害が頻発し、死亡率が高いという問題が存在する。ヒラメ等養殖魚類を脅かす主要な病害は細菌性疾患・ウイルス性疾患を含む。その中危害が大きい疾患はそれぞれエドワジエラ症・ビブリオ症とリンホシスチス病が挙げられる。抗生物質系薬物またはワクチン等疾患予防措置は一定の効果があるが、水産養殖における病害問題を根本的に解決できない。かつ抗生物質系薬物に魚体内に蓄積しやすく、養殖魚類の商品品質を低下させ、消費者の健康に潜在的な危害を有し、病原菌に薬剤耐性および養殖環境を深刻に汚染するという問題を引き起こしやすいため、水産養殖業における応用はますます制約される。同時に、抗生物質の使用はまた人々日増しに伸びる無残留の無公害水産品の需要を満たすことができない。そのため、魚類耐病性優良品種の選別は中国の水産領域において解決が急がれる重大な課題の一つである。 Flounder is a world-class seawater-cultured fish and is one of the leading fish in seawater-cultured fish in China. However, there is a problem that diseases frequently occur in the flatfish farming industry and the mortality rate is high. Major diseases that threaten farmed fish such as flatfish include bacterial and viral diseases. Among them, the most harmful diseases are edwardsiellosis, vibrio disease and lymphocystis disease, respectively. Although disease preventive measures such as antibiotic drugs or vaccines have certain effects, they cannot fundamentally solve the disease problems in aquaculture. In addition, antibiotic-based drugs tend to accumulate in the body of fish, degrading the quality of farmed fish products, causing potential harm to consumer health, and causing serious drug resistance to pathogens and seriously contaminating the farming environment. Its propensity to cause is increasingly constraining its application in the aquaculture industry. At the same time, the use of antibiotics also fails to meet the ever-growing demand for non-residual, pollution-free marine products. Therefore, selection of excellent fish disease-resistant varieties is one of the urgent issues to be solved in the fishery field of China.

今まで、魚類優良品種の選別は主に表現型性状の選別に基づき、グループ選別・家系選別・交雑選別とＢＬＵＰ選別等を含み、主に体長・体重等測定しやすい表現型値に基づいて算出される育種値に基づいて選択を行う。分子マーカーができた後、重要な経済性状に関連する分子マーカーを特定することにより経済性状に選択を行い、従来の分子マーカー補助選別に用いられる分子マーカーの数量が非常に限られ、単一遺伝子性状または品質性状の選択効果に優れたが、多重遺伝子が決定される数量性状に対する選択効果がかんばしくない。耐病性性状は複数の遺伝子が制御される数量性状であり、直接測定しにくく、選択正確性が非常に低く、そのため耐病性優良品種に対する選別は長い間伸び悩み、魚類耐病性新品種の育成を制約し、新たしい育種技術でこの課題を解決する必要が急がれる。 Until now, the selection of excellent fish varieties has been calculated mainly based on phenotypic properties, including group selection, family line selection, crossbreeding selection, BLUP selection, etc., and mainly based on phenotypic values that are easy to measure such as body length and weight. Make selections based on the breeding values that are given. After the molecular marker is created, the selection is made economically by identifying the molecular marker related to the important economic property, and the number of molecular markers used in the conventional molecular marker auxiliary selection is very limited, and a single gene is used. The selection effect of the property or quality property is excellent, but the selection effect on the quantity property in which the multiplex gene is determined is not good. Disease resistance is a quantitative property in which multiple genes are controlled, it is difficult to measure directly, and selection accuracy is very low. Therefore, selection for excellent disease resistance varieties has been sluggish for a long time, limiting the breeding of new fish disease resistance varieties. However, there is an urgent need to solve this problem with new breeding technologies.

遺伝子チップは、ＤＮＡチップ・ＤＮＡマイクロアレイともいい、フォトエッチング技術を用い、シリコンウェハを固相担体とし、選択最適化を経た大量のＤＮＡ配列をオリゴヌクレオチドに合成し、特殊な処理を経たスライドにつけ、変性・固定を経た後にＤＮＡマイクロアレイを形成する。核酸分子交雑技術に基づき、遺伝子チップは数万ひいては数十万のＤＮＡフラグメントに並行化交雑と解析を同時に実行でき、高スループット・並行性・高効率・サンプルの数量が少ないという利点を有する。現在、遺伝子チップはすでに人類の疾患・腫瘍の診断と、動植物の遺伝学的解析および遺伝育種に広範に応用される。動物育種において、遺伝子チップはすでに牧畜、特に乳牛・豚等種の優良品種の選別に応用されたことに成功した。例えば、乳牛において、すでにＢｏｖｉｎｅ３Ｋｃｈｉｐ・Ｂｏｖｉｎｅ２５Ｋ・ＳＮＰｃｈｉｐ・ＢｏｖｉｎｅＨＤ７００Ｋ・ＢｏｖｉｎＬＤ７Ｋ等多くの遺伝子チップが相次いで開発される。今まで、北米・欧州・オーストラリア等はＢｏｖｉｎｅＳＮＰ５０Ｂｅａｄｃｈｉｐチップ（５４ＫＳＮＰ）を用いてゲノムＳＮＰマーカー分類検測の汎用プラットフォームとして、かつ大規模参考グループが取得したＳＮＰ分類結果に基づき、産乳量・繁殖力・耐病性等多種の経済性状のフルゲノム関連解析を行い、乳牛ゲノムの選択体系を構築する。ゲノム選択を通して新生雄牛の初期選択を実現させ、乳牛育種の世代間隔を短縮させ、遺伝的進歩を促進させ、種雄牛の選択効率を大いに向上させ、養殖と育種コストを顕著に節約する。 Gene chips, also called DNA chips and DNA microarrays, use photoetching technology to synthesize a large amount of DNA sequences that have undergone selective optimization into oligonucleotides using a silicon wafer as a solid phase carrier, and attach them to slides that have undergone special treatment. After undergoing denaturation and fixation, a DNA microarray is formed. Based on the nucleic acid molecular crossing technique, the gene chip can simultaneously perform parallel crossing and analysis on tens of thousands or even hundreds of thousands of DNA fragments, and has the advantages of high throughput, concurrency, high efficiency, and a small number of samples. Currently, gene chips are already widely applied to the diagnosis of human diseases and tumors, genetic analysis of animals and plants, and genetic breeding. In animal breeding, gene chips have already been successfully applied to livestock farming, especially to select excellent breeds such as dairy cows and pigs. For example, in dairy cows, many gene chips such as Bovine3Kchip, Bovine25K, SNPchip, BovineHD700K, and BovinLD7K have already been developed one after another. Until now, North America, Europe, Australia, etc. have used the Bovine SNP50Bedchip chip (54KSNP) as a general-purpose platform for genomic SNP marker classification and inspection, and based on the SNP classification results acquired by a large-scale reference group, milk production, fertility, etc. Perform full genome-wide association studies of various economic properties such as disease resistance, and construct a selection system for dairy cow genomes. Through genome selection, early selection of newborn bulls is achieved, intergenerational intervals of dairy cow breeding are shortened, genetic progress is promoted, breeding bull selection efficiency is greatly improved, and aquaculture and breeding costs are significantly saved.

但し、水産養殖動物に、今まで育種用遺伝子チップ、特に耐病性育種遺伝子チップの報道がいまだに見られない。 However, there have been no reports of breeding gene chips, especially disease-resistant breeding gene chips, in aquaculture animals.

本発明はヒラメの耐病性優良品種の選別に用いられる遺伝子チップを提供し、魚類優良品種の育成において遺伝子チップが欠乏するという問題を解決し、従来育種技術の欠点を補完し、魚類の耐病性と高生産性を兼ね備える優れた優良品種育成のために、新たな分子育種方法を提供し、魚類育種技術の世代交代を実現させ、魚類育種産業の高速な発展を推進することを目的とする。 The present invention provides a gene chip used for selecting excellent disease-resistant varieties of flatfish, solves the problem of lack of gene chips in breeding excellent fish varieties, complements the shortcomings of conventional breeding techniques, and provides fish disease resistance. The purpose is to provide a new molecular breeding method, realize a generational change in fish breeding technology, and promote the rapid development of the fish breeding industry in order to cultivate excellent excellent varieties that have both high productivity and high productivity.

本発明はまずヒラメ耐病性に関連するＳＮＰローカスを提供し、前記ＳＮＰローカスは配列がＳＥＱＮＯ：１ＳＥＱＩＤＮＯ：４８６９７である中のいずれか一つの配列の第３６位の塩基である。 The present invention first provides an SNP locus related to flounder disease resistance, and the SNP locus is the base at position 36 of any one of the sequences SEQNO: 1SEQIDNO: 48697.

本発明のＳＮＰローカスはヒラメ耐病性優良品種の選別に利用できる。 The SNP locus of the present invention can be used for selecting excellent varieties of flounder disease resistance.

本発明が提供されるＳＮＰローカスはヒラメ耐病性優良品種の選別用の検測製品の製造にも用いられることができる。 The SNP locus provided by the present invention can also be used in the production of a inspection product for selecting excellent varieties of flounder disease resistance.

前記検測製品は、好ましくは遺伝子チップである。 The inspection product is preferably a gene chip.

本発明のさらなる方面は、ヒラメ耐病性優良品種の選別に用いられる遺伝子チップを提供し、それはヒラメ耐病性に関連するＳＮＰローカスを検測することができる。 A further aspect of the present invention provides a genetic chip used for selection of flounder disease resistant excellent varieties, which can detect SNP locus associated with flounder disease resistance.

本発明のさらなる方面はヒラメ耐病性個体の選別方法を提供し、この方法は、上記の遺伝子チップを用いて実行することができる。 Further aspects of the present invention provide a method for selecting flounder disease resistant individuals, which can be performed using the gene chips described above.

前記方法は、以下のステップを含む：
１）候補グループにおける個体ゲノムＤＮＡを抽出し、かつ上記の遺伝子チップを利用して検測してＳＮＰマーカーの遺伝子型判定の結果を取得する。
２）参考グループにおけるＳＮＰグループより遺伝子チップと同様であるＳＮＰローカスの遺伝子型判定の結果を抽出し、さらに参考グループのＳＮＰの遺伝子型判定の結果と候補グループからチップを利用して取得した遺伝子型判定の結果とを合併する。
３）合併された遺伝子型と参考グループが表現する型とを利用し、加重ＧＢＬＵＰ方法を用いて候補グループの推定育種値（ＧＥＢＶ）を推定し、さらにＧＥＢＶ値に基づいて被検測個体の耐病性潜在能力を決定する。 The method comprises the following steps:
1) The individual genomic DNA in the candidate group is extracted, and the above gene chip is used for inspection to obtain the result of genotyping of the SNP marker.
2) The result of genotyping of SNP locus similar to the gene chip was extracted from the SNP group in the reference group, and the result of genotyping of SNP in the reference group and the genotype obtained from the candidate group using the chip. Combine with the judgment result.
3) Using the combined genotype and the type expressed by the reference group, the estimated breeding value (GEBV) of the candidate group is estimated using the weighted GBLUP method, and the disease resistance of the test-tested individual is further estimated based on the GEBV value. Determine sexual potential.

参考グループの遺伝子型を利用し、加重最良線形不偏推定量（加重ＧＢＬＵＰ）を用いて予測正確性を推定する。そのうち、５倍交差検証方法を予測正確性の判定方法に利用し、特徴曲線下面積（ＡＵＣ）を予測正確性の判定指標とある。ＡＵＣは１に近いほど、予測正確性は高い。 The genotype of the reference group is used to estimate the prediction accuracy using the weighted best linear biased estimator (weighted GBLUP). Among them, the 5-fold cross-validation method is used as the prediction accuracy determination method, and the area under the feature curve (AUC) is used as the prediction accuracy determination index. The closer the AUC is to 1, the higher the prediction accuracy.

本発明が提供されるヒラメ耐病性に関連するＳＮＰローカスの遺伝子チップはヒラメ耐病性個体の選別に利用することが可能であり、かつ実際の選択正確性が理論値に近いため、ヒラメ耐病性優良品種の選択正確性が向上し、育種期間を短縮することができる。これにより、ヒラメ耐病性優良品種の選別のために遺伝子チップ技術を提供し、魚類耐病性に優れた品種を選別するための、遺伝子チップによる育種という新しい道を切り開いた。 The gene chip of SNP locus related to flounder disease resistance provided by the present invention can be used for selection of flounder disease resistant individuals, and the actual selection accuracy is close to the theoretical value, so that the flounder disease resistance is excellent. The accuracy of selecting varieties can be improved and the breeding period can be shortened. This provided gene chip technology for the selection of excellent flounder disease-resistant varieties, and opened up a new path of breeding with gene chips for selecting fish disease-resistant varieties.

本発明はヒラメ耐病性優良品種育成の遺伝子チップの製造と応用方法を確立し、ヒラメ等魚類の耐病性優良品種の育成のために新たな分子育種の技術手段を提供することを目的とする。 An object of the present invention is to establish a method for producing and applying a gene chip for breeding excellent disease-resistant varieties of flatfish, and to provide a new technical means for molecular breeding for breeding excellent disease-resistant varieties of fish such as flatfish.

次に本発明が関する専門用語に対する説明は以下のとおりである：
ＳＮＰ：ＳｉｎｇｌｅＮｕｃｌｅｏｔｉｄｅＰｏｌｙｍｏｒｐｈｉｓｍの略語で、すなわち一塩基多型で、ゲノムレベルにおいてヌクレオチド単体の変異により引き起こされるＤＮＡ配列の多型である。 Next, the explanation for the technical terms related to the present invention is as follows:
SNP: An abbreviation for Single Nucleotide Polymorphism, a single nucleotide polymorphism, a polymorphism in a DNA sequence caused by mutation of a single nucleotide at the genomic level.

遺伝子チップ：微細加工技術を通し、数万乃至百万特定のＤＮＡ配列フラグメントを、シリコンウエハー・スライド等支持物に規則的に配置して固定し、構成される二次元ＤＮＡプローブアレーで、遺伝物質（ＤＮＡ等）に遺伝子分類および分子検測を行うことができる。 Gene chip: A two-dimensional DNA probe array composed of tens of thousands to millions of specific DNA sequence fragments that are regularly placed and fixed on a support such as a silicon wafer or slide through microfabrication technology. Gene classification and molecular inspection can be performed on (DNA, etc.).

縮退塩基：コドンの縮退性に基づき、一つの記号をよく用いて不特定の二つまたはそれ以上の塩基を代替する。 Nucleic acid notation: Based on the degenerate property of a codon, one symbol is often used to substitute two or more unspecified bases.

例えばＲはＡ／Ｇを示し、ＹはＣ／Ｔを示し、ＭはＡ／Ｃを示し、ＫはＧ／Ｔを示し、ＳはＧ／Ｃを示し、ＷはＡ／Ｔを示すなど。 For example, R indicates A / G, Y indicates C / T, M indicates A / C, K indicates G / T, S indicates G / C, W indicates A / T, and so on.

参考グループ：ゲノムの選択において、人為感染等検測を通して取得した表現型データを有するグループは、通常表現型性状を有する大型グループから選別されるグループ全体を代表する表現型分布で、かつゲノム再配列を行い、遺伝子型データを取得し、実際のゲノム選択計算を行う個体の集合である。 Reference group: In the selection of the genome, the group having the phenotypic data obtained through the inspection such as anthropogenic infection is the phenotypic distribution representing the whole group selected from the large group having the normal phenotypic properties, and the genome rearrangement. Is a set of individuals who perform the above, acquire genotype data, and perform actual genome selection calculation.

候補グループ：ゲノムの選択において、候補グループとはゲノム再配列を通し、遺伝子型データを取得したが、表現型データがないグループで、該グループは育種の潜在力を有し、後続の実際優良品種育成作業に用いる予定がある個体の集合である。 Candidate group: In the selection of the genome, the candidate group is a group in which genotype data is acquired through genome rearrangement but no phenotypic data, and the group has breeding potential and is a subsequent excellent cultivar. It is a set of individuals that are planned to be used for breeding work.

ＧＢＬＵＰ：ＧｅｎｏｍｉｃＢｅｓｔＬｉｎｅａｒＵｎｂｉａｓｅｄＰｒｅｄｉｃｔｉｏｎの略語で、すなわちゲノム最良線形不偏推定量で、ゲノムにおける高密度な分子マーカーを用いて個体間の成因関係（Ｇマトリックス）を利用し、ゲノム育種値の推定する方法である。 GBLUP: An abbreviation for GenomicBestLinearUnbiasedPrescription, that is, a method for estimating a genomic breeding value by using a genetic relationship (G matrix) between individuals using a high-density molecular marker in the genome with a genomic best linear biased estimator.

ＧＥＢＶ：ＧｅｎｏｍｉｃＥｓｔｉｍａｔｅｄＢｒｅｅｄｉｎｇＶａｌｕｅｓの略語で、すなわちゲノム推定育種価で、フルゲノムにおけるすべてのマーカーまたはハプロタイプの効果推定を加算して取得する。 GEBV: An abbreviation for Genomic Estimated Breeding Values, ie, the genomic estimated breeding value, obtained by adding up the effect estimates of all markers or haplotypes in the full genome.

次に実施例を組み合わせて本発明に詳細な記述を行う。 Next, a detailed description will be given to the present invention by combining examples.

実施例１：「魚チップ１号」遺伝子チップＳＮＰローカスの選別およびチップ製造
１、ヒラメ耐エドワージエラタルダ参考グループの作成および表現型性状の測定
ヒラメゲノムが選定される参考グループと候補グループの個体はいずれも本課題チームが２００３年以来作成したヒラメ家系より由来され、多年の育成過程において、次第に韓国・日本および中国のヒラメグループの急速な成長、耐病性や耐逆性等優良な性状より由来される。 Example 1: Selection of "fish chip No. 1" gene chip SNP locus and chip production 1. Creation of a reference group for flounder-resistant Edwardie latalda and measurement of phenotypic properties Individuals in the reference group and candidate group from which the flounder genome is selected Are all derived from the flounder family created by this task team since 2003, and gradually derived from the excellent properties such as rapid growth, disease resistance and reversal resistance of the flounder groups in Korea, Japan and China during the many years of breeding process. Will be done.

特に２０１３年から、ヒラメ養殖業におけるエドワージエラタルダが日増しに深刻化する情勢に対し、ヒラメ耐エドワージエラタルダ家系選別の研究を実施する。 In particular, from 2013, we will carry out research on the selection of flounder-resistant Edwardie latarda families in response to the increasingly serious situation in the flounder farming industry.

２０１３－２０１５年、当年に作成したヒラメ家系に腹腔接種人為感染エドワージエラタルダ検測を連続的に実施し、感染した検測魚苗に鰭棘を収集し、生長と耐病性表現型を測定し、２０１３年・２０１４年と２０１５年にサンプル４５７７匹・５９４２匹と６９１９匹を採取し、ヒラメエドワージエラタルダゲノムが選択される参考グループを選択して作成するために用いられるもの。 In 2013-2015, the flounder family created in the same year was continuously infused with peritoneal infection, and Edwardie eratalda was continuously examined. , 2013, 2014 and 2015, samples of 4757, 5942 and 6919 were collected and used to select and create a reference group from which the flounder Edwardia talda genome is selected.

感染検測のサンプルより、９６家系（２０１３年に３２、２０１４年に１０、２０１５年に４８）を選定し、各家系は死亡率に従って等比率の死亡と生存した個体１０－１５を選定し、ゲノムが選択される参考グループを組成し、選定した個体の感染検測の結果（死亡または生存）を参考グループの表現型性状（表２）とする。

From the infection detection sample, 96 families (32 in 2013, 10 in 2014, 48 in 2015) were selected, and each family selected equal proportions of mortality and surviving individuals 10-15 according to the mortality rate. , A reference group in which the genome is selected is formed, and the result of infection detection (death or survival) of the selected individual is used as the phenotypic property of the reference group (Table 2).

２、ヒラメフルゲノム再配列およびＳＮＰローカス評価
ヒラメ参考グループはＤＮＡ抽出・検測を経た後、利用可能な個体を計９３１有する（表３）。

2. Flounder full genome rearrangement and SNP locus evaluation The flounder reference group has a total of 931 individuals that can be used after DNA extraction and inspection (Table 3).

９３１の参考グループの個体のゲノムＤＮＡを抽出し、ＤＮＡ検測が合格した後、次世代配列ライブラリを作成し、ライブラリ作成類型は両端ＤＮＡライブラリ（挿入フラグメント３５０ｂｐ）で、ＩｌｌｕｍｉｎａＨｉｓｅｑＸ１０配列プラットフォームを用いて配列とデータエクスポートを完成し、品質管理で取得された平均的なデータ量は２Ｇ／個体である。本課題チームが提供されるヒラメゲノム配列（ＧｅｎＢａｎｋＩＤ：ＰＲＪＮＡ７３６７３）を参考ゲノムとして、ＢＷＡ（ｈｔｔｐ：／／ｂｉｏ－ｂｗａ．ｓｏｕｒｃｅｆｏｒｇｅ．ｎｅｔ／）ソフトウェアを用いて配列比較を行い、その後Ｓａｍｔｏｏｌｓ（ｈｔｔｐ：／／ｗｗｗ．ｈｔｓｌｉｂ．ｏｒｇ／）ソフトウェアを用いてＳＮＰ予測と評価を行い、４２．２ＭのＳＮＰ集合を取得する。 After extracting the genomic DNA of the individual of the reference group of 931 and passing the DNA inspection, the next-generation sequence library is created, and the library creation type is the double-ended DNA library (insertion fragment 350 bp), which is sequenced using the IlluminaHiseqX10 sequence platform. And the data export is completed, and the average amount of data acquired by quality control is 2G / individual. Using the Hirame genome sequence (GenBankID: PRJNA73673) provided by this task team as a reference genome, sequence comparison was performed using BWA (http://bio-bwa.sourceforge.net/) software, and then Samtools (http: /). /Www.https.org/) SNP prediction and evaluation are performed using software, and an SNP set of 42.2M is obtained.

３、ＳＮＰローカス評価と選別
（１）ステップ２で取得した４２．２ＭのヒラメＳＮＰマーカーに対し、以下の規格に選別を行う：
見逃し率＞０．１、最小対立遺伝子頻度（ＭＡＦ）＜０．０５のサイトを削除し、繰り返し配列または反復配列におけるサイトを削除し、ハドウィンバランスに適合しないサイトを削除し、３．４ＭのヒラメＳＮＰマーカーを取得する。 3. SNP locus evaluation and selection (1) The 42.2M flatfish SNP marker acquired in step 2 is selected according to the following standards:
Sites with miss rate> 0.1, minimum allele frequency (MAF) <0.05 were removed, sites in repeats or repeats were removed, sites that did not fit the hadwin balance were removed, and 3.4 M. Obtain a flatfish SNP marker.

（２）ステップ（１）で選別した３．４Ｍ分子マーカーに対し、ＳＮＰローカス効果値解析と推定育種値計算を実施し、
ヒラメゲノム選択計算はＢａｙｅｓＣπアルゴリズムを用い、解析モデル等式：

モデルにおいて、ｙは表現型値で、ｕはグループ平均値で、グループｑｉはマーカー効果が正規分布ｑｉ～Ｎ（０、

）で、ｍはマーカーの総数で、Ｘはｑｉに対応する関連マトリックスで、ｅは残差である。 (2) For the 3.4M molecular markers selected in step (1), SNP locus effect value analysis and estimated breeding value calculation were performed.
The flatfish genome selection calculation uses the BayesCπ algorithm, and the analysis model equation:

In the model, y is a phenotypic value, u is a group mean value, and group qi has a normal distribution of marker effects qi to N (0,

), M is the total number of markers, X is the related matrix corresponding to qi, and e is the residual.

Ｒ言語パックが提供されるＢａｙｅｓＣπアルゴリズムを用い、組み合わせ済みの遺伝子型データｇｅｎｏｔｙｐｅ．ｃｓｖと表現型データｐｈｏｎｅｔｙｐｅ．ｃｓｖを組み合わせ、フルゲノム再配列の参考グループの計９３１のヒラメ個体にゲノム選択計算を行う。その後取得したＳＮＰローカス効果値を最大から最小への並べ替え、推定育種値＜１０－５のサイトを削除し、計８６４２２９のＳＮＰローカスを取得して遺伝子チップＳＮＰローカスの選定に用いられる。 Combined genotype data genotype. Using the BayesCπ algorithm provided by the R language pack. csv and phenotypic data phonetype. Combining csv, genome selection calculation is performed on a total of 931 flatfish individuals in the reference group for full genome rearrangement. After that, the acquired SNP locus effect values are rearranged from the maximum to the minimum, sites with estimated breeding values <10-5 are deleted, and a total of 864229 SNP locus are acquired and used for selection of gene chip SNP locus.

（３）さらにＡｆｆｙｍｅｔｒｉｘＡｘｉｏｍ遺伝子分類プローブ設計生物解析プロセスを用いてステップ（２）で選別したＳＮＰにそれぞれプローブ設計と評価を行い、プローブ変換可能性評価点数＜０．６のサイトを削除する。また、ＳＮＰがゲノム全体をカバーしかつ均等に分布され、ＳＮＰのフランキング配列３５ｂｐ内に他のＳＮＰが存在せず、ＳＮＰのフランキング配列３５ｂｐのＧＣ含有量が３０－７０％で、最終的に４８６９７のヒラメＳＮＰマーカーを選別してチップ製造に用いられることを保証し、前記４８６９７のＳＮＰ分子マーカーの配列記録は配列リストにある。
米国ＴｈｅｒｍｏＦｉｓｈｅｒ社製ＡｆｆｙｍｅｔｒｉｘＡｘｉｏｍチップの生産技術を用いてヒラメＳＮＰチップ（遺伝子チップ）を製造し、延べ４８６９７のヒラメＳＮＰローカスを含み、各チップは２４のサンプルを同時に検測することができる。 (3) Further, probe design and evaluation are performed on each SNP selected in step (2) using the AffymetrixAxiom gene classification probe design biological analysis process, and sites having a probe conversion possibility evaluation score <0.6 are deleted. In addition, the SNP covers the entire genome and is evenly distributed, there are no other SNPs in the SNP flanking sequence 35bp, and the GC content of the SNP flanking sequence 35bp is 30-70%, which is the final. The 48697 SNP molecular markers are selected and guaranteed to be used in chip production, and the sequence record of the 48697 SNP molecular markers is in the sequence list.
A flatfish SNP chip (gene chip) is produced using the production technology of the AffymetrixAxiom chip manufactured by Thermo Fisher, USA, and contains a total of 48697 flatfish SNP locus, and each chip can simultaneously inspect 24 samples.

実施例２、「魚チップ１号」遺伝子チップの使用方法
１、チップ検測用サンプルの製造と交雑
少量のヒラメ鰭棘（米の粒ほどの大きさ）を採取し、ＤＮＡ抽出キット（中国・天根）を利用して鰭棘ゲノムＤＮＡを抽出し、１％のアガロースゲル電気泳動と核酸分光光度計を利用してＤＮＡ品質と濃度を検測し、最終的な合格サンプルの規格は以下のとおりである：電気泳動はＤＮＡに単一のストリップが生じることを求め、フラグメント長さが１０ｋｂより大きく、完全性に優れ、生分解は生じず、サンプル品質のＤＮＡ検測結果。紫外分光光度計を用いてＡ２６０／２８０を検測する：１．８－２．０、Ａ２６０／２３０＞１．５、濃度が２０ｎｇ／μｌより低くなく、総量が４μｇより小さくない。その後米国ＴｈｅｒｍｏＦｉｓｈｅｒ社製ＳＮＰチップ検測サンプルの製造標準操作プロセスに従って（ｈｔｔｐｓ：／／ｗｗｗ．ｔｈｅｒｍｏｆｉｓｈｅｒ．ｃｏｍ／）チップ検測用サンプルの製造を行う。１．４ｕｇ以上の高品質なＤＮＡテンプレートを２ｍｌ＊９６の深穴板に加え、変性剤を加えて常温変性を行い、変性１０ｍｉｎ後に変性を停止し、単鎖ＤＮＡを取得する。チップサイト増幅用４８６９７ペアプライマーと等温増幅酵素・ｄＮＴＰ等を深穴板に加え、深穴板を封止し、３７℃で等温増幅を行う。２４ｈ増幅した後、増幅産物を断片化し、等体積のイソプロパノールを加え、－２０℃の冷蔵庫において沈殿する。２４ｈ沈殿した後、４℃、３０００ｇを用いて遠心してＤＮＡ産物の沈殿物を取得し、３７℃で残りのイソプロパノールを除去し、沈殿物を溶解させ、交雑液を取得し、交雑液は５％のゲル電気泳動の検測結果の質量を用い、増幅産物品質の検測結果は、ストリップが明晰で、輝度が高い。交雑液は温度制御増幅装置を用いて交雑を行い、条件が９５℃１０ｍｉｎ、４８℃３ｍｉｎで、その後交雑液を持続的に４８℃に維持する。チップブロックを交雑液に浸漬し、４８℃の交雑炉において２４ｈ交雑させ、その後溶出、蛍光タンパク質を接続し、蛍光タンパク質を固定させ、交雑プローブ、蛍光信号をスキャンすることにより、各信号点は一つのプローブ交雑の結果で、各サイトの交雑結果を取得した後、チップスキャンの結果はＡｘｉｏｍＡｎａｌｙｓｉｓＳｕｉｔｅ（ＡｘＡＳ）ソフトウェア（米国ＴｈｅｒｍｏＦｉｓｈｅｒ社製）を用いて解析を行う。 Example 2, How to use the "fish chip No. 1" gene chip 1. Production and crossing of a sample for chip inspection A small amount of fin spines (about the size of a grain of rice) was collected and a DNA extraction kit (China, China). The fin spine genomic DNA was extracted using Amane), and the DNA quality and concentration were measured using 1% agarose gel electrophoresis and a nucleic acid spectrophotometer. As is: Electrophoresis requires that a single strip be formed in the DNA, the fragment length is greater than 10 kb, the completeness is excellent, no biodegradation occurs, and the sample quality DNA test results. A260 / 280 is measured using an ultraviolet spectrophotometer: 1.8-2.0, A260 / 230> 1.5, the concentration is not less than 20 ng / μl and the total amount is not less than 4 μg. After that, a sample for chip inspection is manufactured according to a standard operation process for manufacturing an SNP chip inspection sample manufactured by Thermo Fisher, USA (https://www.thermovisher.com/). A high-quality DNA template of 1.4 ug or more is added to a 2 ml * 96 deep hole plate, denaturation is performed by adding a denaturant, and denaturation is stopped after 10 minutes of denaturation to obtain single-stranded DNA. A 48697 pair primer for chip site amplification and an isothermal amplification enzyme, dNTP, etc. are added to the deep hole plate, the deep hole plate is sealed, and isothermal amplification is performed at 37 ° C. After amplification for 24 hours, the amplified product is fragmented, equal volumes of isopropanol are added and precipitated in a refrigerator at −20 ° C. After precipitating for 24 hours, centrifuge at 4 ° C. and 3000 g to obtain a precipitate of DNA product, remove the remaining isopropanol at 37 ° C., dissolve the precipitate, obtain a hybrid solution, and the hybrid solution is 5%. Using the mass of the gel electrophoresis test result, the amplification product quality test result shows that the strip is clear and the brightness is high. The crossed liquid is crossed using a temperature control amplification device, and the conditions are 95 ° C. for 10 min and 48 ° C. for 3 min, and then the crossed liquid is continuously maintained at 48 ° C. By immersing the chip block in a crossing solution and crossing it in a crossing furnace at 48 ° C. for 24 hours, then elution, connecting the fluorescent protein, fixing the fluorescent protein, and scanning the crossing probe and fluorescent signal, each signal point becomes one. After acquiring the crossing results of each site from the results of the two probe crosses, the results of the chip scan are analyzed using the AxiomAnalysisSuit (AxAS) software (manufactured by Thermo Fisher, USA).

２、チップ検測と遺伝子分類データ解析
（１）被検測グループサンプルの採取とＤＮＡ抽出
再配列済みの一部ヒラメ個体ＤＮＡを選択し、前記チップを用いて検測を実施し、チップ遺伝子分類の正確性および再配列と遺伝子チップ分類を用いて取得した遺伝子型データにゲノム選択計算を実施する再現性に検測を行う。その後ヒラメ育種の選択過程に、家系が用いられる候補成体の個体を確立し、ゲノムＤＮＡの抽出を行いかつチップで検測を実施し、遺伝子チップがヒラメゲノム選択育種における応用効果を検証する。採用個体の情報は表４による。

2. Chip inspection and gene classification data analysis (1) Collection of test group sample and DNA extraction Select some rearranged individual fluffy DNA, perform inspection using the chip, and classify the chip gene. The accuracy and reproducibility of performing genome selection calculations on the genotype data obtained using the rearrangement and gene chip classification will be checked. After that, in the selection process of flatfish breeding, an individual of a candidate adult whose family line is used is established, genomic DNA is extracted and inspection is performed with a chip, and the applied effect of the gene chip in flatfish genome selective breeding is verified. Information on the adopted individuals is shown in Table 4.

（２）チップ検測
遺伝子チップ検測の標準プロセスに従い、ＡｆｆｙｍｅｔｒｉｘＧｅｎｅＴｉｔａｎ遺伝子チップ処理システムを用いてプローブ交雑・染色とチップスキャンを完成する。具体的な操作方法は以下のとおりである：４μｇの高品質なＤＮＡテンプレートを２ｍｌ＊９６の深穴板に加え、変性剤を加えて変性（２８℃）を行い、変性１０ｍｉｎ後に変性停止液（反応時間が１０ｍｉｎより長くない）変性を停止し、単鎖ＤＮＡを取得する。チップサイト増幅用４８６９７ペアプライマーと等温増幅酵素・ｄＮＴＰと反応液等を深穴板に加え、深穴板を封止し、３７℃で等温増幅を２２－３６ｈ行う。好ましくは２４ｈ増幅した後、高温６５℃で２０－３０ｍｉｎで反応液を不活性化させ、その後３７℃の培養箱に移転して４０ｍｉｎ培養し、断片化酵素と反応液［４１｝を加え、断片化増幅産物を断片化し、既存の反応液と等体積のイソプロパノールを加え、反応液が澄むまで反応液を均一に混合させ、その後－２０℃の冷蔵庫において産物を沈殿させる。２４ｈ沈殿した後、４℃、３、０００ｇで４０－６０ｍｉｎ遠心してＤＮＡ産物の沈殿物を取得し、上澄液を除去し、沈殿物を保留し、３７℃で残りのイソプロパノールを完全に除去し、沈殿物を溶解させ、交雑液を取得する。交雑液は温度制御増幅装置を用いて交雑を行い、条件が９５℃１０ｍｉｎ、４８℃３ｍｉｎで、その後交雑液を持続的に４８℃に維持する。チップブロックを交雑液に浸漬し、４８℃の交雑炉において２４ｈ交雑させる。その後溶出、蛍光タンパク質を接続し、蛍光タンパク質を固定させ、交雑プローブ、蛍光信号をスキャンすることにより、各サイトの交雑結果を取得し、チップスキャン結果はＡｘｉｏｍＡｎａｌｙｓｉｓＳｕｉｔｅ（ＡｘＡＳ）ソフトウェア（米国ＴｈｅｒｍｏＦｉｓｈｅｒ社製）を用いて解析を行う。 (2) Chip inspection According to the standard process of gene chip inspection, probe crossing / staining and chip scanning are completed using the Affymetrix GeneTitan gene chip processing system. The specific operation method is as follows: Add 4 μg of high-quality DNA template to a 2 ml * 96 deep hole plate, add a denaturing agent to perform denaturation (28 ° C.), and after 10 minutes of denaturation, a denaturation stop solution ( (Reaction time is not longer than 10 min) Stop denaturation and obtain single-stranded DNA. A 48697 pair primer for chip site amplification, an isothermal amplification enzyme, dNTP, a reaction solution, etc. are added to the deep hole plate, the deep hole plate is sealed, and isothermal amplification is performed at 37 ° C. for 22-36 hours. Preferably, after amplification for 24 hours, the reaction solution is inactivated at a high temperature of 65 ° C. for 20-30 min, then transferred to a culture box at 37 ° C. and cultured for 40 minutes, and the fragmentation enzyme and the reaction solution [41} are added to the fragment. The chemical amplification product is fragmented, the same volume of isopropanol as the existing reaction solution is added, the reaction solution is uniformly mixed until the reaction solution is clear, and then the product is precipitated in a refrigerator at −20 ° C. After precipitation for 24 hours, centrifuge at 4 ° C. at 3,000 g for 40-60 min to obtain a DNA product precipitate, remove the supernatant, retain the precipitate, and completely remove the remaining isopropanol at 37 ° C. , Dissolve the precipitate and obtain a hybrid. The crossed liquid is crossed using a temperature control amplification device, and the conditions are 95 ° C. for 10 min and 48 ° C. for 3 min, and then the crossed liquid is continuously maintained at 48 ° C. The chip block is immersed in a crossing solution and crossed in a crossing furnace at 48 ° C. for 24 hours. After that, by elution, connecting the fluorescent protein, fixing the fluorescent protein, scanning the crossing probe and the fluorescent signal, the crossing result of each site is obtained, and the chip scan result is the Axiom Analysis Suite (AxAS) software (manufactured by Thermo Fisher, USA). Is used for analysis.

（３）データ解析
ＡｘＡＳソフトウェア（米国ＴｈｅｒｍｏＦｉｓｈｅｒ社製）を利用してチップスキャンの結果を解析し、各サンプルの遺伝子分類結果を取得する。解析結果によると、チップの平均的な分類率が９８．７７％で、分類効果が抜群である。そのうち、高品質なＳＮＰ比率が７４．６１％で、各サンプルはいずれも高品質な分類情報を生成することができる。 (3) Data analysis The result of chip scan is analyzed using AxAS software (manufactured by Thermo Fisher, USA), and the gene classification result of each sample is obtained. According to the analysis results, the average classification rate of chips is 98.77%, and the classification effect is outstanding. Among them, the high-quality SNP ratio is 74.61%, and each sample can generate high-quality classification information.

実施例３「魚チップ１号」遺伝子チップがヒラメ耐病性育種における応用
１、「魚チップ１号」遺伝子チップ分類の効果検証
参考グループから一部の個体を選定して遺伝子チップ分類の信頼性を検証するために用いられるもの、これらの選定される個体は再配列の遺伝子型でもあり、「魚チップ１号」遺伝子チップ分類を用いて取得される遺伝子型でもある。発明者の既存のヒラメ参考グループから一部の個体遺伝子チップを選定して分類・統計を行う。チップを用いて取得した遺伝子型（０／１／２はＡＡ／Ａａ／ａａを示す）と再配列して取得した遺伝子型の一致性および再配列とチップデータを利用して推定したＧＥＢＶの関連係数を統計することによりチップ分類の効果を評価する。分類結果の一致性が８８％以上に達しかつＧＥＢＶの間の関連係数が０．９以上に達する場合、チップが優れた分類結果を有すると見なす。 Example 3 Application of "Fish Chip No. 1" Gene Chip to Disease-Resistant Breeding 1. Verification of Effect of "Fish Chip No. 1" Gene Chip Classification Select some individuals from the reference group to improve the reliability of gene chip classification. Used for validation, these selected individuals are also genotypes of rearrangement and genotypes obtained using the "Fish Chip No. 1" gene chip classification. Some individual gene chips are selected from the inventor's existing flounder reference group for classification and statistics. Consistency between genotypes obtained using chips (0/1/2 indicates AA / Aa / aa) and genotypes obtained by rearrangement, and the relationship between rearrangements and GEBV estimated using chip data Evaluate the effect of chip classification by statistic of coefficients. If the consistency of the classification results reaches 88% or higher and the correlation coefficient between GEBV reaches 0.9 or higher, the chip is considered to have excellent classification results.

解析結果によると、「魚チップ１号」遺伝子チップ分類を利用して取得したサイト情報の９０．０８％は再配列と同様で、２組のＧＥＢＶの間の関連係数は０．９５８である。そのため、本発明が開発されるヒラメ遺伝子チップの分類結果は再配列と基本的に一致し、ヒラメに正確な遺伝子分類を行うことができる。 According to the analysis results, 90.08% of the site information obtained using the "Fish Chip No. 1" gene chip classification is similar to the rearrangement, and the correlation coefficient between the two sets of GEBV is 0.958. Therefore, the classification result of the flounder gene chip for which the present invention is developed basically matches the rearrangement, and accurate gene classification can be performed on the flounder.

具体的な操作方法は以下のとおりである：
ＰＬＩＮＫソフトウェアを利用してチップデータを読み取り、サーバーにおいて以下のコマンドを入力して上記データに処理を行う：
ｐｌｉｎｋ－－ｖｃｆｏｐ２－１．ｖｃｆ－－ｍａｋｅ－ｂｅｄ－－ｏｕｔｏｐ＿Ｖａｌ＿１
ｐｌｉｎｋ－－ｖｃｆｃｓ２－２．ｖｃｆ－－ｍａｋｅ－ｂｅｄ－－ｏｕｔｏｐ＿Ｖａｌ＿２
ｐｌｉｎｋ－－ｖｃｆｏｐ２－３．ｖｃｆ－－ｍａｋｅ－ｂｅｄ－－ｏｕｔｏｐ＿Ｖａｌ＿３
ｐｌｉｎｋ－－ｖｃｆｏｐ２－４．ｖｃｆ－－ｍａｋｅ－ｂｅｄ－－ｏｕｔｏｐ＿Ｖａｌ＿４ The specific operation method is as follows:
Read the chip data using the LINK software and enter the following command on the server to process the above data:
pink －－vcf op2-1. vcf －－make-bed －－out op_Val_1
pink －－vcf cs2-2. vcf －－make-bed －－out op_Val_2
pink －－vcf op2-3. vcf －－make-bed －－out op_Val_3
pink －－vcf op2-4. vcf －－make-bed －－out op_Val_4

読み取りを経て、４つのｖｃｆにおける情報は表５による：

After reading, the information in the four vcfs is according to Table 5.

ａ）ＲにおいてＳＮＰを再命名しかつ４つのファイルにおける共通となるマーカー情報を抽出し、コマンドは以下のとおりである：
＃必要なＲパッケージをアンロードする
ｌｉｂｒａｒｙ（ｄａｔａ．ｔａｂｌｅ）
＃ｃｓ＿Ｖａｌ＿１とｃｓ＿Ｖａｌ＿２のサイト情報を読み取る
ｖａｌ＿１＜－ｆｒｅａｄ（“ｏｐ＿Ｖａｌ＿１．ｂｉｍ”，ｈｅａｄｅｒ＝Ｆ）
ｖａｌ＿２＜－ｆｒｅａｄ（“ｏｐ＿Ｖａｌ＿２．ｂｉｍ”，ｈｅａｄｅｒ＝Ｆ）
ｖａｌ＿３＜－ｆｒｅａｄ（“ｏｐ＿Ｖａｌ＿３．ｂｉｍ”，ｈｅａｄｅｒ＝Ｆ）
ｖａｌ＿４＜－ｆｒｅａｄ（“ｏｐ＿Ｖａｌ＿４．ｂｉｍ”，ｈｅａｄｅｒ＝Ｆ）
＃ＳＮＰ命名方式を統一しかつ再命名したファイルを出力する
ｖａｌ＿１＄Ｖ２＜－ｐａｓｔｅ（ｐａｓｔｅ（ｒｅｐ（“ｒｓ”，ｎｒｏｗ（ｖａｌ＿１）），ｖａｌ＿１＄Ｖ１，ｓｅｐ＝“ ”），ｖａｌ＿１＄Ｖ４，ｓｅｐ＝ “：”）
ｖａｌ＿２＄Ｖ２＜－ｐａｓｔｅ（ｐａｓｔｅ（ｒｅｐ（“ｒｓ”，ｎｒｏｗ（ｖａｌ＿２）），ｖａｌ＿２＄Ｖ１，ｓｅｐ＝ “ ”），ｖａｌ＿２＄Ｖ４，ｓｅｐ＝ “：”）
ｖａｌ＿３＄Ｖ２＜－ｐａｓｔｅ（ｐａｓｔｅ（ｒｅｐ（“ｒｓ”，ｎｒｏｗ（ｖａｌ＿３）），ｖａｌ＿３＄Ｖ１，ｓｅｐ＝“ ”），ｖａｌ＿３＄Ｖ４，ｓｅｐ＝ “：”）
ｖａｌ＿４＄Ｖ２＜－ｐａｓｔｅ（ｐａｓｔｅ（ｒｅｐ（“ｒｓ”，ｎｒｏｗ（ｖａｌ＿４）），ｖａｌ＿４＄Ｖ１，ｓｅｐ＝“ ”），ｖａｌ＿４＄Ｖ４，ｓｅｐ＝ “：”）
ｗｒｉｔｅ．ｔａｂｌｅ（ｖａｌ＿１， “ｏｐ＿Ｖａｌ＿１．ｂｉｍ”，ｓｅｐ＝ “￥ｔ”，ｃｏｌ．ｎａｍｅｓ＝Ｆ，ｒｏｗ．ｎａｍｅｓ＝Ｆ，ｑｕｏｔｅ＝Ｆ）
ｗｒｉｔｅ．ｔａｂｌｅ（ｖａｌ＿２， “ｏｐ＿Ｖａｌ＿２．ｂｉｍ”，ｓｅｐ＝ “￥ｔ”，ｃｏｌ．ｎａｍｅｓ＝Ｆ，ｒｏｗ．ｎａｍｅｓ＝Ｆ，ｑｕｏｔｅ＝Ｆ）
ｗｒｉｔｅ．ｔａｂｌｅ（ｖａｌ＿３， “ｏｐ＿Ｖａｌ＿３．ｂｉｍ”，ｓｅｐ＝ “￥ｔ”，ｃｏｌ．ｎａｍｅｓ＝Ｆ，ｒｏｗ．ｎａｍｅｓ＝Ｆ，ｑｕｏｔｅ＝Ｆ）
ｗｒｉｔｅ．ｔａｂｌｅ（ｖａｌ＿４， “ｏｐ＿Ｖａｌ＿４．ｂｉｍ”，ｓｅｐ＝ “￥ｔ”，ｃｏｌ．ｎａｍｅｓ＝Ｆ，ｒｏｗ．ｎａｍｅｓ＝Ｆ，ｑｕｏｔｅ＝Ｆ）
＃共通となるサイト情報を抽出しかつ出力する
ｃｏｍｍ＜－Ｒｅｄｕｃｅ（ｉｎｔｅｒｓｅｃｔ，ｌｉｓｔ（ａ＝ｖａｌ＿１＄Ｖ２，ｂ＝ｖａｌ＿２＄Ｖ２，ｃ＝ｖａｌ＿３＄Ｖ２，ｄ＝ｖａｌ＿４＄Ｖ２））
ｗｒｉｔｅ．ｔａｂｌｅ（ｃｏｍｍ， “ｃｏｍｍｏｎ＿ｓｎｐｓ．ｔｘｔ”，ｓｅｐ＝ “￥ｔ”，ｃｏｌ．ｎａｍｅｓ＝Ｆ，ｒｏｗ．ｎａｍｅｓ＝Ｆ，ｑｕｏｔｅ＝Ｆ） a) Rename the SNP in R and extract the common marker information in the four files, the command is:
# Unload the required R package library (data.table)
# Read the site information of cs_Val_1 and cs_Val_1 val_1 <-fread (“op_Val_1.bim”, header = F)
val_1 <-fread ("op_Val_2.bim", header = F)
val_3 <-fread (“op_Val_3.bim”, header = F)
val_4 <-fread ("op_Val_4.bim", header = F)
# Outputs a file with a unified SNP naming method and renamed. sep = “:”)
val_2 $ V2 <-paste (paste (rep ("rs", now (val_2)), val_2 $ V1, sep = ""), val_2 $ V4, sep = ":")
val_3 $ V2 <-paste (paste (rep ("rs", now (val_3)), val_3 $ V1, sep = ""), val_3 $ V4, sep = ":")
val_4 $ V2 <-paste (paste (rep ("rs", now (val_4)), val_4 $ V1, sep = ""), val_4 $ V4, sep = ":")
write. table (val_11, "op_Val_1.bim", sep = "\ t", col.names = F, low.names = F, quote = F)
write. table (val_2, "op_Val_2.bim", sep = "\ t", col.names = F, low.names = F, quote = F)
write. table (val_3, "op_Val_3.bim", sep = "\ t", col.names = F, low.names = F, quote = F)
write. table (val_4, "op_Val_4.bim", sep = "¥ t", col.names = F, low.names = F, quote = F)
# Extract and output common site information com <-Reduction (intersect, list (a = val_1 $ V2, b = val_2 $ V2, c = val_3 $ V2, d = val_4 $ V2))
write. table (com, “common_sns.txt”, sep = “¥ t”, coll.names = F, row.names = F, quote = F)

ｂ）ＰＬＩＮＫソフトウェアを利用して４つのファイルを組み合わせ、かつ共通となるマーカーを保留し、コマンドは以下のとおりである：
ｐｌｉｎｋ－－ｂｆｉｌｅｏｐ＿Ｖａｌ＿１－－ｍｅｒｇｅ－ｌｉｓｔｍｅｒｇｅ＿ｏｐ．ｔｘｔ－－ｅｘｔｒａｃｔｃｏｍｍｏｎ＿ｓｎｐｓ．ｔｘｔ－－ｒｅｃｏｄｅＡ－－ｏｕｔｏｐ＿ｃｈｉｐ
ファイル「ｍｅｒｇｅ＿ｏｐ．ｔｘｔ」に以下の情報が貯蔵されている：
ｏｐ＿Ｖａｌ＿２．ｂｅｄｏｐ＿Ｖａｌ＿２．ｂｉｍｏｐ＿Ｖａｌ＿２．ｆａｍ
ｏｐ＿Ｖａｌ＿３．ｂｅｄｏｐ＿Ｖａｌ＿３．ｂｉｍｏｐ＿Ｖａｌ＿３．ｆａｍ
ｏｐ＿Ｖａｌ＿４．ｂｅｄｏｐ＿Ｖａｌ＿４．ｂｉｍｏｐ＿Ｖａｌ＿４．ｆａｍ b) Combine the four files using the PLLINK software and reserve the common marker, the command is:
pink －－bfile op_Val_1 －－merge-list merge_op. pxt --- extract common_snps. pxt －－recode A －－out op_chip
The following information is stored in the file "merge_op.txt":
op_Val_2. bed op_Val_2. Bim op_Val_2. fam
op_Val_3. bed op_Val_3. Bim op_Val_3. fam
op_Val_4. bed op_Val_4. Bim op_Val_4. fam

ｃ）ＰＬＩＮＫソフトウェアを利用して参考グループから同様である個体とサイトを抽出し、上記の４つのファイルにおける．ｆａｍ情報をファイルに整理しかつ「ｏｐ＿ｃｈｉｐ＿ｉｎｄｉ．ｔｘｔ」と命名し、「…」はファイルカタログを表し、コマンドは以下のとおりである：
ｐｌｉｎｋ－－ｂｆｉｌｅ …／Ｖａｌ＿ｒｅｆ－－ｋｅｅｐｏｐ＿ｃｈｉｐ＿ｉｎｄｉ．ｔｘｔ－－ｅｘｔｒａｃｔｃｏｍｍｏｎ＿ｓｎｐｓ．ｔｘｔ－－ｒｅｃｏｄｅＡ－－ｏｕｔｏｐ＿ｒｓｅｑ
処理を経て、上記４つのファイルが共通となるマーカー数が１１、７１９で、参考グループにおいて検索可能な個体数が９５である。 c) Extract similar individuals and sites from the reference group using the PLLINK software, and in the above four files. Organize the fam information into a file and name it "op_chip_indi.txt", where "..." represents the file catalog and the command is:
pink --- bfile ... / Val_ref --- keep op_chip_indi. pxt --- extract common_snps. pxt －－recode A －－out op_rseq
After the processing, the number of markers common to the above four files is 11,719, and the number of individuals that can be searched in the reference group is 95.

ｄ）Ｒ統計チップと再配列分類の一致性を利用し、その方法は以下のとおりである：
＃必要なＲパッケージをアンロードする
ｌｉｂｒａｒｙ（ｄａｔａ．ｔａｂｌｅ）
＃それぞれチップと再配列が取得した分類情報を読み取る
ｃｈｉｐ＜－ｆｒｅａｄ（“ｏｐ＿ｃｈｉｐ．ｒａｗ”）
ｒｓｅｑ＜－ｆｒｅａｄ（“ｏｐ＿ｒｓｅｑ．ｒａｗ”）
＃ファイルにおける個体配列を統一する
ｆｉｄ＜－ｄａｔａ．ｆｒａｍｅ（ｒｓｅｑ＄ＦＩＤ）
ｃｏｌｎａｍｅｓ（ｆｉｄ）＜－ “ＦＩＤ”
ｃｈｉｐ＜－ｄａｔａ．ｔａｂｌｅ（ｍｅｒｇｅ（ｆｉｄ，ｃｈｉｐ，ｓｏｒｔ＝Ｆ））
＃ファイルにおける最初の６列を削除し、かつ遺伝子型を出力する
ｃｈｉｐ［，ｃ（１：６）：＝ＮＵＬＬ］
ｒｓｅｑ［，ｃ（１：６）：＝ＮＵＬＬ］
ｆｗｒｉｔｅ（ｃｈｉｐ， “ｇｅｎｏ＿ｏｐ＿ｃｈｉｐ．ｃｓｖ”，ｓｅｐ＝ “ ，”，ｒｏｗ．ｎａｍｅｓ＝Ｆ，ｑｕｏｔｅ＝Ｆ）
ｆｗｒｉｔｅ（ｒｓｅｑ， “ｇｅｎｏ＿ｏｐ＿ｒｓｅｑ．ｃｓｖ”，ｓｅｐ＝ “ ，”，ｒｏｗ．ｎａｍｅｓ＝Ｆ，ｑｕｏｔｅ＝Ｆ）
＃一致性を統計する
ｓｕｍ（ｃｈｉｐ＝＝ｒｓｅｑ）／（ｎｒｏｗ（ｃｈｉｐ）＊ｎｃｏｌ（ｃｈｉｐ））＊１００
統計によると、上記９５の個体は計１、１１３、３０５のマーカーを有し、完全に同様であるマーカー数が１、００２、８２９であることがわかった。そのため、チップと再配列分類の結果は９０．０８％完全に一致する。 d) Utilizing the match between the R statistical chip and the rearrangement classification, the method is as follows:
# Unload the required R package library (data.table)
# Chip <-fread (“op_chip.raw”) that reads the classification information acquired by the chip and rearrangement, respectively.
rseq <-fread ("op_rseq.raw")
# Unify individual sequences in files fid <-data. frame (rseq $ FID)
colnames (fid) <-"FID"
chip <-data. table (merge (fid, chip, sort = F))
# Delete the first 6 columns in the file and output the genotype chip [, c (1: 6): = NULL]
rseq [, c (1: 6): = NULL]
file (chip, “geno_op_chip.csv”, sep = “,”, low.names = F, quote = F)
fwrite (rseq, “geno_op_rseq.csv”, sep = “,”, low.names = F, quote = F)
# Statistical consistency sum (chip == rseq) / (nlow (chip) * ncol (chip)) * 100
According to statistics, it was found that the 95 individuals had a total of 1,113,305 markers, and the number of markers that were completely similar was 1,002,829. Therefore, the results of chip and rearrangement classification are 90.08% in perfect agreement.

ｅ）ＧＥＢＶ推定の正確性を保証するために、ＰＬＩＮＫソフトウェアを利用して参考グループにおける残りの個体の遺伝子型を抽出し、コマンドは以下のとおりである：
ｐｌｉｎｋ－－ｂｆｉｌｅ …／Ｖａｌ＿ｒｅｆ－－ｒｅｍｏｖｅｏｐ＿ｃｈｉｐ＿ｉｎｄｉ．ｔｘｔ－－ｅｘｔｒａｃｔｃｏｍｍｏｎ＿ｓｎｐｓ．ｔｘｔ－－ｒｅｃｏｄｅＡ－－ｏｕｔｒｅｆ e) To ensure the accuracy of the GEBV estimation, genotypes of the remaining individuals in the reference group were extracted using the PLLINK software, and the commands are:
pink －－bfile… / Val_ref －－remove op_chip_indi. pxt --- extract common_snps. pxt －－recode A －－out ref

ｆ）Ｒを利用して参考グループを合併するおよび個体の遺伝子型を検証し、その方法は以下のとおりである：
＃必要なＲパッケージをアンロードする
ｌｉｂｒａｒｙ（ｄａｔａ．ｔａｂｌｅ）
＃それぞれチップと再配列が取得した分類情報を読み取る
ｃｈｉｐ＜－ａｓ．ｍａｔｒｉｘ（ｆｒｅａｄ（“ｇｅｎｏ＿ｏｐ＿ｃｈｉｐ．ｃｓｖ”））
ｒｓｅｑ＜－ａｓ．ｍａｔｒｉｘ（ｆｒｅａｄ（“ｇｅｎｏ＿ｏｐ＿ｒｓｅｑ．ｃｓｖ”））
ｒｅｆ＜－ｆｒｅａｄ（“ｒｅｆ．ｒａｗ”）
＃ファイルにおける最初の６列を削除する
ｒｅｆ［，ｃ（１：６）：＝ＮＵＬＬ］
ｒｅｆ＜－ａｓ．ｍａｔｒｉｘ（ｒｅｆ）
＃遺伝子型ファイルを組み合わせかつ組み合わせた後の遺伝子型ファイルを出力する
ｇｅｎｏ＿ｃｈｉｐ＜－ｒｂｉｎｄ（ｃｈｉｐ，ｒｅｆ）
ｇｅｎｏ＿ｒｓｅｑ＜－ｒｂｉｎｄ（ｒｓｅｑ，ｒｅｆ）
ｗｒｉｔｅ．ｔａｂｌｅ（ｇｅｎｏ＿ｃｈｉｐ， “ｇｅｎｏ＿Ｖａｌ＿Ｃｈｉｐ．ｃｓｖ”，ｓｅｐ＝ “ ，”，ｒｏｗ．ｎａｍｅｓ＝Ｆ，ｑｕｏｔｅ＝Ｆ）
ｗｒｉｔｅ．ｔａｂｌｅ（ｇｅｎｏ＿ｒｓｅｑ， “ｇｅｎｏ＿Ｖａｌ＿Ｒｓｅｑ．ｃｓｖ”，ｓｅｐ＝ “ ，”，ｒｏｗ．ｎａｍｅｓ＝Ｆ，ｑｕｏｔｅ＝Ｆ） f) Use R to merge reference groups and verify genotypes of individuals, the method of which is:
# Unload the required R package library (data.table)
# Read the classification information acquired by the chip and rearrangement, respectively. Chip <-as. matrix (fread (“geno_op_chip.csv”))
rseq <-as. matrix (fread (“geno_op_rseq.csv”))
ref <-fread (“ref.raw”)
# Delete the first 6 columns in the file ref [, c (1: 6): = NULL]
ref <-as. matrix (ref)
# Combine genotype files and output the genotype file after the combination geno_chip <-rbind (chip, ref)
geno_rseq <-rbind (rseq, ref)
write. table (geno_chip, “geno_Val_Chip.csv”, sep = “,”, low.names = F, quote = F)
write. table (geno_rseq, “geno_Val_Rseq.csv”, sep = “,”, low.names = F, quote = F)

ｇ）ｇ）において取得した２つのｘｘｘ．ｃｓｖファイルがＲにおいて加重ＧＢＬＵＰ方法を用いてＧＥＢＶを推定する。具体的な操作方法は以下のとおりである（Ｌｉｎｕｘ環境）：
＃必要なＲパッケージと関数をアンロードする
ｌｉｂｒａｒｙ（ｐａｒａｌｌｅｌ）
ｌｉｂｒａｒｙ（ｄａｔａ．ｔａｂｌｅ）
ｌｉｂｒａｒｙ（ａｓｒｅｍｌ）
ｌｉｂｒａｒｙ（ｐＲＯＣ）
ｓｏｕｒｃｅ（“ｇｉｎｖ．Ｒ”）
関数ｇｉｎｖの定義は以下のとおりである：
ｇｉｎｖ＜－ｆｕｎｃｔｉｏｎ（ｉｎｖＧ）｛
Ｇｉｎｖ＜－ｄａｔａ．ｆｒａｍｅ（ｒｏｗ＝ｒｅｐ（１：ｎｒｏｗ（ｉｎｖＧ），ｎｒｏｗ（ｉｎｖＧ）），ｃｏｌｕｍｎ＝ｒｅｐ（１：ｎｒｏｗ（ｉｎｖＧ），ｅａｃｈ＝ｎｒｏｗ（ｉｎｖＧ）），ｖａｌｕｅ＝ａｓ．ｎｕｍｅｒｉｃ（ｉｎｖＧ），ｌｏｗｅｒ．ｍａｔ＝ａｓ．ｌｏｇｉｃａｌ（ｌｏｗｅｒ．ｔｒｉ（ｉｎｖＧ，ｄｉａｇ＝Ｔ）））
Ｇｉｎｖ＜－Ｇｉｎｖ［Ｇｉｎｖ＄ｌｏｗｅｒ．ｍａｔ＝＝Ｔ，ｃ（“ｒｏｗ”， “ｃｏｌｕｍｎ”， “ｖａｌｕｅ”）］
Ｇｉｎｖ＜－Ｇｉｎｖ［ｏｒｄｅｒ（Ｇｉｎｖ＄ｒｏｗ，Ｇｉｎｖ＄ｃｏｌｕｍｎ），］
ｒｅｔｕｒｎ（Ｇｉｎｖ）
｝
＃遺伝子型、表現型情報を読み取る
ｇｅｎｏ＿ｃｈｉｐ＜－ａｓ．ｍａｔｒｉｘ（ｆｒｅａｄ（“ｇｅｎｏ＿Ｖａｌ＿Ｃｈｉｐ．ｃｓｖ”，ｎＴｈｒｅａｄ＝１０））
ｇｅｎｏ＿ｒｓｅｑ＜－ａｓ．ｍａｔｒｉｘ（ｆｒｅａｄ（“ｇｅｎｏ＿Ｖａｌ＿Ｒｓｅｑ．ｃｓｖ”，ｎＴｈｒｅａｄ＝１０））
ｐｈｅｎｏ＜－ａｓｒｅｍｌ．ｒｅａｄ．ｔａｂｌｅ（“ｐｈｅｎｏ＿ｏｐ＿Ｖａｌ．ｃｓｖ”，ｈｅａｄｅｒ＝Ｔ，ｓｅｐ＝“ ，”）
ｒｎａｍｅｓ＜－ａｓ．ｍａｔｒｉｘ（ｆｒｅａｄ（“ｐｈｅｎｏ＿ｏｐ＿Ｖａｌ．ｃｓｖ”））［，１］
Ｍ＿１＜－ｇｅｎｏ＿ｃｈｉｐ
Ｍ＿２＜－ｇｅｎｏ＿ｒｓｅｑ
＃各サイトの二次等位遺伝子頻度を計算する
ｐｉ＿１＜－ｒｏｕｎｄ（ｃｏｌＳｕｍｓ（Ｍ＿１）／（２＊ｎｒｏｗ（Ｍ＿１）），３）
ｐｉ＿２＜－ｒｏｕｎｄ（ｃｏｌＳｕｍｓ（Ｍ＿２）／（２＊ｎｒｏｗ（Ｍ＿２）），３）
＃Ｐマトリックスを構築する
Ｐ＿１＜－ｍａｔｒｉｘ（２＊ｐｉ＿１，ｂｙｒｏｗ＝Ｔ，ｎｒｏｗ＝ｎｒｏｗ（Ｍ＿１），ｎｃｏｌ＝ｎｃｏｌ（Ｍ＿１））
Ｐ＿２＜－ｍａｔｒｉｘ（２＊ｐｉ＿２，ｂｙｒｏｗ＝Ｔ，ｎｒｏｗ＝ｎｒｏｗ（Ｍ＿２），ｎｃｏｌ＝ｎｃｏｌ（Ｍ＿２））
＃Ｚマトリックスを構築する
Ｚ＿１＜－ａｓ．ｍａｔｒｉｘ（Ｍ＿１－Ｐ＿１）
Ｚ＿２＜－ａｓ．ｍａｔｒｉｘ（Ｍ＿２－Ｐ＿２）
＃等式分子の項を構築する
ＺＺｔ＿１＜－ｄｏ．ｃａｌｌ（‘ｒｂｉｎｄ’，ｍｃｌａｐｐｌｙ（１：ｎｒｏｗ（Ｚ＿１），ＦＵＮ＝ｆｕｎｃｔｉｏｎ（ｘ）｛ｔｃｒｏｓｓｐｒｏｄ（Ｚ＿１［ｘ，］，Ｚｔ＿１）｝，ｍｃ．ｃｏｒｅｓ＝２０））
ＺＺｔ＿２＜－ｄｏ．ｃａｌｌ（‘ｒｂｉｎｄ’，ｍｃｌａｐｐｌｙ（１：ｎｒｏｗ（Ｚ＿２），ＦＵＮ＝ｆｕｎｃｔｉｏｎ（ｘ）｛ｔｃｒｏｓｓｐｒｏｄ（Ｚ＿２［ｘ，］，Ｚｔ＿２）｝，ｍｃ．ｃｏｒｅｓ＝２０））
＃等式分母の項を構築する
ｄｅｎｏｍ＿１＜－２＊（ｓｕｍ（ｐｉ＿１＊（１－ｐｉ＿１）））
ｄｅｎｏｍ＿２＜－２＊（ｓｕｍ（ｐｉ＿２＊（１－ｐｉ＿２）））
＃Ｇマトリックスを構築する
Ｇ＿ｃｈｉｐ＜－ＺＺｔ＿１／ｄｅｎｏｍ＿１
Ｇ＿ｒｓｅｑ＜－ＺＺｔ＿２／ｄｅｎｏｍ＿２
ｄｉａｇ（Ｇ＿ｃｈｉｐ）＜－ｄｉａｇ（Ｇ＿ｃｈｉｐ）＋０．０１
ｄｉａｇ（Ｇ＿ｒｓｅｑ）＜－ｄｉａｇ（Ｇ＿ｒｓｅｑ）＋０．０１
＃Ｇマトリックスインバージョン
ｉｎｖＧ＿ｃｈｉｐ０＜－ｓｏｌｖｅ（Ｇ＿ｃｈｉｐ）
ｉｎｖＧ＿ｒｓｅｑ０＜－ｓｏｌｖｅ（Ｇ＿ｒｓｅｑ）
＃ＧマトリックスインバージョンをＡＳＲｅｍｌが利用可能な三列の形式に変換し、かつ出力する
Ｇｉｎｖ＿ｃｈｉｐ＜－ｇｉｎｖ（ｉｎｖＧ＿ｃｈｉｐ）
Ｇｉｎｖ＿ｒｓｅｑ＜－ｇｉｎｖ（ｉｎｖＧ＿ｒｓｅｑ）
ｗｒｉｔｅ．ｔａｂｌｅ（Ｇｉｎｖ＿ｃｈｉｐ， “Ｇｉｎｖ＿ｏｐ＿Ｖａｌ＿Ｃｈｉｐ＿ａｔ＿ｉｔｅｒ＿０．ｃｓｖ”，ｓｅｐ＝ “ ，”，ｒｏｗ．ｎａｍｅｓ＝Ｆ，ｑｕｏｔｅ＝Ｆ）
ｗｒｉｔｅ．ｔａｂｌｅ（Ｇｉｎｖ＿ｒｓｅｑ， “Ｇｉｎｖ＿ｏｐ＿Ｖａｌ＿Ｒｓｅｑ＿ａｔ＿ｉｔｅｒ＿０．ｃｓｖ”，ｓｅｐ＝ “ ，”，ｒｏｗ．ｎａｍｅｓ＝Ｆ，ｑｕｏｔｅ＝Ｆ）
＃ＧＥＢＶを計算する
ａｔｔｒ（Ｇｉｎｖ＿ｃｈｉｐ， “ｒｏｗＮａｍｅｓ”）＜－ｐａｓｔｅ（ｒｎａｍｅｓ）
ａｔｔｒ（Ｇｉｎｖ＿ｒｓｅｑ， “ｒｏｗＮａｍｅｓ”）＜－ｐａｓｔｅ（ｒｎａｍｅｓ）
ＷＧＢＬＵＰ１＜－ａｓｒｅｍｌ（ｓｔａｔｕｓ～１，
ｒａｎｄｏｍ＝～ｇｉｖ（ＩＩＤ），
ｇｉｎｖｅｒｓｅ＝ｌｉｓｔ（ＩＩＤ＝Ｇｉｎｖ＿ｃｈｉｐ），
ｒｃｏｖ＝～ｕｎｉｔｓ，
ｆａｍｉｌｙ＝ａｓｒｅｍｌ．ｂｉｎｏｍｉａｌ（ｌｉｎｋ＝ “ｌｏｇｉｔ”）
ｄａｔａ＝ｐｈｅｎｏ，
ｍａｘｉｔｅｒ＝１００，
ｎａ．ｍｅｔｈｏｄ．Ｘ＝ ‘ｏｍｉｔ’）
ｇｅｂｖ０１＜－ｃｏｅｆ（ＷＧＢＬＵＰ１）＄ｒａｎｄｏｍ
ｗｒｉｔｅ．ｔａｂｌｅ（ｇｅｂｖ０１， “ＧＥＢＶｓ＿ａｔ＿ｉｔｅｒ＿０＿ｃｈｉｐ．ｃｓｖ”，ｓｅｐ＝ “ ，”，ｒｏｗ．ｎａｍｅｓ＝Ｆ，ｑｕｏｔｅ＝Ｆ）

ＷＧＢＬＵＰ２＜－ａｓｒｅｍｌ（ｓｔａｔｕｓ～１，
ｒａｎｄｏｍ＝～ｇｉｖ（ＩＩＤ），
ｇｉｎｖｅｒｓｅ＝ｌｉｓｔ（ＩＩＤ＝Ｇｉｎｖ＿ｒｓｅｑ），
ｒｃｏｖ＝～ｕｎｉｔｓ，
ｆａｍｉｌｙ＝ａｓｒｅｍｌ．ｂｉｎｏｍｉａｌ（ｌｉｎｋ＝ “ｌｏｇｉｔ”），
ｄａｔａ＝ｐｈｅｎｏ，
ｍａｘｉｔｅｒ＝１００，
ｎａ．ｍｅｔｈｏｄ．Ｘ＝ ‘ｏｍｉｔ’）
ｇｅｂｖ０２＜－ｃｏｅｆ（ＷＧＢＬＵＰ２）＄ｒａｎｄｏｍ
ｗｒｉｔｅ．ｔａｂｌｅ（ｇｅｂｖ０２， “ＧＥＢＶｓ＿ａｔ＿ｉｔｅｒ＿０＿ｒｓｅｑ．ｃｓｖ”，ｓｅｐ＝ “ ，”，ｒｏｗ．ｎａｍｅｓ＝Ｆ，ｑｕｏｔｅ＝Ｆ）

＃ＧＢＬＵＰプロセス・チップ部分を加重し、ｆｏｒサイクルを用い、６回の加重反復過程を実行する
＃マーカーの効果値を推定する
Ｚｔ１＜－ｔ（Ｚ＿１）
ｐ１＜－ｄｏ．ｃａｌｌ（‘ｒｂｉｎｄ’，ｍｃｌａｐｐｌｙ（１：ｎｒｏｗ（Ｚｔ１），ＦＵＮ＝ｆｕｎｃｔｉｏｎ（ｘ）｛Ｚｔ１［ｘ，］％＊％ｉｎｖＧ０１｝，ｍｃ．ｃｏｒｅｓ＝５））
ｐ２＜－ｐ１％＊％ｇｅｂｖ０１
ｕ０１＜－ｐ２／ｐｐ＿１
Ｖａｒｕ０１＜－ｕ０１＊ｕ０１＊２＊ｐｉ＿１＊（１－ｐｉ＿１）
ｗｒｉｔｅ．ｔａｂｌｅ（ｕ０１， “ ｍａｒｋｅｒＥｆｆ＿ｃｈｉｐ＿ａｔ＿ｉｔｅｒ＿０．ｃｓｖ”，ｓｅｐ＝ “ ，”，ｒｏｗ．ｎａｍｅｓ＝Ｆ，ｑｕｏｔｅ＝Ｆ）
ｗｒｉｔｅ．ｔａｂｌｅ（Ｖａｒｕ０１， “ｍａｒｋｅｒＶａｒ＿ｃｈｉｐ＿ａｔ＿ｉｔｅｒ＿０．ｃｓｖ”，ｓｅｐ＝ “ ，”，ｒｏｗ．ｎａｍｅｓ＝Ｆ，ｑｕｏｔｅ＝Ｆ）
＃各マーカーの重みを推定する
Ｄ＜－ｒｅｐ（１，ｎｃｏｌ（Ｍ＿１））
Ｄ０＜－ｒｅｐ（１，ｎｃｏｌ（Ｍ＿１））
ｐｒｅ＿Ｄ＜－Ｄ０
ｕ＜－ｕ０１
ｃａｌ＜－６
ｉｎｔｅｒ＜－１
ｆｏｒ（ｔｉｎ１：ｃａｌ）｛
ｄ＜－ａｓ．ｖｅｃｔｏｒ（ｕ＊ｕ＊２＊ｐｉ＿１＊（１－ｐｉ＿１））
ｆｏｒ（ｉｉｎ１：（ｌｅｎｇｔｈ（ｄ）／ｉｎｔｅｒ））｛
ｄｉ＜－ｍｅａｎ（ｄ［（ｉｎｔｅｒ＊（ｉ－１）＋１）：（ｉｎｔｅｒ＊ｉ）］）
ｐｒｅ＿Ｄ［（ｉｎｔｅｒ＊（ｉ－１）＋１）：（ｉｎｔｅｒ＊ｉ）］＜－ｄｉ
｝
Ｄ＜－ｓｕｍ（Ｄ０）／ｓｕｍ（ｐｒｅ＿Ｄ）＊ｐｒｅ＿Ｄ
ｗｒｉｔｅ．ｔａｂｌｅ（Ｄ，ｆｉｌｅ＝ｐａｓｔｅ（“ｗｅｉｇｈｔｓ＿ｃｈｉｐ＿ａｔ＿ｉｔｅｒ＿”，ｔ， “ ．ｃｓｖ”，ｓｅｐ＝“ ”），ｓｅｐ＝“ ，”，ｒｏｗ．ｎａｍｅｓ＝Ｆ，ｑｕｏｔｅ＝Ｆ）
＃加重Ｇマトリックスを推定する
ｐ１＜－ｄｏ．ｃａｌｌ（‘ｃｂｉｎｄ’，ｍｃｌａｐｐｌｙ（１：ｎｃｏｌ（Ｚ＿１），ＦＵＮ＝ｆｕｎｃｔｉｏｎ（ｘ）｛Ｚ＿１［，ｘ］＊Ｄ［ｘ］｝，ｍｃ．ｃｏｒｅｓ＝５））
ｐ２＜－ｄｏ．ｃａｌｌ（‘ｒｂｉｎｄ’，ｍｃｌａｐｐｌｙ（１：ｎｒｏｗ（ｐ１），ＦＵＮ＝ｆｕｎｃｔｉｏｎ（ｘ）｛ｐ１［ｘ，］％＊％Ｚｔ１｝，ｍｃ．ｃｏｒｅｓ＝５））
Ｇ＜－ｐ２／ｐｐ＿１
ｗｒｉｔｅ．ｔａｂｌｅ（Ｇ，ｆｉｌｅ＝ｐａｓｔｅ（“Ｇ＿ｃｈｉｐ＿ａｔ＿ｉｔｅｒ＿”，ｔ， “．ｃｓｖ”，ｓｅｐ＝“ ”），ｓｅｐ＝“ ，”，ｒｏｗ．ｎａｍｅｓ＝Ｆ，ｑｕｏｔｅ＝Ｆ）
ｄｉａｇ（Ｇ）＜－ｄｉａｇ（Ｇ）＋０．０１
ｉｎｖＧ＜－ｓｏｌｖｅ（Ｇ）
ｗｒｉｔｅ．ｔａｂｌｅ（ｉｎｖＧ，ｆｉｌｅ＝ｐａｓｔｅ（“ｉｎｖＧ＿ｃｈｉｐ＿ａｔ＿ｉｔｅｒ＿”，ｔ， “．ｃｓｖ”，ｓｅｐ＝“ ”），ｓｅｐ＝“，”，ｒｏｗ．ｎａｍｅｓ＝Ｆ，ｑｕｏｔｅ＝Ｆ）
Ｇｉｎｖ＜－ｇｉｎｖ（ｉｎｖＧ）
ｗｒｉｔｅ．ｔａｂｌｅ（Ｇｉｎｖ，ｆｉｌｅ＝ｐａｓｔｅ（“Ｇｉｎｖ＿ｃｈｉｐ＿ａｔ＿ｉｔｅｒ＿”，ｔ， “．ｃｓｖ”，ｓｅｐ＝“ ”），ｓｅｐ＝“ ，”，ｒｏｗ．ｎａｍｅｓ＝Ｆ，ｑｕｏｔｅ＝Ｆ）
ａｔｔｒ（Ｇｉｎｖ，“ｒｏｗＮａｍｅｓ”）＜－ｐａｓｔｅ（ｒｎａｍｅｓ）
ＷＧＢＬＵＰ＜－ａｓｒｅｍｌ（ｓｔａｔｕｓ～１，
ｒａｎｄｏｍ＝～ｇｉｖ（ＩＩＤ），
ｇｉｎｖｅｒｓｅ＝ｌｉｓｔ（ＩＩＤ＝Ｇｉｎｖ），
ｒｃｏｖ＝～ｕｎｉｔｓ，
ｆａｍｉｌｙ＝ａｓｒｅｍｌ．ｂｉｎｏｍｉａｌ（ｌｉｎｋ＝ “ｌｏｇｉｔ”），
ｄａｔａ＝ｐｈｅｎｏ，
ｍａｘｉｔｅｒ＝１００，
ｎａ．ｍｅｔｈｏｄ．Ｘ＝‘ｏｍｉｔ’）
ｇｅｂｖ＜－ｃｏｅｆ（ＷＧＢＬＵＰ）＄ｒａｎｄｏｍ
ｗｒｉｔｅ．ｔａｂｌｅ（ｇｅｂｖ，ｆｉｌｅ＝ｐａｓｔｅ（“ＧＥＢＶｓ＿ｃｈｉｐ＿ａｔ＿ｉｔｅｒ＿”，ｔ， “．ｃｓｖ”，ｓｅｐ＝“ ”），ｓｅｐ＝ “ ，”，ｒｏｗ．ｎａｍｅｓ＝Ｆ，ｑｕｏｔｅ＝Ｆ）
＃加重後マーカーの効果値を推定する
ｐ１＜－ｄｏ．ｃａｌｌ（‘ｒｂｉｎｄ’，ｍｃｌａｐｐｌｙ（１：ｎｒｏｗ（Ｚｔ１），ＦＵＮ＝ｆｕｎｃｔｉｏｎ（ｘ）｛Ｄ［ｘ］＊Ｚｔ１［ｘ，］｝，ｍｃ．ｃｏｒｅｓ＝５））
ｐ２＜－ｄｏ．ｃａｌｌ（‘ｒｂｉｎｄ’，ｍｃｌａｐｐｌｙ（１：ｎｒｏｗ（ｐ１），ＦＵＮ＝ｆｕｎｃｔｉｏｎ（ｘ）｛ｐ１［ｘ，］％＊％ｉｎｖＧ｝，ｍｃ．ｃｏｒｅｓ＝５））
ｐ３＜－ｄｏ．ｃａｌｌ（‘ｒｂｉｎｄ’，ｍｃｌａｐｐｌｙ（１：ｎｒｏｗ（ｐ２），ＦＵＮ＝ｆｕｎｃｔｉｏｎ（ｘ）｛ｐ２［ｘ，］％＊％ｇｅｂｖ０１｝，ｍｃ．ｃｏｒｅｓ＝５））
ｕ＜－ｐ３／ｐｐ＿１
Ｖａｒｕ＜－ｕ＊ｕ＊２＊ｐｉ＿１＊（１－ｐｉ＿１）
ｗｒｉｔｅ．ｔａｂｌｅ（ｕ，ｆｉｌｅ＝ｐａｓｔｅ（“ｍａｒｋｅｒＥｆｆ＿ｃｈｉｐ＿ａｔ＿ｉｔｅｒ＿”，ｔ， “．ｃｓｖ”，ｓｅｐ＝“ ”），ｓｅｐ＝ “ ，”，ｒｏｗ．ｎａｍｅｓ＝Ｆ，ｑｕｏｔｅ＝Ｆ）
ｗｒｉｔｅ．ｔａｂｌｅ（Ｖａｒｕ，ｆｉｌｅ＝ｐａｓｔｅ（“ ｍａｒｋｅｒＶａｒ＿ｃｈｉｐ＿ａｔ＿ｉｔｅｒ＿”，ｔ， “．ｃｓｖ”，ｓｅｐ＝“ ”），ｓｅｐ＝ “ ，”，ｒｏｗ．ｎａｍｅｓ＝Ｆ，ｑｕｏｔｅ＝Ｆ）
｝
＃ＧＢＬＵＰプロセス・再配列部分を加重し、ｆｏｒサイクルを用い、６回の加重反復過程を実行する
Ｚｔ２＜－ｔ（Ｚ＿２）
ｐ１＜－ｄｏ．ｃａｌｌ（‘ｒｂｉｎｄ’，ｍｃｌａｐｐｌｙ（１：ｎｒｏｗ（Ｚｔ２），ＦＵＮ＝ｆｕｎｃｔｉｏｎ（ｘ）｛Ｚｔ２［ｘ，］％＊％ｉｎｖＧ０２｝，ｍｃ．ｃｏｒｅｓ＝５））
ｐ２＜－ｐ１％＊％ｇｅｂｖ０２
ｕ０２＜－ｐ２／ｐｐ＿２
Ｖａｒｕ０２＜－ｕ０２＊ｕ０２＊２＊ｐｉ＿２＊（１－ｐｉ＿２）
ｗｒｉｔｅ．ｔａｂｌｅ（ｕ０２， “ ｍａｒｋｅｒＥｆｆ＿ｒｓｅｑ＿ａｔ＿ｉｔｅｒ＿０．ｃｓｖ”，ｓｅｐ＝ “ ，”，ｒｏｗ．ｎａｍｅｓ＝Ｆ，ｑｕｏｔｅ＝Ｆ）
ｗｒｉｔｅ．ｔａｂｌｅ（Ｖａｒｕ０２， “ ｍａｒｋｅｒＶａｒ＿ｒｓｅｑ＿ａｔ＿ｉｔｅｒ＿０．ｃｓｖ”，ｓｅｐ＝ “ ，”，ｒｏｗ．ｎａｍｅｓ＝Ｆ，ｑｕｏｔｅ＝Ｆ）
＃各マーカーの重みを推定する
Ｄ＜－ｒｅｐ（１，ｎｃｏｌ（Ｍ＿２））
Ｄ０＜－ｒｅｐ（１，ｎｃｏｌ（Ｍ＿２））
ｐｒｅ＿Ｄ＜－Ｄ０
ｕ＜－ｕ０２
ｃａｌ＜－６
ｉｎｔｅｒ＜－１
ｆｏｒ（ｔｉｎ１：ｃａｌ）｛
ｄ＜－ａｓ．ｖｅｃｔｏｒ（ｕ＊ｕ＊２＊ｐｉ＿２＊（１－ｐｉ＿２））
ｆｏｒ（ｉｉｎ１：（ｌｅｎｇｔｈ（ｄ）／ｉｎｔｅｒ））｛
ｄｉ＜－ｍｅａｎ（ｄ［（ｉｎｔｅｒ＊（ｉ－１）＋１）：（ｉｎｔｅｒ＊ｉ）］）
ｐｒｅ＿Ｄ［（ｉｎｔｅｒ＊（ｉ－１）＋１）：（ｉｎｔｅｒ＊ｉ）］＜－ｄｉ
｝
Ｄ＜－ｓｕｍ（Ｄ０）／ｓｕｍ（ｐｒｅ＿Ｄ）＊ｐｒｅ＿Ｄ
ｗｒｉｔｅ．ｔａｂｌｅ（Ｄ，ｆｉｌｅ＝ｐａｓｔｅ（“ ｗｅｉｇｈｔｓ＿ｒｓｅｑ＿ａｔ＿ｉｔｅｒ＿”，ｔ， “ ．ｃｓｖ”，ｓｅｐ＝“ ”），ｓｅｐ＝“ ，”，ｒｏｗ．ｎａｍｅｓ＝Ｆ，ｑｕｏｔｅ＝Ｆ）
＃加重Ｇマトリックスを推定する
ｐ１＜－ｄｏ．ｃａｌｌ（‘ｃｂｉｎｄ’，ｍｃｌａｐｐｌｙ（１：ｎｃｏｌ（Ｚ＿２），ＦＵＮ＝ｆｕｎｃｔｉｏｎ（ｘ）｛Ｚ＿２［，ｘ］＊Ｄ［ｘ］｝，ｍｃ．ｃｏｒｅｓ＝５））
ｐ２＜－ｄｏ．ｃａｌｌ（‘ｒｂｉｎｄ’，ｍｃｌａｐｐｌｙ（１：ｎｒｏｗ（ｐ１），ＦＵＮ＝ｆｕｎｃｔｉｏｎ（ｘ）｛ｐ１［ｘ，］％＊％Ｚｔ２｝，ｍｃ．ｃｏｒｅｓ＝５））
Ｇ＜－ｐ２／ｐｐ＿２
ｗｒｉｔｅ．ｔａｂｌｅ（Ｇ，ｆｉｌｅ＝ｐａｓｔｅ（“Ｇ＿ｒｓｅｑ＿ａｔ＿ｉｔｅｒ＿”，ｔ， “．ｃｓｖ”，ｓｅｐ＝“ ”），ｓｅｐ＝“ ，”，ｒｏｗ．ｎａｍｅｓ＝Ｆ，ｑｕｏｔｅ＝Ｆ）
ｄｉａｇ（Ｇ）＜－ｄｉａｇ（Ｇ）＋０．０１
ｉｎｖＧ＜－ｓｏｌｖｅ（Ｇ）
ｗｒｉｔｅ．ｔａｂｌｅ（ｉｎｖＧ，ｆｉｌｅ＝ｐａｓｔｅ（“ｉｎｖＧ＿ｒｓｅｑ＿ａｔ＿ｉｔｅｒ＿”，ｔ， “ ．ｃｓｖ”，ｓｅｐ＝“ ”），ｓｅｐ＝“ ，”，ｒｏｗ．ｎａｍｅｓ＝Ｆ，ｑｕｏｔｅ＝Ｆ）
Ｇｉｎｖ＜－ｇｉｎｖ（ｉｎｖＧ）
ｗｒｉｔｅ．ｔａｂｌｅ（Ｇｉｎｖ，ｆｉｌｅ＝ｐａｓｔｅ（“Ｇｉｎｖ＿ｒｓｅｑ＿ａｔ＿ｉｔｅｒ＿”，ｔ， “ ．ｃｓｖ”，ｓｅｐ＝“ ”），ｓｅｐ＝“ ，”，ｒｏｗ．ｎａｍｅｓ＝Ｆ，ｑｕｏｔｅ＝Ｆ）
ａｔｔｒ（Ｇｉｎｖ，“ ｒｏｗＮａｍｅｓ”）＜－ｐａｓｔｅ（ｒｎａｍｅｓ）
ＷＧＢＬＵＰ＜－ａｓｒｅｍｌ（ｓｔａｔｕｓ～１，
ｒａｎｄｏｍ＝～ｇｉｖ（ＩＩＤ），
ｇｉｎｖｅｒｓｅ＝ｌｉｓｔ（ＩＩＤ＝Ｇｉｎｖ），
ｒｃｏｖ＝～ｕｎｉｔｓ，
ｆａｍｉｌｙ＝ａｓｒｅｍｌ．ｂｉｎｏｍｉａｌ（ｌｉｎｋ＝ ”ｌｏｇｉｔ”），
ｄａｔａ＝ｐｈｅｎｏ，
ｍａｘｉｔｅｒ＝１００，
ｎａ．ｍｅｔｈｏｄ．Ｘ＝ ‘ｏｍｉｔ’）
ｇｅｂｖ＜－ｃｏｅｆ（ＷＧＢＬＵＰ）＄ｒａｎｄｏｍ
ｗｒｉｔｅ．ｔａｂｌｅ（ｇｅｂｖ，ｆｉｌｅ＝ｐａｓｔｅ（“ ＧＥＢＶｓ＿ｒｓｅｑ＿ａｔ＿ｉｔｅｒ＿”，ｔ， “ ．ｃｓｖ”，ｓｅｐ＝“ ”），ｓｅｐ＝ “ ，”，ｒｏｗ．ｎａｍｅｓ＝Ｆ，ｑｕｏｔｅ＝Ｆ）
＃加重後マーカーの効果値を推定する
ｐ１＜－ｄｏ．ｃａｌｌ（’ｒｂｉｎｄ’，ｍｃｌａｐｐｌｙ（１：ｎｒｏｗ（Ｚｔ２），ＦＵＮ＝ｆｕｎｃｔｉｏｎ（ｘ）｛Ｄ［ｘ］＊Ｚｔ２［ｘ，］｝，ｍｃ．ｃｏｒｅｓ＝５））
ｐ２＜－ｄｏ．ｃａｌｌ（‘ｒｂｉｎｄ’，ｍｃｌａｐｐｌｙ（１：ｎｒｏｗ（ｐ１），ＦＵＮ＝ｆｕｎｃｔｉｏｎ（ｘ）｛ｐ１［ｘ，］％＊％ｉｎｖＧ｝，ｍｃ．ｃｏｒｅｓ＝５））
ｐ３＜－ｄｏ．ｃａｌｌ（‘ｒｂｉｎｄ’，ｍｃｌａｐｐｌｙ（１：ｎｒｏｗ（ｐ２），ＦＵＮ＝ｆｕｎｃｔｉｏｎ（ｘ）｛ｐ２［ｘ，］％＊％ｇｅｂｖ０２｝，ｍｃ．ｃｏｒｅｓ＝５））
ｕ＜－ｐ３／ｐｐ＿２
Ｖａｒｕ＜－ｕ＊ｕ＊２＊ｐｉ＿２＊（１－ｐｉ＿２）
ｗｒｉｔｅ．ｔａｂｌｅ（ｕ，ｆｉｌｅ＝ｐａｓｔｅ（“ｍａｒｋｅｒＥｆｆ＿ｒｓｅｑ＿ａｔ＿ｉｔｅｒ＿”，ｔ， “．ｃｓｖ”，ｓｅｐ＝“ ”），ｓｅｐ＝ “ ，”，ｒｏｗ．ｎａｍｅｓ＝Ｆ，ｑｕｏｔｅ＝Ｆ）
ｗｒｉｔｅ．ｔａｂｌｅ（Ｖａｒｕ，ｆｉｌｅ＝ｐａｓｔｅ（“ ｍａｒｋｅｒＶａｒ＿ｒｓｅｑ＿ａｔ＿ｉｔｅｒ＿”，ｔ， “ ．ｃｓｖ”，ｓｅｐ＝“ ”），ｓｅｐ＝ “ ，”，ｒｏｗ．ｎａｍｅｓ＝Ｆ，ｑｕｏｔｅ＝Ｆ）
｝
計算を経て、第４回反復した時に加重ＧＢＬＵＰ方法は安定に近づき、そのためこの時の反復結果について後続の研究を実施する。検証に用いられる９５の個体ＧＥＢＶの間における関連係数は０．９５８で、これらの個体はＧＥＢＶは表６による： g) Two xxx obtained in g). The csv file estimates GEBV in R using the weighted GBLUP method. The specific operation method is as follows (Linux environment):
# Unload the required R packages and functions library (parallell)
library (data.table)
library (asreml)
library (prOC)
source (“ginv.R”)
The definition of the function ginv is as follows:
ginv <-function (invG) {
Ginv <-data. frame (low = rep (1: low (invG), low (invG)), logic = rep (1: low (invG), reach = narrow (invG)), value = as.numeric (invG), low. = As.logical (lower.tri (invG, diag = T)))
Ginv <-Ginv [Ginv $ lower. mat == T, c (“low”, “column”, “value”)]
Ginv <-Ginv [order (Ginv $ row, Ginv $ volume),]
return (Ginv)
}
# Read genotype and phenotype information geno_chip <-as. matrix (fread (“geno_Val_Chip.csv”, nThread = 10))
geno_rseq <-as. matrix (fread (“geno_Val_Rseq.csv”, nThread = 10))
pheno <-asreml. read. table ("pheno_op_Val.csv", header = T, sep = ",")
rnames <-as. matrix (fread (“pheno_op_Val.csv”)) [, 1]
M_1 <-geno_chip
M_2 <-geno_rseq
# Calculate the frequency of secondary coordinating genes at each site pi_1 <-round (colSums (M_1) / (2 * now (M_1)), 3)
pi_2 <-round (colSums (M_2) / (2 * now (M_2)), 3)
# Build a P matrix P_1 <-matrix (2 * pi_1, byrow = T, now = now (M_1), ncol = ncol (M_1))
P_2 <-matrix (2 * pi_2, byrow = T, now = now (M_2), ncol = ncol (M_2))
# Build a Z matrix Z_1 <-as. matrix (M_1-P_1)
Z_2 <-as. matrix (M_2-P_2)
# ZZt_1 to construct the term of equation numerator <-do. call ('rbind', mclapply (1: now (Z_1), FUN = faction (x) {tcrossprod (Z_1 [x,], Zt_1)}, mc.cores = 20))
ZZt_2 <-do. call ('rbind', mclapply (1: now (Z_2), FUN = faction (x) {tcrossprod (Z_2 [x,], Zt_2)}, mc.cores = 20))
# Construct the term of equation denominator denom_1 <-2 * (sum (pi_1 * (1-pi_1)))
denom_2 <-2 * (sum (pi_2 * (1-pi_2)))
# Build the G matrix G_chip <-ZZt_1 / denom_1
G_rseq <-ZZt_2 / denom_2
diag (G_chip) <-diag (G_chip) + 0.01
diag (G_rseq) <-diag (G_rseq) + 0.01
# G Matrix Inversion invG_chip0 <-solve (G_chip)
invG_rseq0 <-solve (G_rseq)
# G Matrix inversion is converted to a three-column format that ASReml can use and output Ginv_chip <-ginv (invG_chip)
Ginv_rseq <-ginv (invG_rseq)
write. table (Ginv_chip, "Ginv_op_Val_Chip_at_itter_0.csv", sep = ",", low.names = F, quote = F)
write. table (Ginv_rseq, "Ginv_op_Val_Rseq_at_ita_0.csv", sep = ",", low.names = F, quote = F)
#Calculate GEBV attr (Ginv_chip, “lowNames”) <-paste (rnames)
attr (Ginv_rseq, “lowNames”) <-paste (runames)
WGBLUP1 <-asreml (status ~ 1,
random = ~ giv (IID),
giverse = list (IID = Ginv_chip),
rkov = ~ units,
family = asreml. Binomial (link = “logit”)
data = pheno,
maximizer = 100,
na. method. X ='omit')
gebv01 <-coef (WGBLUP1) $ random
write. table (gebv01, "GEBVs_at_ita_0_chip.csv", sep = ",", low.names = F, quote = F)

WGBLUP2 <-asreml (status ~ 1,
random = ~ giv (IID),
giverse = list (IID = Ginv_rseq),
rkov = ~ units,
family = asreml. Binomial (link = “logit”),
data = pheno,
maximizer = 100,
na. method. X ='omit')
gebv02 <-coef (WGBLUP2) $ random
write. table (gebv02, "GEBVs_at_ita_0_rseq.csv", sep = ",", low.names = F, quote = F)

#GBLUP process ・ Zt1 <-t (Z_1) that estimates the effect value of the marker that weights the chip portion and executes the weighted iteration process 6 times using the for cycle.
p1 <-do. call ('rbind', mclappley (1: now (Zt1), FUN = faction (x) {Zt1 [x,]% *% invG01}, mc.cores = 5))
p2 <-p1% *% gebv01
u01 <-p2 / pp_1
Varu01 <-u01 * u01 * 2 * pi_1 * (1-pi_1)
write. table (u01, "markerEff_chip_at_itter_0.csv", sep = ",", low.names = F, quaote = F)
write. table (Varu01, "markerVar_chip_at_ita_0.csv", sep = ",", low.names = F, quote = F)
# Estimate the weight of each marker D <-rep (1, ncol (M_1))
D0 <-rep (1, ncol (M_1))
pre_D <-D0
u <-u01
cal <-6
inter <-1
for (t in 1: cal) {
d <-as. vector (u * u * 2 * pi_1 * (1-pi_1))
for (i in 1: (length (d) / inter)) {
di <-mean (d [(inter * (i-1) + 1): (inter * i)])
pre_D [(inter * (i-1) + 1): (inter * i)] <-di
}
D <-sum (D0) / sum (pre_D) * pre_D
write. table (D, file = paste ("weights_chip_at_ita_", t, ".csv", sep = ""), sep = ",", low.names=F, quote = F)
# Estimate the weighted G matrix p1 <-do. call ('cbind', mclapply (1: ncol (Z_1), FUN = faction (x) {Z_1 [, x] * D [x]}, mc.cores = 5))
p2 <-do. call ('rbind', mclapply (1: now (p1), FUN = faction (x) {p1 [x,]% *% Zt1}, mc.cores = 5))
G <-p2 / pp_1
write. table (G, file = paste ("G_chip_at_itter_", t, ".csv", sep = ""), sep = ",", row.names=F, quote = F)
diag (G) <-diag (G) + 0.01
invG <-solve (G)
write. table (invG, file = paste ("invG_chip_at_itter_", t, ".csv", sep = ""), sep = ",", row.names=F, quote = F)
Ginv <-ginv (invG)
write. table (Ginv, file = paste ("Ginv_chip_at_itter_", t, ".csv", sep = ""), sep = ",", row.names=F, quote = F)
attr (Ginv, "lowNames") <-paste (runames)
WGBLUP <-asreml (status ~ 1,
random = ~ giv (IID),
giverse = list (IID = Ginv),
rkov = ~ units,
family = asreml. Binomial (link = “logit”),
data = pheno,
maximizer = 100,
na. method. X ='omit')
gevv <-coef (WGBLUP) $ random
write. table (gebv, file = paste ("GEBVs_chip_at_ita_", t, ".csv", sep = ""), sep = ",", low.names = F, quote = F)
# Estimate the effect value of the post-weighted marker p1 <-do. call ('rbind', mclapply (1: now (Zt1), FUN = faction (x) {D [x] * Zt1 [x,]}, mc.cores = 5))
p2 <-do. call ('rbind', mclappley (1: now (p1), FUN = faction (x) {p1 [x,]% *% invG}, mc.cores = 5))
p3 <-do. call ('rbind', mclapply (1: now (p2), FUN = faction (x) {p2 [x,]% *% gebv01}, mc.cores = 5))
u <-p3 / pp_1
Varu <-u * u * 2 * pi_1 * (1-pi_1)
write. table (u, file = paste ("markerEff_chip_at_ita_", t, ".csv", sep = ""), sep = ",", low.names = F, quote = F)
write. table (Varu, file = paste ("markerVar_chip_at_itter_", t, ".csv", sep = ""), sep = ",", low.names = F, quote = F)
}
#GBLUP process-Zt2 <-t (Z_2) that weights the rearranged part and executes a weighted iteration process 6 times using a for cycle.
p1 <-do. call ('rbind', mclappley (1: now (Zt2), FUN = faction (x) {Zt2 [x,]% *% invG02}, mc.cores = 5))
p2 <-p1% *% gebv02
u02 <-p2 / pp_2
Varu02 <-u02 * u02 * 2 * pi_2 * (1-pi_2)
write. table (u02, "markerEff_rsq_at_itter_0.csv", sep = ",", low.names = F, quaote = F)
write. table (Varu02, "markerVar_rseq_at_ita_0.csv", sep = ",", low.names = F, quote = F)
# Estimate the weight of each marker D <-rep (1, ncol (M_2))
D0 <-rep (1, ncol (M_2))
pre_D <-D0
u <-u02
cal <-6
inter <-1
for (t in 1: cal) {
d <-as. vector (u * u * 2 * pi_2 * (1-pi_2))
for (i in 1: (length (d) / inter)) {
di <-mean (d [(inter * (i-1) + 1): (inter * i)])
pre_D [(inter * (i-1) + 1): (inter * i)] <-di
}
D <-sum (D0) / sum (pre_D) * pre_D
write. table (D, file = paste ("weights_rsq_at_itter_", t, ".csv", sep = ""), sep = ",", low.names=F, quote = F)
# Estimate the weighted G matrix p1 <-do. call ('cbind', mclapply (1: ncol (Z_2), FUN = faction (x) {Z_2 [, x] * D [x]}, mc.cores = 5))
p2 <-do. call ('rbind', mclapply (1: now (p1), FUN = faction (x) {p1 [x,]% *% Zt2}, mc.cores = 5))
G <-p2 / pp_2
write. table (G, file = paste ("G_rseq_at_itter_", t, ".csv", sep = ""), sep = ",", row.names=F, quote = F)
diag (G) <-diag (G) + 0.01
invG <-solve (G)
write. table (invG, file = paste (“invG_rseq_at_itter_”, t, “.csv”, sep = “”), sep = “,”, low.names = F, quote = F)
Ginv <-ginv (invG)
write. table (Ginv, file = paste (“Ginv_rsq_at_itter_”, t, “.csv”, sep = “”), sep = “,”, low.names = F, quote = F)
attr (Ginv, "lowNames") <-paste (runames)
WGBLUP <-asreml (status ~ 1,
random = ~ giv (IID),
giverse = list (IID = Ginv),
rkov = ~ units,
family = asreml. Binomial (link = "logit"),
data = pheno,
maximizer = 100,
na. method. X ='omit')
gevv <-coef (WGBLUP) $ random
write. table (gebv, file = paste ("GEBVs_rseq_at_ita_", t, ".csv", sep = ""), sep = ",", low.names = F, quote = F)
# Estimate the effect value of the post-weighted marker p1 <-do. call ('rbind', mclapply (1: now (Zt2), FUN = faction (x) {D [x] * Zt2 [x,]}, mc.cores = 5))
p2 <-do. call ('rbind', mclappley (1: now (p1), FUN = faction (x) {p1 [x,]% *% invG}, mc.cores = 5))
p3 <-do. call ('rbind', mclapply (1: now (p2), FUN = faction (x) {p2 [x,]% *% gebv02}, mc.cores = 5))
u <-p3 / pp_2
Varu <-u * u * 2 * pi_2 * (1-pi_2)
write. table (u, file = paste ("markerEff_rsq_at_ita_", t, ".csv", sep = ""), sep = ",", low.names = F, quote = F)
write. table (Varu, file = paste ("markerVar_rseq_at_ita_", t, ".csv", sep = ""), sep = ",", low.names = F, quote = F)
}
After calculation, the weighted GBLUP method approaches stability at the 4th iteration, so subsequent studies will be conducted on the results of the iterations at this time. The correlation coefficient among the 95 individual GEBVs used for validation was 0.958, and these individuals had GEBVs according to Table 6.

２、「魚チップ１号」遺伝子チップサイトが参考グループにおける検証
発明者の既存の参考グループから「魚チップ１号」遺伝子チップの設計サイトを抽出し、これらのサイト情報を利用して加重ＧＢＬＵＰを実施し、かつ５倍の交差検証方法のランダムな組分けを用いて加重ＧＢＬＵＰ予測正確性の評価方法として、被検測者の操作特徴曲線下面積（ＡＵＣ）を加重ＧＢＬＵＰの評価方法で正確性の指標とする。解析モデルは一般化線形混合モデルを用いる。組分けのランダム誤差を減らすために、データセットに１０回の組分けを行い、各組は５回計算する。そのため、延べ５０回計算し、５０回のＡＵＣの平均値を最終的な評価結果とする。 2. Verification of "Fish Chip No. 1" gene chip site in the reference group Extract the design site of "Fish Chip No. 1" gene chip from the existing reference group of the inventor, and use these site information to perform weighted GBLUP. As a method for evaluating the weighted GBLUP prediction accuracy by using random grouping of the cross-validation method of 5 times, the area under the operating characteristic curve (AUC) of the subject is evaluated by the weighted GBLUP evaluation method. It is used as an index of. A generalized linear mixed model is used as the analytical model. In order to reduce the random error of grouping, the data set is grouped 10 times, and each group is calculated 5 times. Therefore, a total of 50 times are calculated, and the average value of 50 times of AUC is used as the final evaluation result.

解析結果によると、ヒラメ参考グループにおいてＳＮＰチップと同様であるマーカーを用いてゲノム選択を実施し、ＡＵＣ（正確性）値が０．８８５で、従来のＢＬＵＰ方法によるＡＵＣ（０．５７９）値より高い。そのため、発明者が設計されるチップサイトを用いてはゲノム選択を順調かつ高効率に実施することができる。 According to the analysis results, genome selection was performed using markers similar to SNP chips in the flatfish reference group, and the AUC (accuracy) value was 0.885, which was higher than the AUC (0.579) value by the conventional BLUP method. high. Therefore, the genome selection can be carried out smoothly and efficiently by using the chip site designed by the inventor.

具体的な操作方法は以下のとおりである：
ｆｃＧＥＮＥ、ＢＥＡＧＬＥとＰＬＩＮＫソフトウェアを利用してヒラメ参考グループから抽出されるチップ設計サイト：欠損サイトを充填しかつ遺伝子型ファイルを出力し、コマンドは以下のとおりである：
ｆｃｇｅｎｅ－－ｐｅｄｇｅｎｏ＿ｏｐ＿Ｒｓｅｑ＿ｃｈｉｐ．ｐｅｄ－－ｍａｐｇｅｎｏ＿ｏｐ＿Ｒｓｅｑ＿ｃｈｉｐ．ｍａｐ－－ｏｆｏｒｍａｔｂｅａｇｌｅ－－ｏｕｔｐｌｉｎｋ２ｂｅａｇｌｅ

ｊａｖａ－Ｘｍｘ５１２０ｍ－ｊａｒｂｅａｇｌｅ．ｊａｒｕｎｐｈａｓｅｄ＝ｐｌｉｎｋ２ｂｅａｇｌｅ．ｂｇｌｍｉｓｓｉｎｇ＝０ｎｉｔｅｒａｔｉｏｎｓ＝２０ｇｐｒｏｂｓ＝ｔｒｕｅｏｕｔ＝ｉｍｐｕｔｅｄ＿ｇｅｎｏ

ｇｕｎｚｉｐｉｍｐｕｔｅｄ＿ｇｅｎｏ．ｐｌｉｎｋ２ｂｅａｇｌｅ．ｂｇｌ．ｐｈａｓｅｄ．ｇｚ

ｆｃｇｅｎｅ－－ｂｇｌｉｍｐｕｔｅｄ＿ｇｅｎｏ．ｐｌｉｎｋ２ｂｅａｇｌｅ．ｂｇｌ．ｐｈａｓｅｄ－－ｐｅｄｉｎｆｏｐｌｉｎｋ２ｂｅａｇｌｅ＿ｐｅｄｉｎｆｏ．ｔｘｔ－－ｓｎｐｉｎｆｏｐｌｉｎｋ２ｂｅａｇｌｅ＿ｓｎｐｉｎｆｏ．ｔｘｔ－－ｏｆｏｒｍａｔｐｌｉｎｋ－－ｏｕｔｂｅａｇｌｅ２ｐｌｉｎｋ

ｐｌｉｎｋ－－ｆｉｌｅｂｅａｇｌｅ２ｐｌｉｎｋ－－ｒｅｃｏｄｅＡ－－ｏｕｔｇｅｎｏｔｙｐｅ＿ｏｐ＿ｃｈｉｐ＿Ｒｓｅｑ
さらにＲに以下のコマンドを入力する：
ｌｉｂｒａｒｙ（ｄａｔａ．ｔａｂｌｅ）
ｇｅｎｏ＜－ｆｒｅａｄ（“ｇｅｎｏｔｙｐｅ＿ｏｐ＿ｃｈｉｐ＿Ｒｓｅｑ．ｒａｗ”）
ｇｅｎｏ［，ｃ（１：６）：＝ＮＵＬＬ］
ｆｗｒｉｔｅ（ｇｅｎｏ， “ｇｅｎｏｔｙｐｅ＿ｏｐ＿ｃｈｉｐ＿Ｒｓｅｑ．ｃｓｖ”，ｓｅｐ＝ “ ，”，ｒｏｗ．ｎａｍｅｓ＝Ｆ，ｑｕｏｔｅ＝Ｆ） The specific operation method is as follows:
Chip design site extracted from flounder reference group using fcGENE, BEAGLE and PLLINK software: Filling the defective site and outputting the genotype file, the command is:
fcgene --- ped geno_op_Rseq_chip. ped －－map geno_op_Rseq_chip. map --- offormat beagle --- out pink2beagle

Java-Xmx5120m-jar beagle. jar unphased = pink2beagle. bgl misising = 0 nitterations = 20 gprobs = true out = imputed_geno

gzipip imputed_geno. pink2beagle. bgl. phased. gz

fcgene －－bgl imputed_geno. pink2beagle. bgl. phased --- pedinfo pink2beagle_pedinfo. pxt --- snpinfo pink2beagle_snpinfo. pxt --- offormat pink --- out beagle2link

pink --file beagle2link --recode A --out genotipe_op_chip_Rseq
Then enter the following command in R:
library (data.table)
geno <-fread ("genotype_op_chip_Rseq.raw")
geno [, c (1: 6): = NULL]
fwrite (geno, “genotype_op_chip_Rseq.csv”, sep = “,”, low.names = F, quote = F)

ａ）Ｒにおいてａ）で取得した遺伝子型ファイルを利用してＲにおいて加重ＧＢＬＵＰを実施する。加重ＧＢＬＵＰの具体的な方法は１）におけるｈ）部分を参照して実施する。 a) In R, weighted GBLUP is performed in R using the genotype file obtained in a). The specific method of weighted GBLUP is carried out with reference to the h) part in 1).

ｂ）構築済みの加重ＧマトリックスをＡＳＲｅｍｌ－Ｒに代入して交差検証方法を行う。交差検証方法前に組分けを実施する必要がある：Ｒにおいて関数ｓａｍｐｌｅ（１：９３１、９３１）を用いてすべての個体にランダムソートを行い、さらにソート後のデジタルを５列に分け、各列に含まれる要素個数はそれぞれ１８６・１８６・１８６・１８６と１８７である。上記の過程を１０回繰り返し、延べ１０のファイルを取得する。この１０のファイルを同一のフォルダに入れて使用に備える。解析は一般化線形混合モデルを用い、異なる検測ロットと個体齢を固定効果として、各個体はランダム効果として適合を実施する。５倍の交差検証方法の具体的な実施方法は以下のとおりである：
＃必要なＲパッケージと関数をアンロードする
ｌｉｂｒａｒｙ（ｐａｒａｌｌｅｌ）
ｌｉｂｒａｒｙ（ａｓｒｅｍｌ）
ｌｉｂｒａｒｙ（ｐＲＯＣ）
＃表現型、Ｇマトリックスインバージョンの三列形式を読み取る
ｐｈｅｎｏ＜－ａｓｒｅｍｌ．ｒｅａｄ．ｔａｂｌｅ（“ｐｈｅｎｏｔｙｐｅ＿９３１．ｃｓｖ”，ｈｅａｄｅｒ＝Ｔ，ｓｅｐ＝“ ，”，ｎａ．ｓｔｒｉｎｇ＝ＮＡ）
ｐｅｄ＜－ａｓｒｅｍｌ．ｒｅａｄ．ｔａｂｌｅ（“ｐｅｄｉｇｒｅｅ＿９３１．ｃｓｖ”，ｈｅａｄｅｒ＝Ｔ，ｓｅｐ＝“ ，”，ｎａ．ｓｔｒｉｎｇ＝ＮＡ）
ａｉｎｖ＜－ａｓｒｅｍｌ．Ａｉｎｖｅｒｓｅ（ｐｅｄ）＄ｇｉｎｖ
Ｇｉｎｖ＜－ｆｒｅａｄ（“…／Ｇｉｎｖ＿ａｔ＿ｉｔｅｒ＿４．ｃｓｖ ”）
ａｔｔｒ（Ｇｉｎｖ，“ｒｏｗＮａｍｅｓ”）＜－ｐａｓｔｅ（ｐｈｅｎｏ［，１］）
＃外部サイクルの回数を設定する
Ｎ＜－１０
＃結果変数を設定する
ｒｅｓ＜－ｍａｔｒｉｘ（ＮＡ，ｎｒｏｗ＝５＊Ｎ，ｎｃｏｌ＝１）
ｃｏｌｎａｍｅｓ（ｒｅｓ）＜－ｃ（”ａｕｃ”）
＃交差検証方法を行いかつ検証結果を出力する
ｆｏｒ（ｉｉｎ１：Ｎ）｛
ｃｏｏｒ＜－ｒｅａｄ．ｔａｂｌｅ（ｆｉｌｅ＝ｐａｓｔｅ（“．／ｃｏｏｒ／ｃｏｏｒ＿”，ｉ， “．ｃｓｖ”，ｓｅｐ＝“ ”），ｓｅｐ＝ “ ，”，ｈｅａｄｅｒ＝Ｆ）
ｃｙｃ＜－ｎｃｏｌ（ｃｏｏｒ）
ｇｅｂｖ＜－ｍａｔｒｉｘ（ＮＡ，ｎｒｏｗ＝ｎｒｏｗ（ｐｈｅｎｏ），ｎｃｏｌ＝ｃｙｃ）
ｆｏｒ（ｊｉｎ１：ｃｙｃ）｛
ｙ＜－ｐｈｅｎｏ
ｙ＄ｓｔａｔｕｓ［ｃｏｏｒ［（１：ｓｕｍ（ｃｏｏｒ［，ｊ］＞０，ｎａ．ｒｍ＝Ｔ）），ｊ］］＜－ＮＡ
ＣＶ＜－ａｓｒｅｍｌ（ｓｔａｔｕｓ～Ｂａｔｃｈ＋Ａｇｅ，
ｒａｎｄｏｍ＝～ｇｉｖ（ＡｎｉｍａｌＩＤ），
ｇｉｎｖｅｒｓｅ＝ｌｉｓｔ（ＡｎｉｍａｌＩＤ＝Ｇｉｎｖ），
ｒｃｏｖ＝～ｕｎｉｔｓ，
ｆａｍｉｌｙ＝ａｓｒｅｍｌ．ｂｉｎｏｍｉａｌ（ｌｉｎｋ＝ “ｌｏｇｉｔ”），
ｄａｔａ＝ｙ，
ｍａｘｉｔｅｒ＝５０）

ｇｅｂｖ［，ｊ］＜－ｃｏｅｆ（ＣＶ）＄ｒａｎｄｏｍ
ｗｒｉｔｅ．ｔａｂｌｅ（ｇｅｂｖ，ｆｉｌｅ＝ｐａｓｔｅ（“ＧＥＢＶｓ＿ｃｈｉｐ＿ｒｅｆ＿ｃｏｏｒ＿”，ｉ， “ ．ｃｓｖ”，ｓｅｐ＝“ ”），ｓｅｐ＝“ ，”，ｒｏｗ．ｎａｍｅｓ＝Ｆ，ｑｕｏｔｅ＝Ｆ）
ｒｅｓ［（ｊ＋（ｉ－１）＊ｃｙｃ），］＜－ｒｏｃ（ａｓ．ｖｅｃｔｏｒ（ｐｈｅｎｏ＄ｓｔａｔｕｓ［ｃｏｏｒ［（１：ｓｕｍ（ｃｏｏｒ［，ｊ］＞０，ｎａ．ｒｍ＝Ｔ）），ｊ］］），ａｓ．ｖｅｃｔｏｒ（ｇｅｂｖ［ｃｏｏｒ［（１：ｓｕｍ（ｃｏｏｒ［，ｊ］＞０，ｎａ．ｒｍ＝Ｔ）），ｊ］，ｊ］））＄ａｕｃ
ｗｒｉｔｅ．ｔａｂｌｅ（ｒｅｓ， “ｒｅｓｕｌｔｓ＿ｏｆ＿ｒｏｃ＿ｏｐ＿ｃｈｉｐ＿ｒｅｆ．ｃｓｖ”，ｓｅｐ＝ “ ，”，ｒｏｗ．ｎａｍｅｓ＝Ｆ，ｑｕｏｔｅ＝Ｆ）
｝
｝
＃従来のＢＬＵＰの推定方法は加重ＧＢＬＵＰと同様で、モデルにおいてａｉｎｖを用いてＧｉｎｖを代替すればよい
上記のコードを実行した後、加重ＧＢＬＵＰ方法によるＡＵＣ平均値は０．８８５で、従来のＢＬＵＰ方法によるＡＵＣ平均値は０．５７９である。５０回の交差検証方法の結果は表７による： b) Substitute the constructed weighted G matrix into ASReml-R to perform cross-validation. It is necessary to carry out grouping before the cross-validation method: Random sort is performed on all individuals using the function sample (1: 931, 931) in R, and the sorted digital is further divided into 5 columns, and each column is divided. The number of elements included in is 186, 186, 186, 186 and 187, respectively. The above process is repeated 10 times to obtain a total of 10 files. Put these 10 files in the same folder to prepare for use. The analysis uses a generalized linear mixed model, with different measurement lots and individual ages as fixed effects, and each individual performing adaptation as a random effect. The specific method of implementing the 5-fold cross-validation method is as follows:
# Unload the required R packages and functions library (parallell)
library (asreml)
library (prOC)
# Phenotype, read the three-column format of G-matrix inversion pheno <-asreml. read. table ("phenotype_931.csv", header = T, sep = ",", na.string = NA)
ped <-asreml. read. table (“pedigree_931.csv”, header = T, sep = “,”, na.string = NA)
ainv <-asreml. Ainverse (ped) $ ginv
Ginv <-fread ("... / Ginv_at_ita_4.csv")
attr (Ginv, "lowNames") <-paste (pheno [, 1])
# Set the number of external cycles N <-10
# Set the result variable res <-matrix (NA, now = 5 * N, ncol = 1)
colnames (res) <-c ("auc")
# Perform cross-validation method and output verification result for (i in 1: N) {
coor <-read. file (file = paste ("./coor/coor_", i, ".csv", sep = ""), sep = ",", header = F)
cyc <-ncol (coor)
gevv <-matrix (NA, now = now (pheno), ncol = cyc)
for (j in 1: cyc) {
y <-pheno
y $ status [coor [(1: sum (coor [, j]> 0, na.rm = T)), j]] <-NA
CV <-asreml (status-Batch + Age,
random = ~ giv (AnimalID),
ginverse = list (AnimalID = Ginv),
rkov = ~ units,
family = asreml. Binomial (link = “logit”),
data = y,
maximizer = 50)

gebv [, j] <-coef (CV) $ random
write. table (gebv, file = paste ("GEBVs_chip_ref_coor_", i, ".csv", sep = ""), sep = ",", low.names = F, quote = F)
res [(j + (i-1) * cyc),] <-loc (as.vector (pheno $ status [coor [(1: sum (coor [, j]> 0, na.rm = T))), j]]), as.vector (gebv [coor [(1: sum (coor [, j]> 0, na.rm = T)), j], j])) $ auc
write. table (res, "results_of_loc_op_chip_ref.csv", sep = ",", low.names = F, quote = F)
}
}
# Conventional BLUP estimation methods are similar to weighted GBLUPs, and ainvs can be used to replace Ginvs in the model. After executing the above code, the AUC mean by weighted GBLUP methods is 0.885, which is a conventional BLUP. The AUC averages by method are 0.579. The results of the 50 cross-validation methods are shown in Table 7.

３、「魚チップ１号」遺伝子チップがヒラメ耐病性育種における応用
候補個体のゲノム推定育種価（ＧＥＢＶ）を推定するために、まず候補個体遺伝子型（「魚チップ１号」遺伝子チップ分類から取得される）を発明者の既存の参考グループ遺伝子型と組み合わせ、さらにＲを用いて加重Ｇマトリックスを構築し、最後に用意済みの加重Ｇマトリックスと表現型データをＡＳＲｅｍｌ－Ｒに代入し、加重ＧＢＬＵＰ方法を用いて各家系成体のＧＥＢＶを推定し、さらに成体ＧＥＢＶの平均値を相応の家系のＧＥＢＶとする。各家系を感染生存率に従って高生存率家系（生存率が５５％より高い）と低生存率家系（生存率が５５％より低い）、かつ各家系ＧＥＢＶと感染生存率の間のＡＵＣ値を計算し、さらに該ＡＵＣ値を２で取得したＡＵＣ値と比較し、近いひいては２で取得したＡＵＣより高い場合、発明者が設計される遺伝子チップがゲノム選択技術の要件を満たしかつヒラメ耐病性選別において優れた応用効果を有することを示す。各家系ＧＥＢＶと感染生存率の間のＡＵＣ値を計算しかつ該ＡＵＣ値を２で取得したＡＵＣ値と比較し、それにより発明者が設計される遺伝子チップとゲノム選択技術がヒラメ耐病性選別における実際の応用効果を検証するために用いられるもの。ＡＵＣ値を推定する前、子の代の感染生存率を

で変換を実施する。変換後、平均値より高い家系の生存率を１と記載し、平均値より低いものを０と記載する。遺伝子チップ方法がマーカーに対する要件を満たし、すべての候補個体の分類結果に組み合わせを行い、各チップの分類結果を単独で参考グループ遺伝子型と合併する必要がある。 3. Application of "Fish Chip No. 1" gene chip in scallop disease-resistant breeding In order to estimate the genome estimated breeding value (GEBV) of candidate individuals, first obtain the candidate individual genotype ("Fish Chip No. 1" gene chip classification). Is combined with the inventor's existing reference group genotype, and a weighted G matrix is constructed using R. Finally, the prepared weighted G matrix and phenotype data are substituted into ASReml-R, and the weighted GBLUP. The GEBV of each adult family is estimated using the method, and the average value of the adult GEBV is taken as the GEBV of the corresponding family. High survival rate (survival rate is higher than 55%) and low survival rate family (survival rate is lower than 55%) according to the infection survival rate of each family, and AUC value between GEBV and infection survival rate of each family. And further compare the AUC value with the AUC value obtained in 2, and if it is close and thus higher than the AUC obtained in 2, the gene chip designed by the inventor meets the requirements of genome selection technology and is resistant to fluff. It is shown that it has an excellent application effect in sorting. The AUC value between each family GEBV and infection survival rate is calculated and the AUC value is compared with the AUC value obtained in 2, and the gene chip and genome selection technology designed by the inventor are used for flounder disease resistance selection. Used to verify the actual application effect in. Before estimating the AUC value, determine the infection survival rate of the offspring.

Perform the conversion with. After the conversion, the survival rate of the family whose family value is higher than the average value is described as 1, and the survival rate of the family whose value is lower than the average value is described as 0. It is necessary that the gene chip method meets the requirements for the marker, the classification results of all candidate individuals are combined, and the classification results of each chip are independently combined with the reference group genotype.

解析結果によると１６の候補群の子の代家系における６の高生存率家系の平均生存率が６２．４％であるおよび１０の低生存率家系の平均的な生存率が３３．４７％（表８）であることがわかる。そのうち、高生存率家系成体の平均的なＧＥＢＶが２．１０で、低生存率家系成体の平均的なＧＥＢＶが１．３４である。計算はよると、これらのヒラメ家系のＧＥＢＶ値を利用してはその感染生存率が予測される正確性が０．７９４に達することができ、理論値に近い。そのため、発明者が設計される遺伝子チップはヒラメ耐病性性状の選別に効果的に応用することができる。 According to the analysis results, the average survival rate of 6 high survival rate families in the offspring of 16 candidate groups is 62.4% and the average survival rate of 10 low survival rate families is 33.47% ( It can be seen that Table 8). Among them, the average GEBV of adults with high survival rate is 2.10, and the average GEBV of adults with low survival rate is 1.34. According to the calculation, the accuracy of predicting the infection survival rate can reach 0.794 by using the GEBV value of these flounder families, which is close to the theoretical value. Therefore, the gene chip designed by the inventor can be effectively applied to the selection of flounder disease resistance properties.

具体的な操作方法は以下のとおりである：
ＰＬＩＮＫとＲを利用して候補個体分類ファイルを利用し、さらに候補個体から後続検証に用いられる個体を選定し、これらの個体情報を一つのテキストファイルに貯蔵し、家系番号・個体番号・父本番号・母本番号・性別と表現型値の配列に従ってテキストファイルを用意し、各行に一つの個体で、各個体の各項目情報はｔａｂｌｅ区切り記号を用いて区切りを実施する。発明を実施するための形態：
＃各ＳＮＰチップの分類結果を読み取り、結果はｘｘｘ．ｂｅｄ、ｘｘｘ．ｂｉｍとｘｘｘ．ｆａｍに貯蔵される。
ｐｌｉｎｋ－－ｖｃｆｏｐ１．ｖｃｆ－－ｍａｋｅ－ｂｅｄ－－ｏｕｔｏｐ１
ｐｌｉｎｋ－－ｖｃｆｏｐ２．ｖｃｆ－－ｍａｋｅ－ｂｅｄ－－ｏｕｔｏｐ２
ｐｌｉｎｋ－－ｖｃｆｏｐ３．ｖｃｆ－－ｍａｋｅ－ｂｅｄ－－ｏｕｔｏｐ３

Ｒにおいて各ファイルにおけるＳＮＰの命名方法を変換し、具体的な操作方法は以下のとおりである：
＃Ｒパッケージをアンロードする
ｌｉｂｒａｒｙ（ｄａｔａ．ｔａｂｌｅ）
＃データを読み取る
ｏｐ１＜－ｆｒｅａｄ（“ｏｐ１．ｂｉｍ”，ｈｅａｄｅｒ＝Ｆ）
ｏｐ２＜－ｆｒｅａｄ（“ｏｐ２．ｂｉｍ”，ｈｅａｄｅｒ＝Ｆ）
ｏｐ３＜－ｆｒｅａｄ（“ｏｐ３．ｂｉｍ”，ｈｅａｄｅｒ＝Ｆ）
＃ＳＮＰ名称を変換する
ｏｐ１＄Ｖ２＜－ｐａｓｔｅ（ｐａｓｔｅ（ｒｅｐ（“ｒｓ”，ｎｒｏｗ（ｏｐ１）），ｏｐ１＄Ｖ１，ｓｅｐ＝“ ”），ｏｐ１＄Ｖ４，ｓｅｐ＝ “：”）
ｏｐ２＄Ｖ２＜－ｐａｓｔｅ（ｐａｓｔｅ（ｒｅｐ（“ｒｓ”，ｎｒｏｗ（ｏｐ２）），ｏｐ２＄Ｖ１，ｓｅｐ＝“ ”），ｏｐ２＄Ｖ４，ｓｅｐ＝ “：”）
ｏｐ３＄Ｖ２＜－ｐａｓｔｅ（ｐａｓｔｅ（ｒｅｐ（“ｒｓ”，ｎｒｏｗ（ｏｐ３）），ｏｐ３＄Ｖ１，ｓｅｐ＝“ ”），ｏｐ３＄Ｖ４，ｓｅｐ＝ “：”）
＃各チップが取得した分類情報を出力する
ｗｒｉｔｅ．ｔａｂｌｅ（ｏｐ１＄Ｖ２， “ｓｎｐｓ＿ｏｐ１．ｔｘｔ”，ｓｅｐ＝ “￥ｔ”，ｃｏｌ．ｎａｍｅｓ＝Ｆ，ｒｏｗ．ｎａｍｅｓ＝Ｆ，ｑｕｏｔｅ＝Ｆ）
ｗｒｉｔｅ．ｔａｂｌｅ（ｏｐ２＄Ｖ２， “ｓｎｐｓ＿ｏｐ２．ｔｘｔ”，ｓｅｐ＝ “￥ｔ”，ｃｏｌ．ｎａｍｅｓ＝Ｆ，ｒｏｗ．ｎａｍｅｓ＝Ｆ，ｑｕｏｔｅ＝Ｆ）
ｗｒｉｔｅ．ｔａｂｌｅ（ｏｐ３＄Ｖ２， “ｓｎｐｓ＿ｏｐ３．ｔｘｔ”，ｓｅｐ＝ “￥ｔ”，ｃｏｌ．ｎａｍｅｓ＝Ｆ，ｒｏｗ．ｎａｍｅｓ＝Ｆ，ｑｕｏｔｅ＝Ｆ）
＃整理後のｂｉｍファイルを出力する
ｆｗｒｉｔｅ（ｏｐ１， “ｏｐ１．ｂｉｍ”，ｓｅｐ＝ “￥ｔ”，ｃｏｌ．ｎａｍｅｓ＝Ｆ，ｒｏｗ．ｎａｍｅｓ＝Ｆ，ｑｕｏｔｅ＝Ｆ）
ｆｗｒｉｔｅ（ｏｐ２， “ｏｐ２．ｂｉｍ”，ｓｅｐ＝ “￥ｔ”，ｃｏｌ．ｎａｍｅｓ＝Ｆ，ｒｏｗ．ｎａｍｅｓ＝Ｆ，ｑｕｏｔｅ＝Ｆ）
ｆｗｒｉｔｅ（ｏｐ３， “ｏｐ３．ｂｉｍ”，ｓｅｐ＝ “￥ｔ”，ｃｏｌ．ｎａｍｅｓ＝Ｆ，ｒｏｗ．ｎａｍｅｓ＝Ｆ，ｑｕｏｔｅ＝Ｆ）
＃Ｒを終了する

検証個体情報を用意し、かつタイトルが「ｓｅｌｅｃｔｅｄ＿ｉｎｄｉ．ｔｘｔ」のファイルに入れ、ファイルは以下の方式に従って整理する：
１７０１．ＣＥＬ１７０１．ＣＥＬ０００－９
１７０２．ＣＥＬ１７０２．ＣＥＬ０００－９
＃ＰＬＩＮＫソフトウェアを利用して各ＳＮＰチップに含まれる個体を抽出し、その方式は以下のとおりである：
ｐｌｉｎｋ－－ｂｆｉｌｅｏｐ１－－ｋｅｅｐｓｅｌｅｃｔｅｄ＿ｉｎｄｉ．ｔｘｔ－－ｍａｋｅ－ｂｅｄ－－ｏｕｔｏｐ＿ｃａｎ＿１
ｐｌｉｎｋ－－ｂｆｉｌｅｏｐ２－－ｋｅｅｐｓｅｌｅｃｔｅｄ＿ｉｎｄｉ．ｔｘｔ－－ｍａｋｅ－ｂｅｄ－－ｏｕｔｏｐ＿ｃａｎ＿２
ｐｌｉｎｋ－－ｂｆｉｌｅｏｐ３－－ｋｅｅｐｓｅｌｅｃｔｅｄ＿ｉｎｄｉ．ｔｘｔ－－ｍａｋｅ－ｂｅｄ－－ｏｕｔｏｐ＿ｃａｎ＿３

＃検証に用いられる個体の遺伝子型ファイルを出力する
ｐｌｉｎｋ－－ｂｆｉｌｅｏｐ＿ｃａｎ＿１－－ｒｅｃｏｄｅＡ－－ｏｕｔｏｐ１
ｐｌｉｎｋ－－ｂｆｉｌｅｏｐ＿ｃａｎ＿２－－ｒｅｃｏｄｅＡ－－ｏｕｔｏｐ２
ｐｌｉｎｋ－－ｂｆｉｌｅｏｐ＿ｃａｎ＿３－－ｒｅｃｏｄｅＡ－－ｏｕｔｏｐ３
さらにＲにおいて以下のコマンドを用いて遺伝子型ファイルを出力する：
ｌｉｂｒａｒｙ（ｄａｔａ．ｔａｂｌｅ）
ｏｐ１＜－ｆｒｅａｄ（“ｏｐ１．ｒａｗ”）
ｏｐ２＜－ｆｒｅａｄ（“ｏｐ２．ｒａｗ”）
ｏｐ３＜－ｆｒｅａｄ（“ｏｐ３．ｒａｗ”）
ｏｐ１［，ｃ（１：６）：＝ＮＵＬＬ］
ｏｐ２［，ｃ（１：６）：＝ＮＵＬＬ］
ｏｐ３［，ｃ（１：６）：＝ＮＵＬＬ］
ｆｗｒｉｔｅ（ｏｐ１， “ｇｅｎｏ＿ｏｐ１＿ｃａｎ．ｃｓｖ”，ｓｅｐ＝ “ ，”，ｒｏｗ．ｎａｍｅｓ＝Ｆ，ｑｕｏｔｅ＝Ｆ）
ｆｗｒｉｔｅ（ｏｐ２， “ｇｅｎｏ＿ｏｐ２＿ｃａｎ．ｃｓｖ”，ｓｅｐ＝ “ ，”，ｒｏｗ．ｎａｍｅｓ＝Ｆ，ｑｕｏｔｅ＝Ｆ）
ｆｗｒｉｔｅ（ｏｐ３， “ｇｅｎｏ＿ｏｐ３＿ｃａｎ．ｃｓｖ”，ｓｅｐ＝ “ ，”，ｒｏｗ．ｎａｍｅｓ＝Ｆ，ｑｕｏｔｅ＝Ｆ）
ａ）ＰＬＩＮＫとＲを利用して参考グループ遺伝子型をそれぞれ各チップ候補個体遺伝子型と組み合わせを実施し、具体的な操作方法は以下のとおりである：
ｐｌｉｎｋ－－ｂｆｉｌｅ …／ｏｐ＿Ｒｅｆ－－ｅｘｔｒａｃｔｓｎｐｓ＿ｏｐ１．ｔｘｔ－－ｒｅｃｏｄｅＡ－－ｏｕｔｏｐ＿Ｒｅｆ＿ｏｐ１
ｐｌｉｎｋ－－ｂｆｉｌｅ …／ｏｐ＿Ｒｅｆ－－ｅｘｔｒａｃｔｓｎｐｓ＿ｏｐ２．ｔｘｔ－－ｒｅｃｏｄｅＡ－－ｏｕｔｏｐ＿Ｒｅｆ＿ｏｐ２
ｐｌｉｎｋ－－ｂｆｉｌｅ …／ｏｐ＿Ｒｅｆ－－ｅｘｔｒａｃｔｓｎｐｓ＿ｏｐ３．ｔｘｔ－－ｒｅｃｏｄｅＡ－－ｏｕｔｏｐ＿Ｒｅｆ＿ｏｐ３
さらにＲにおいて以下のコマンドを用い、遺伝子型ファイルを処理し、出力する：
ｌｉｂｒａｒｙ（ｄａｔａ．ｔａｂｌｅ）
＃参考グループ遺伝子型を読み取りかつそれに処理を実施する
ｏｐ１＿ｒｅｆ＜－ｆｒｅａｄ（“ｏｐ＿Ｒｅｆ＿ｏｐ１．ｒａｗ”）
ｏｐ２＿ｒｅｆ＜－ｆｒｅａｄ（“ｏｐ＿Ｒｅｆ＿ｏｐ２．ｒａｗ”）
ｏｐ３＿ｒｅｆ＜－ｆｒｅａｄ（“ｏｐ＿Ｒｅｆ＿ｏｐ３．ｒａｗ”）
ｏｐ１＿ｒｅｆ［，ｃ（１：６）：＝ＮＵＬＬ］
ｏｐ２＿ｒｅｆ［，ｃ（１：６）：＝ＮＵＬＬ］
ｏｐ３＿ｒｅｆ［，ｃ（１：６）：＝ＮＵＬＬ］
ｏｐ１＿ｒｅｆ＜－ａｓ．ｍａｔｒｉｘ（ｏｐ１＿ｒｅｆ）
ｏｐ２＿ｒｅｆ＜－ａｓ．ｍａｔｒｉｘ（ｏｐ２＿ｒｅｆ）
ｏｐ３＿ｒｅｆ＜－ａｓ．ｍａｔｒｉｘ（ｏｐ３＿ｒｅｆ）
＃候補個体遺伝子型を処理する：読み取り、組み合わせ
ｏｐ１＿ｃａｎ＜－ａｓ．ｍａｔｒｉｘ（ｆｒｅａｄ（“ｇｅｎｏ＿ｏｐ１＿ｃａｎ．ｃｓｖ”））
ｏｐ２＿ｃａｎ＜－ａｓ．ｍａｔｒｉｘ（ｆｒｅａｄ（“ｇｅｎｏ＿ｏｐ２＿ｃａｎ．ｃｓｖ”））
ｏｐ３＿ｃａｎ＜－ａｓ．ｍａｔｒｉｘ（ｆｒｅａｄ（“ｇｅｎｏ＿ｏｐ３＿ｃａｎ．ｃｓｖ”））
ｇｅｎｏ＿ｏｐ１＜－ｒｂｉｎｄ（ｏｐ１＿ｃａｎ，ｏｐ１＿ｒｅｆ）
ｇｅｎｏ＿ｏｐ２＜－ｒｂｉｎｄ（ｏｐ２＿ｃａｎ，ｏｐ２＿ｒｅｆ）
ｇｅｎｏ＿ｏｐ３＜－ｒｂｉｎｄ（ｏｐ３＿ｃａｎ，ｏｐ３＿ｒｅｆ）
ｗｒｉｔｅ．ｔａｂｌｅ（ｇｅｎｏ＿ｏｐ１， “ｇｅｎｏ＿ｏｐ１．ｃｓｖ”，ｓｅｐ＝ “ ，”，ｒｏｗ．ｎａｍｅｓ＝Ｆ，ｑｕｏｔｅ＝Ｆ）
ｗｒｉｔｅ．ｔａｂｌｅ（ｇｅｎｏ＿ｏｐ２， “ｇｅｎｏ＿ｏｐ２．ｃｓｖ”，ｓｅｐ＝“ ，”，ｒｏｗ．ｎａｍｅｓ＝Ｆ，ｑｕｏｔｅ＝Ｆ）
ｗｒｉｔｅ．ｔａｂｌｅ（ｇｅｎｏ＿ｏｐ３， “ｇｅｎｏ＿ｏｐ３．ｃｓｖ”，ｓｅｐ＝ “ ，”，ｒｏｗ．ｎａｍｅｓ＝Ｆ，ｑｕｏｔｅ＝Ｆ） The specific operation method is as follows:
A candidate individual classification file is used using PLLNK and R, and individuals to be used for subsequent verification are selected from the candidate individuals, and these individual information is stored in one text file, and the family number, individual number, and father's book are stored. A text file is prepared according to an array of numbers, mother numbers, genders, and phenotypic values, and each item is separated by one individual on each line, and each item information of each individual is separated by using a table delimiter. A form for carrying out the invention:
# Read the classification result of each SNP chip, and the result is xxx. bed, xxx. Bim and xxx. Stored in fam.
pink --- vcf op1. vcf －－make－bed －－out op1
pink －－vcf op2. vcf －－make－bed －－out op2
pink －－vcf op3. vcf －－make－bed －－out op3

The SNP nomenclature in each file is converted in R, and the specific operation method is as follows:
#R Unloading the package library (data.table)
# Read data op1 <-fread (“op1.bim”, header = F)
op2 <-fread (“op2.bim”, header = F)
op3 <-fread (“op3.bim”, header = F)
# Convert SNP name op1 $ V2 <-paste (paste (rep ("rs", now (op1)), op1 $ V1, sep = ""), op1 $ V4, sep = ":")
op2 $ V2 <-paste (paste (rep ("rs", now (op2)), op2 $ V1, sep = ""), op2 $ V4, sep = ":")
op3 $ V2 <-paste (paste (rep ("rs", now (op3)), op3 $ V1, sep = ""), op3 $ V4, sep = ":")
# Write to output the classification information acquired by each chip. table (op1 $ V2, "snaps_op1.txt", sep = "\ t", col.names = F, row.names = F, quote = F)
write. table (op2 $ V2, “snaps_op2.txt”, sep = “¥ t”, col.names = F, row.names = F, quote = F)
write. table (op3 $ V2, “snaps_op3.txt”, sep = “¥ t”, col.names = F, row.names = F, quote = F)
# Output the organized Bim file fwrite (op1, "op1.bim", sep = "\ t", col.names = F, row.names = F, quote = F)
file (op2, “op2.bim”, sep = “¥ t”, col.names = F, low.names = F, quote = F)
file (op3, “op3.bim”, sep = “¥ t”, col.names = F, low.names = F, quote = F)
End # R

Prepare the verification individual information and put it in a file with the title "selected_indi.txt", and organize the files according to the following method:
1701. CEL 1701. CEL 0 0 0-9
1702. CEL 1702. CEL 0 0 0-9
The individual contained in each SNP chip is extracted using #LINK software, and the method is as follows:
pink －－bfile op1 －－ keep selected_indi. pxt －－make-bed －－out op_can_1
pink －－bfile op2 －－ keep selected_indi. pxt －－make-bed －－out op_can_2
pink －－bfile op3 －－ keep selected_indi. pxt －－make-bed －－out op_can_3

# Output the genotype file of the individual used for verification pink －－bfile op_can_1 －－recode A －－out op1
pink －－bfile op_can_2 －－recode A －－out op2
pink －－bfile op_can_3 －－recode A －－out op3
Furthermore, in R, the genotype file is output using the following command:
library (data.table)
op1 <-fread (“op1.raw”)
op2 <-fread (“op2.raw”)
op3 <-fread (“op3.raw”)
op1 [, c (1: 6): = NULL]
op2 [, c (1: 6): = NULL]
op3 [, c (1: 6): = NULL]
fwrite (op1, “geno_op1_can.csv”, sep = “,”, low.names = F, quote = F)
fwrite (op2, “geno_op2_can.csv”, sep = “,”, low.names = F, quote = F)
fwrite (op3, “geno_op3_can.csv”, sep = “,”, low.names = F, quote = F)
a) Using PLLNK and R, the reference group genotypes are combined with each chip candidate individual genotype, and the specific operation method is as follows:
pink －－bfile… / op_Ref －－extract snaps_op1. pxt --recode A --out op_Ref_op1
pink －－bfile… / op_Ref －－extract snaps_op2. pxt －－recode A －－out op_Ref_op2
pink －－bfile… / op_Ref －－extract snaps_op3. pxt －－recode A －－out op_Ref_op3
In addition, use the following command in R to process and output the genotype file:
library (data.table)
# Reference group Read the genotype and perform processing on it op1_ref <-fread (“op_Ref_op1.raw”)
op2_ref <-fread (“op_Ref_op2.raw”)
op3_ref <-fread (“op_Ref_op3.raw”)
op1_ref [, c (1: 6): = NULL]
op2_ref [, c (1: 6): = NULL]
op3_ref [, c (1: 6): = NULL]
op1_ref <-as. matrix (op1_ref)
op2_ref <-as. matrix (op2_ref)
op3_ref <-as. matrix (op3_ref)
# Process candidate individual genotypes: read, combine op1_can <-as. matrix (fread (“geno_op1_can.csv”))
op2_can <-as. matrix (fread (“geno_op2_can.csv”))
op3_can <-as. matrix (fread (“geno_op3_can.csv”))
geno_op1 <-rbind (op1_can, op1_ref)
geno_op2 <-rbind (op2_can, op2_ref)
geno_op3 <-rbind (op3_can, op3_ref)
write. table (geno_op1, “geno_op1.csv”, sep = “,”, low.names = F, quote = F)
write. table (geno_op2, "geno_op2.csv", sep = ",", low.names = F, quote = F)
write. table (geno_op3, "geno_op3.csv", sep = ",", low.names = F, quote = F)

ｂ）ｂ）における処理済みの４つの遺伝子型ファイルを利用し、Ｒにおいてそれぞれ加重Ｇマトリックスを構築し、加重Ｇマトリックスの構築方法は１）における記述と同様である b) Using the four processed genotype files in b), a weighted G matrix is constructed in R, respectively, and the method for constructing the weighted G matrix is the same as the description in 1).

ｃ）ＡＳＲｅｍｌ－Ｒを用いて候補個体のＧＥＢＶを推定し、コードは以下のとおりである：
＃必要なＲパッケージと関数をアンロードする
ｌｉｂｒａｒｙ（ｄａｔａ．ｔａｂｌｅ）
ｌｉｂｒａｒｙ（ａｓｒｅｍｌ）
＃＃＃ｏｐ１＃＃＃
ｐｈｅｎｏ＜－ａｓｒｅｍｌ．ｒｅａｄ．ｔａｂｌｅ（“ｐｈｅｎｏｔｙｐｅ＿ｏｐ１．ｃｓｖ”，ｓｅｐ＝ “ ，”，ｈｅａｄｅｒ＝Ｔ）
Ｇｉｎｖ＜－ｆｒｅａｄ（“…／Ｇｉｎｖ＿ｏｐ１．ｃｓｖ”）
ａｔｔｒ（Ｇｉｎｖ， “ｒｏｗＮａｍｅｓ”）＜－ｐａｓｔｅ（ｐｈｅｎｏ［，１］）
ｏｐ１＜－ａｓｒｅｍｌ（ｓｔａｔｕｓ～Ｂａｔｃｈ＋Ａｇｅ，ｒａｎｄｏｍ＝～ｇｉｖ（ＡｎｉｍａｌＩＤ），ｇｉｎｖｅｒｓｅ＝ｌｉｓｔ（ＡｎｉｍａｌＩＤ＝Ｇｉｎｖ），ｒｃｏｖ＝～ｕｎｉｔｓ，ｆａｍｉｌｙ＝ａｓｒｅｍｌ．ｂｉｎｏｍｉａｌ（ｌｉｎｋ＝ “ｌｏｇｉｔ”），ｎａ．ｍｅｔｈｏｄ．Ｘ＝ “ｏｍｉｔ”，ｄａｔａ＝ｐｈｅｎｏ，ｍａｘｉｔｅｒ＝５０）
ｗｒｉｔｅ．ｔａｂｌｅ（ｃｏｅｆ（ｏｐ１）＄ｒａｎｄｏｍ， “ｇｅｂｖ＿ｏｐ１．ｃｓｖ”，ｓｅｐ＝ “ ，”，ｃｏｌ．ｎａｍｅｓ＝Ｆ，ｑｕｏｔｅ＝Ｆ）

＃＃＃ｏｐ２＃＃＃
ｐｈｅｎｏ＜－ａｓｒｅｍｌ．ｒｅａｄ．ｔａｂｌｅ（“ ｐｈｅｎｏｔｙｐｅ＿ｏｐ２．ｃｓｖ”，ｓｅｐ＝ “ ，”，ｈｅａｄｅｒ＝Ｔ）
Ｇｉｎｖ＜－ｆｒｅａｄ（“ …／Ｇｉｎｖ＿ｏｐ２．ｃｓｖ”）
ａｔｔｒ（Ｇｉｎｖ， “ ｒｏｗＮａｍｅｓ”）＜－ｐａｓｔｅ（ｐｈｅｎｏ［，１］）
ｏｐ２＜－ａｓｒｅｍｌ（ｓｔａｔｕｓ～Ｂａｔｃｈ＋Ａｇｅ，ｒａｎｄｏｍ＝～ｇｉｖ（ＡｎｉｍａｌＩＤ），ｇｉｎｖｅｒｓｅ＝ｌｉｓｔ（ＡｎｉｍａｌＩＤ＝Ｇｉｎｖ），ｒｃｏｖ＝～ｕｎｉｔｓ，ｆａｍｉｌｙ＝ａｓｒｅｍｌ．ｂｉｎｏｍｉａｌ（ｌｉｎｋ＝ “ｌｏｇｉｔ”），ｎａ．ｍｅｔｈｏｄ．Ｘ＝ “ ｏｍｉｔ”，ｄａｔａ＝ｐｈｅｎｏ，ｍａｘｉｔｅｒ＝５０）
ｗｒｉｔｅ．ｔａｂｌｅ（ｃｏｅｆ（ｏｐ２）＄ｒａｎｄｏｍ， “ ｇｅｂｖ＿ｏｐ２．ｃｓｖ”，ｓｅｐ＝ “ ，”，ｃｏｌ．ｎａｍｅｓ＝Ｆ，ｑｕｏｔｅ＝Ｆ）
＃＃＃ｏｐ３＃＃＃
ｐｈｅｎｏ＜－ａｓｒｅｍｌ．ｒｅａｄ．ｔａｂｌｅ（“ ｐｈｅｎｏｔｙｐｅ＿ｏｐ３．ｃｓｖ”，ｓｅｐ＝ “ ，”，ｈｅａｄｅｒ＝Ｔ）
Ｇｉｎｖ＜－ｆｒｅａｄ（“ …／Ｇｉｎｖ＿ｏｐ３．ｃｓｖ”）
ａｔｔｒ（Ｇｉｎｖ， “ｒｏｗＮａｍｅｓ”）＜－ｐａｓｔｅ（ｐｈｅｎｏ［，１］）
ｏｐ３＜－ａｓｒｅｍｌ（ｓｔａｔｕｓ～Ｂａｔｃｈ＋Ａｇｅ，ｒａｎｄｏｍ＝～ｇｉｖ（ＡｎｉｍａｌＩＤ），ｇｉｎｖｅｒｓｅ＝ｌｉｓｔ（ＡｎｉｍａｌＩＤ＝Ｇｉｎｖ），ｒｃｏｖ＝～ｕｎｉｔｓ，ｆａｍｉｌｙ＝ａｓｒｅｍｌ．ｂｉｎｏｍｉａｌ（ｌｉｎｋ＝ “ｌｏｇｉｔ”），ｎａ．ｍｅｔｈｏｄ．Ｘ＝ “ｏｍｉｔ”，ｄａｔａ＝ｐｈｅｎｏ，ｍａｘｉｔｅｒ＝５０）
ｗｒｉｔｅ．ｔａｂｌｅ（ｃｏｅｆ（ｏｐ３）＄ｒａｎｄｏｍ， “ｇｅｂｖ＿ｏｐ３．ｃｓｖ”，ｓｅｐ＝ “ ，”，ｃｏｌ．ｎａｍｅｓ＝Ｆ，ｑｕｏｔｅ＝Ｆ） c) Estimate the GEBV of a candidate individual using ASReml-R and the code is:
# Unload the required R packages and functions library (data.table)
library (asreml)
#### op1 ####
pheno <-asreml. read. table ("phenotype_op1.csv", sep = ",", header = T)
Ginv <-fread ("... / Ginv_op1.csv")
attr (Ginv, “rowNames”) <-paste (pheno [, 1])
op1 <-asreml (status ~ Batch + Age, random = ~ give (AnimalID), giverse = list (AnimalID = Ginv), logov = ~ units, random = inl. X = “omit”, data = pheno, maximizer = 50)
write. table (coef (op1) $ random, "gebv_op1.csv", sep = ",", col.names = F, quote = F)

#### op2 ####
pheno <-asreml. read. table ("phenotype_op2.csv", sep = ",", header = T)
Ginv <-fread (“… / Ginv_op2.csv”)
attr (Ginv, "lowNames") <-paste (pheno [, 1])
op2 <-asreml (status ~ Batch + Age, random = ~ give (AnimalID), giverse = list (AnimalID = Ginv), logov = ~ units, random = inl. X = “omit”, data = pheno, maximizer = 50)
write. table (coef (op2) $ random, "gebv_op2.csv", sep = ",", col.names = F, quote = F)
#### op3 ####
pheno <-asreml. read. table ("phenotype_op3.csv", sep = ",", header = T)
Ginv <-fread (“… / Ginv_op3.csv”)
attr (Ginv, “rowNames”) <-paste (pheno [, 1])
op3 <-asreml (status ~ Batch + Age, random = ~ give (AnimalID), giverse = list (AnimalID = Ginv), logov = ~ units, random = inl. X = “omit”, data = pheno, maximizer = 50)
write. table (coef (op3) $ random, "gebv_op3.csv", sep = ",", col.names = F, quote = F)

ｄ）推定されるすべての候補個体のＧＥＢＶに従って相応の家系ＧＥＢＶを計算し、かつ公式

を用いて各家系の感染生存率に変換を行い、変換後の平均値より高い家系の生存率を１に設定し、平均値より低いものを０に設定する。最後に、各家系ＧＥＢＶと変換後の生存率の間のＡＵＣ値を計算し、ＡＵＣ値の計算方法は以下のとおりである：
＃必要なＲパッケージをアンロードする
ｌｉｂｒａｒｙ（ｄａｔａ．ｔａｂｌｅ）
ｌｉｂｒａｒｙ（ｐＲＯＣ）
＃各家系ＧＥＢＶおよび相応の感染生存率を一つのファイルに整理し、家系番号・ＧＥＢＶと変換後の感染生存率の配列に従って配列し、各行は一つの家系情報のみを含み、整理後のファイルを読み取る
ｒｅｓ＜－ｆｒｅａｄ（“…／ｇｅｂｖ＿ａｎｄ＿ｓｒ＿ｏｐ＿ｃａｎ．ｃｓｖ”）
＃子の代の家系生存率を変換する
ｒｅｓ＄ＳＲ＿ｔｒａｎｓ＜－ｅｘｐ（ｒｅｓ＄ＳＲ）／（１＋ｅｘｐ（ｒｅｓ＄ＳＲ））
＃変換後の生存率を平均値に従ってさらに０と１に変換する
ＳＲ＿ｂｉｎａｒｙ＜－ｍａｔｒｉｘ（ＮＡ，ｎｒｏｗ＝ｎｒｏｗ（ｒｅｓ），ｎｃｏｌ＝１）
ＳＲ＿ｂｉｎａｒｙ［ｗｈｉｃｈ（ｒｅｓ＄ＳＲ＿ｔｒａｎｓ＞ｍｅａｎ（ｒｅｓ＄ＳＲ＿ｔｒａｎｓ）），］＜－１
ＳＲ＿ｂｉｎａｒｙ［ｗｈｉｃｈ（ｒｅｓ＄ＳＲ＿ｔｒａｎｓ＜ｍｅａｎ（ｒｅｓ＄ＳＲ＿ｔｒａｎｓ）），］＜－０
＃ＡＵＣ値を計算する
ｒｏｃ（ＳＲ＿ｂｉｎａｒｙ［，１］，ｒｅｓ＄ＧＥＢＶ）
計算を経て、１６のヒラメ家系のＧＥＢＶを取得し、計算は各家系ＧＥＢＶと相応の感染生存率の間のＡＵＣ（正確性）が０．７９４であることを示す。各家系ＧＥＢＶおよび感染生存率は表８による。 d) Calculate and formulate the appropriate pedigree GEBV according to the estimated GEBV of all candidate individuals

Is used to convert the infection survival rate of each family, the survival rate of the family higher than the average value after conversion is set to 1, and the survival rate of the family lower than the average value is set to 0. Finally, the AUC value between each family GEBV and the post-conversion survival rate is calculated, and the calculation method of the AUC value is as follows:
# Unload the required R package library (data.table)
library (prOC)
# Organize each family GEBV and corresponding infection survival rate into one file, arrange according to the sequence of family number / GEBV and converted infection survival rate, each line contains only one family information, and the file after organization Read res <-fread ("... / gebv_and_sr_op_can.csv")
# Convert the family survival rate of the child's generation res $ SR_trans <-exp (res $ SR) / (1 + exp (res $ SR))
# Convert the survival rate after conversion to 0 and 1 according to the average value SR_binary <-matrix (NA, now = now (res), ncol = 1)
SR_binary [which (res $ SR_trans> mean (res $ SR_trans)),] <-1
SR_binary [which (res $ SR_trans <mean (res $ SR_trans)),] <-0
# Loc to calculate AUC value (SR_binary [, 1], res $ GEBV)
Through the calculations, GEBV of 16 flounder families is obtained, and the calculation shows that the AUC (accuracy) between each family GEBV and the corresponding infection survival rate is 0.794. Table 8 shows the GEBV and infection survival rate of each family.

候補群における１６の子の代家系を感染生存率に従って６の高生存率家系（平均的な生存率が６２．４％である）と１０の低生存率家系（平均的な生存率が３３．４７％である）の二大類（表８）に分け、高生存率と低生存率家系の成体のＧＥＢＶを比較し、高生存率家系成体の平均的なＧＥＢＶが２．１０で、低生存率家系成体の平均的なＧＥＢＶが１．３４であることがわかる。計算はよると、これらのヒラメ家系のＧＥＢＶ値を使用してはその感染生存率が予測される正確性が０．７９４に達することができ、理論値に近い。そのため、発明者が設計される遺伝子チップはヒラメ耐病性性状の選別に効果的に応用することができる。

16 offspring in the candidate group were infected according to survival rate, 6 high survival rate families (average survival rate is 62.4%) and 10 low survival rate families (average survival rate 33. It is divided into two major categories (Table 8) (47%), and the GEBV of adults with high survival rate and low survival rate is compared. The average GEBV of adult with high survival rate is 2.10, and the low survival rate. It can be seen that the average GEBV of an adult family is 1.34. According to the calculation, the accuracy of predicting the survival rate of infection can reach 0.794 using the GEBV values of these flounder families, which is close to the theoretical value. Therefore, the gene chip designed by the inventor can be effectively applied to the selection of flounder disease resistance properties.

上記の結果によると、「魚チップ１号」遺伝子チップを用いてヒラメ候補グループの個体に遺伝子分類を行い、加重ＧＢＬＵＰを用いてゲノム育種値（ＧＥＢＶ）を計算し、ＧＥＢＶ数値の大きさに従ってヒラメ耐病性親魚の選別を行い、これらの親魚を用いて育成される後代種苗の耐感染生存率が顕著に高まり、それにより「魚チップ１号」遺伝子チップがヒラメ耐病性優良品種の育成において普及応用を実行することができることがわかる。 According to the above results, the individual of the flounder candidate group was genetically classified using the "Fish Chip No. 1" gene chip, the genome breeding value (GEBV) was calculated using the weighted GBLUP, and the flounder was calculated according to the magnitude of the GEBV value. Disease-resistant parent fish are selected, and the survival rate of progeny seedlings bred using these parent fish is significantly increased. As a result, the "Fish Chip No. 1" gene chip is widely applied in the breeding of excellent flounder disease-resistant varieties. It turns out that you can do.

産業中の実施可能性
本発明より提供されるヒラメ耐病性に関連するＳＮＰローカスの遺伝子チップはヒラメ耐病性個体の選別に利用可能であり、かつ実際の選択正確性は理論値に近いため、ヒラメ耐病性優良品種の選択正確性を高め、育種期間を短縮させ、これにより、ヒラメ耐病性優良品種の選別のための遺伝子チップ技術を提供し、魚類耐病性優良品種の選別のための遺伝子チップ育種の新しい道を切り開くことができる。 Industrial feasibility The gene chip of SNP locus related to flounder disease resistance provided by the present invention can be used for selection of flounder disease resistant individuals, and the actual selection accuracy is close to the theoretical value. Increased selection accuracy of excellent disease-resistant varieties and shortened breeding period, thereby providing gene chip technology for selection of excellent flounder disease-resistant varieties, and gene chip breeding for selection of excellent fish disease-resistant varieties. Can pave the way for new roads.

Claims

An SNP locus related to flounder disease resistance, wherein the SNP locus is a base at position 36 of any one of the sequences in which the sequence is SEQ NO: 1-SEQ ID NO: 48697. SNP locus related to flounder disease resistance.

Application of the SNP locus related to flounder disease resistance according to claim 1 to the selection and breeding of flounder disease resistant varieties.

Application of the SNP locus related to the flounder disease resistance according to claim 1 to the manufacture of a inspection product for selecting and cultivating a variety having excellent flounder disease resistance.

The application according to claim 3, wherein the inspection product is a gene chip.

A gene chip used for selecting an excellent flounder disease-resistant variety, wherein the gene chip can detect SNP locus related to the flounder disease resistance according to claim 1, and is used for selecting an excellent flounder disease-resistant variety. The gene chip used.

A method for selecting a flounder disease-resistant individual, wherein the method is a method for selecting a flounder disease-resistant individual, which comprises performing a measurement using the gene chip according to claim 3.

The method for selecting a flounder disease-resistant individual according to claim 6.
The method is
1) A step of extracting individual genomic DNA in a candidate group and performing a test using the above gene chip to obtain the result of genotyping of an SNP marker.
2) The result of genotyping of SNP locus, which is similar to the gene chip, is extracted from the SNP set of the reference group, and the result of genotyping of SNP of the reference group and the genotype of the candidate group obtained using the chip. Steps to merge with the judgment result,
3) Using the combined genotype and the type expressed by the reference group, the estimated breeding value GEBV of the candidate group is estimated using the weighted GBLUP method, and the disease resistance potential of the test-tested individual is further estimated based on the GEBV value. Steps to determine ability and
Including
In addition, the genotype of the reference group is used to estimate the prediction accuracy using the weighted best linear bias estimation method.
Among them, the 5-fold cross-validation method is used to estimate the prediction accuracy, and the area AUC under the feature curve is used as an index to judge the prediction accuracy. The closer the AUC is to 1, the higher the prediction accuracy is. How to select flounder disease resistant individuals.