JP7158496B2

JP7158496B2 - Disease resistance breeding gene chip of flounder and its application

Info

Publication number: JP7158496B2
Application number: JP2020556756A
Authority: JP
Inventors: 松林陳; 茜周; 昇盧; 亜東陳; 洋劉; 文騰徐; 仰真李; 磊王; 英明楊; 娜王; 希紅李
Original assignee: 中国水産科学研究院黄海水産研究所
Priority date: 2019-12-17
Filing date: 2019-12-17
Publication date: 2022-10-21
Anticipated expiration: 2039-12-17
Also published as: JP2022518304A; CN111278994B; CN111278994A; WO2021119980A1

Description

本発明は水産遺伝育種の技術分野に属し、具体的にヒラメ耐病性優良品種選別に用いられる遺伝子チップの製造方法および応用に関する。 TECHNICAL FIELD The present invention belongs to the technical field of aquatic genetic breeding, and specifically relates to a method and application of a gene chip used for selection of excellent disease-resistant flounder varieties.

水産養殖業は中国食品の重要なソースであり、魚類養殖業はまた水産養殖業の基幹産業でもあり、２０１５年に魚類養殖の生産量は２８４５７万トンで、水産養殖生産量全体の５７．６％である。養殖魚類はすでに中国タンパク質の重要な由来となった。 Aquaculture is an important source of Chinese food, and fish farming is also the key industry of aquaculture. %. Farmed fish has already become an important source of Chinese protein.

しかし、魚類養殖業の急速な発展につれて、優良な品種が欠乏し、養殖種類の品質が劣化する。養殖規模が拡大し、集約化水準の向上および養殖環境の悪化は水産養殖病害の頻繁な発生を引き起こし、養殖製品の薬物残留の深刻化等問題も魚類養殖業の持続可能な発展を深刻に制約する。魚類のみにとって、高密度な養殖で形成した免疫抑制のため、養殖魚類の耐病性の低下を引き起こす。魚類の免疫耐病性メカニズムおよび耐病性の分子遺伝に対する研究はいまだに進まないため、分子レベルで魚類病害の予防案を提出しにくい。さらに、耐病性機能遺伝子と耐病性分子マーカーが欠乏し、耐病性優良品種の育成を行いにくく、そのため現在養殖生産は耐病性が低下する野生または人工繁殖した多世代の種苗のみに依存し、流行病が魚類養殖における頻繁な発生を引き起こす。不完全な統計によると、中国における魚類養殖業では毎年病害による直接的経済損失は１００億人民元にも達する。病害はすでに中国における魚類養殖業の持続可能な発展を制約するボトルネックとなった。 However, with the rapid development of fish farming industry, there is a shortage of good breeds and the quality of farmed breeds is deteriorating. As the scale of aquaculture expands, the level of intensification increases, and the aquaculture environment deteriorates, aquaculture diseases will occur more frequently, and problems such as increased drug residues in aquaculture products will seriously constrain the sustainable development of the fish farming industry. do. For fish only, the immunosuppression formed by high-density aquaculture causes a decrease in disease resistance in farmed fish. Research on immune disease resistance mechanisms and molecular inheritance of disease resistance in fish is still incomplete, so it is difficult to propose preventive measures against fish diseases at the molecular level. Furthermore, disease resistance functional genes and disease resistance molecular markers are deficient, making it difficult to breed excellent disease-resistant varieties. The disease causes frequent outbreaks in fish farming. According to incomplete statistics, the direct economic loss from disease in China's fish farming industry amounts to RMB 10 billion every year. Diseases have already become a bottleneck constraining the sustainable development of fish farming industry in China.

ヒラメは世界的な海水養殖魚類で、中国における海水養殖の主導的な魚類の一つでもある。但し、ヒラメ養殖業においても病害が頻発し、死亡率が高いという問題が存在する。ヒラメ等養殖魚類を脅かす主要な病害は細菌性疾患・ウイルス性疾患を含む。その中危害が大きい疾患はそれぞれエドワジエラ症・ビブリオ症とリンホシスチス病が挙げられる。抗生物質系薬物またはワクチン等疾患予防措置は一定の効果があるが、水産養殖における病害問題を根本的に解決できない。かつ抗生物質系薬物に魚体内に蓄積しやすく、養殖魚類の商品品質を低下させ、消費者の健康に潜在的な危害を有し、病原菌に薬剤耐性および養殖環境を深刻に汚染するという問題を引き起こしやすいため、水産養殖業における応用はますます制約される。同時に、抗生物質の使用はまた人々日増しに伸びる無残留の無公害水産品の需要を満たすことができない。そのため、魚類耐病性優良品種の選別は中国の水産領域において解決が急がれる重大な課題の一つである。 Flounder is a global saltwater farmed fish, and is also one of the leading fish in saltwater farming in China. However, even in the flounder farming industry, there is a problem that disease damage occurs frequently and the mortality rate is high. Major diseases that threaten cultured fish such as flounder include bacterial and viral diseases. Among them, edwardieriasis, vibriopathy and lymphocystis disease are the most dangerous diseases, respectively. Disease prevention measures such as antibiotics and vaccines have certain effects, but they cannot fundamentally solve the problem of diseases in aquaculture. In addition, antibiotics tend to accumulate in fish bodies, degrade the product quality of farmed fish, have potential hazards to consumer health, cause drug resistance to pathogenic bacteria, and seriously pollute the farming environment. Its susceptibility to triggering increasingly limits its application in the aquaculture industry. At the same time, the use of antibiotics also cannot meet people's ever-growing demand for residue-free and pollution-free aquatic products. Therefore, the selection of fish varieties with excellent disease resistance is one of the serious problems that need to be solved urgently in China's aquaculture industry.

今まで、魚類優良品種の選別は主に表現型性状の選別に基づき、グループ選別・家系選別・交雑選別とＢＬＵＰ選別等を含み、主に体長・体重等測定しやすい表現型値に基づいて算出される育種値に基づいて選択を行う。分子マーカーができた後、重要な経済性状に関連する分子マーカーを特定することにより経済性状に選択を行い、従来の分子マーカー補助選別に用いられる分子マーカーの数量が非常に限られ、単一遺伝子性状または品質性状の選択効果に優れたが、多重遺伝子が決定される数量性状に対する選択効果がかんばしくない。耐病性性状は複数の遺伝子が制御される数量性状であり、直接測定しにくく、選択正確性が非常に低く、そのため耐病性優良品種に対する選別は長い間伸び悩み、魚類耐病性新品種の育成を制約し、新たしい育種技術でこの課題を解決する必要が急がれる。 Until now, the selection of superior fish varieties has been based mainly on the selection of phenotypic characteristics, including group selection, family selection, cross selection, BLUP selection, etc., and is mainly calculated based on phenotypic values that are easy to measure, such as body length and weight. The selection is made based on the breeding value given. After the molecular markers are available, selection can be made economically by identifying the molecular markers associated with important economic properties. Although the selection effect of traits or quality traits was excellent, the selection effect of quantity traits determined by multiple genes was not good. Disease resistance traits are quantitative traits controlled by multiple genes, are difficult to measure directly, and have very low selection accuracy. However, there is an urgent need to solve this problem with new breeding techniques.

遺伝子チップは、ＤＮＡチップ・ＤＮＡマイクロアレイともいい、フォトエッチング技術を用い、シリコンウェハを固相担体とし、選択最適化を経た大量のＤＮＡ配列をオリゴヌクレオチドに合成し、特殊な処理を経たスライドにつけ、変性・固定を経た後にＤＮＡマイクロアレイを形成する。核酸分子交雑技術に基づき、遺伝子チップは数万ひいては数十万のＤＮＡフラグメントに並行化交雑と解析を同時に実行でき、高スループット・並行性・高効率・サンプルの数量が少ないという利点を有する。現在、遺伝子チップはすでに人類の疾患・腫瘍の診断と、動植物の遺伝学的解析および遺伝育種に広範に応用される。動物育種において、遺伝子チップはすでに牧畜、特に乳牛・豚等種の優良品種の選別に応用されたことに成功した。例えば、乳牛において、すでにＢｏｖｉｎｅ３Ｋｃｈｉｐ・Ｂｏｖｉｎｅ２５Ｋ・ＳＮＰｃｈｉｐ・ＢｏｖｉｎｅＨＤ７００Ｋ・ＢｏｖｉｎＬＤ７Ｋ等多くの遺伝子チップが相次いで開発される。今まで、北米・欧州・オーストラリア等はＢｏｖｉｎｅＳＮＰ５０Ｂｅａｄｃｈｉｐチップ（５４ＫＳＮＰ）を用いてゲノムＳＮＰマーカー分類検測の汎用プラットフォームとして、かつ大規模参考グループが取得したＳＮＰ分類結果に基づき、産乳量・繁殖力・耐病性等多種の経済性状のフルゲノム関連解析を行い、乳牛ゲノムの選択体系を構築する。ゲノム選択を通して新生雄牛の初期選択を実現させ、乳牛育種の世代間隔を短縮させ、遺伝的進歩を促進させ、種雄牛の選択効率を大いに向上させ、養殖と育種コストを顕著に節約する。 A gene chip, also known as a DNA chip or a DNA microarray, uses photoetching technology, uses a silicon wafer as a solid phase carrier, synthesizes a large amount of DNA sequences through selective optimization into oligonucleotides, attaches them to slides that have undergone special processing, A DNA microarray is formed after denaturation and fixation. Based on nucleic acid hybridization technology, the gene chip can perform parallel hybridization and analysis on tens of thousands and even hundreds of thousands of DNA fragments at the same time, and has the advantages of high throughput, parallelism, high efficiency and small sample quantity. At present, gene chips have been widely applied to the diagnosis of human diseases and tumors, and the genetic analysis and genetic breeding of animals and plants. In animal breeding, gene chips have already been successfully applied to the selection of excellent breeds, especially dairy cows and pigs. For example, for dairy cows, many gene chips such as Bovine3Kchip, Bovine25K, SNPchip, BovineHD700K, and BovinLD7K have been developed one after another. Until now, North America, Europe, Australia, etc. have been using the BovineSNP50Beadchip chip (54KSNP) as a general-purpose platform for genomic SNP marker classification and testing, and based on the SNP classification results obtained by a large-scale reference group, milk production, fertility, and Perform full genome association analysis of various economic properties such as disease resistance, and construct a selection system for dairy cow genomes. Through genome selection, the early selection of newborn bulls can be achieved, the generation interval of dairy cattle breeding can be shortened, genetic progress can be promoted, the selection efficiency of sires can be greatly improved, and breeding and breeding costs can be significantly reduced.

但し、水産養殖動物に、今まで育種用遺伝子チップ、特に耐病性育種遺伝子チップの報道がいまだに見られない。 However, there have been no reports of breeding gene chips, especially disease-resistant breeding gene chips, for aquaculture animals.

本発明はヒラメの耐病性優良品種の選別に用いられる遺伝子チップを提供し、魚類優良品種の育成において遺伝子チップが欠乏するという問題を解決し、従来育種技術の欠点を補完し、魚類の耐病性と高生産性を兼ね備える優れた優良品種育成のために、新たな分子育種方法を提供し、魚類育種技術の世代交代を実現させ、魚類育種産業の高速な発展を推進することを目的とする。 The present invention provides a gene chip that is used to select flounder with excellent disease resistance, solves the problem of lack of gene chips in the breeding of excellent fish breeds, makes up for the shortcomings of conventional breeding techniques, and improves fish disease resistance. The aim is to provide a new molecular breeding method, realize a generational change in fish breeding technology, and promote the rapid development of the fish breeding industry in order to develop excellent breeds that combine both high productivity and high productivity.

本発明はまずヒラメ耐病性に関連するＳＮＰローカスを提供し、前記ＳＮＰローカスは配列がＳＥＱＮＯ：１ＳＥＱＩＤＮＯ：４８６９７である中のいずれか一つの配列の第３６位の塩基である。 The present invention first provides a SNP locus associated with flounder disease resistance, wherein the SNP locus is the 36th base of any one of the sequences SEQ NO:1 SEQ ID NO:48697.

本発明のＳＮＰローカスはヒラメ耐病性優良品種の選別に利用できる。 The SNP locus of the present invention can be used for selection of flounder with excellent disease resistance.

本発明が提供されるＳＮＰローカスはヒラメ耐病性優良品種の選別用の検測製品の製造にも用いられることができる。 The SNP locus to which the present invention is provided can also be used to produce inspection products for selection of excellent disease-resistant flounder varieties.

前記検測製品は、好ましくは遺伝子チップである。 Said test product is preferably a gene chip.

本発明のさらなる方面は、ヒラメ耐病性優良品種の選別に用いられる遺伝子チップを提供し、それはヒラメ耐病性に関連するＳＮＰローカスを検測することができる。 A further aspect of the present invention provides a gene chip used for selection of flounder disease-resistant superior cultivars, which can detect SNP locus associated with flounder disease resistance.

本発明のさらなる方面はヒラメ耐病性個体の選別方法を提供し、この方法は、上記の遺伝子チップを用いて実行することができる。 A further aspect of the invention provides a method for selecting disease-resistant flounder individuals, which method can be performed using the gene chip described above.

前記方法は、以下のステップを含む：
１）候補グループにおける個体ゲノムＤＮＡを抽出し、かつ上記の遺伝子チップを利用して検測してＳＮＰマーカーの遺伝子型判定の結果を取得する。
２）参考グループにおけるＳＮＰグループより遺伝子チップと同様であるＳＮＰローカスの遺伝子型判定の結果を抽出し、さらに参考グループのＳＮＰの遺伝子型判定の結果と候補グループからチップを利用して取得した遺伝子型判定の結果とを合併する。
３）合併された遺伝子型と参考グループが表現する型とを利用し、加重ＧＢＬＵＰ方法を用いて候補グループの推定育種値（ＧＥＢＶ）を推定し、さらにＧＥＢＶ値に基づいて被検測個体の耐病性潜在能力を決定する。 The method includes the following steps:
1) Extract the genomic DNA of individuals in the candidate group, and use the above-mentioned gene chip to detect and obtain genotyping results for the SNP markers.
2) From the SNP group in the reference group, the results of genotyping of SNP loci similar to the gene chip are extracted, and the results of genotyping of the SNPs in the reference group and genotypes obtained from the candidate group using the chip Merge with the judgment result.
3) Utilizing the combined genotype and the type represented by the reference group to estimate the estimated breeding value (GEBV) of the candidate group using the weighted GBLUP method, and based on the GEBV value the disease resistance of the tested individuals. Determines sexual potential.

参考グループの遺伝子型を利用し、加重最良線形不偏推定量（加重ＧＢＬＵＰ）を用いて予測正確性を推定する。そのうち、５倍交差検証方法を予測正確性の判定方法に利用し、特徴曲線下面積（ＡＵＣ）を予測正確性の判定指標とある。ＡＵＣは１に近いほど、予測正確性は高い。 The genotype of the reference group is used to estimate prediction accuracy using the weighted best linear unbiased estimator (weighted GBLUP). Among them, the 5-fold cross-validation method is used as the judgment method of prediction accuracy, and the area under the feature curve (AUC) is used as the judgment index of prediction accuracy. The closer the AUC is to 1, the higher the prediction accuracy.

本発明が提供されるヒラメ耐病性に関連するＳＮＰローカスの遺伝子チップはヒラメ耐病性個体の選別に利用することが可能であり、かつ実際の選択正確性が理論値に近いため、ヒラメ耐病性優良品種の選択正確性が向上し、育種期間を短縮することができる。これにより、ヒラメ耐病性優良品種の選別のために遺伝子チップ技術を提供し、魚類耐病性に優れた品種を選別するための、遺伝子チップによる育種という新しい道を切り開いた。 The gene chip of SNP locus associated with flounder disease resistance provided by the present invention can be used for selection of flounder disease-resistant individuals, and the actual selection accuracy is close to the theoretical value. The selection accuracy of varieties can be improved and the breeding period can be shortened. As a result, we have provided gene chip technology for the selection of flounder cultivars with excellent disease resistance, and have opened up a new way of breeding using gene chips to select cultivars with excellent disease resistance in fish.

本発明はヒラメ耐病性優良品種育成の遺伝子チップの製造と応用方法を確立し、ヒラメ等魚類の耐病性優良品種の育成のために新たな分子育種の技術手段を提供することを目的とする。 The purpose of the present invention is to establish a method for producing and applying a gene chip for breeding flounder with excellent disease resistance, and to provide new molecular breeding technical means for breeding flounder and other fish with excellent disease resistance.

次に本発明が関する専門用語に対する説明は以下のとおりである：
ＳＮＰ：ＳｉｎｇｌｅＮｕｃｌｅｏｔｉｄｅＰｏｌｙｍｏｒｐｈｉｓｍの略語で、すなわち一塩基多型で、ゲノムレベルにおいてヌクレオチド単体の変異により引き起こされるＤＮＡ配列の多型である。 Next, explanations for the terminology with which the present invention relates are as follows:
SNP: An abbreviation for Single Nucleotide Polymorphism, that is, a single nucleotide polymorphism, which is a DNA sequence polymorphism caused by a single nucleotide mutation at the genome level.

遺伝子チップ：微細加工技術を通し、数万乃至百万特定のＤＮＡ配列フラグメントを、シリコンウエハー・スライド等支持物に規則的に配置して固定し、構成される二次元ＤＮＡプローブアレーで、遺伝物質（ＤＮＡ等）に遺伝子分類および分子検測を行うことができる。 Gene chip: A two-dimensional DNA probe array consisting of tens of thousands to millions of specific DNA sequence fragments that are regularly arranged and fixed on a support such as a silicon wafer or slide through microfabrication technology. (DNA, etc.) can be subjected to genetic typing and molecular testing.

縮退塩基：コドンの縮退性に基づき、一つの記号をよく用いて不特定の二つまたはそれ以上の塩基を代替する。 Degenerate bases: Based on the degeneracy of codons, one symbol is often used to replace two or more unspecified bases.

例えばＲはＡ／Ｇを示し、ＹはＣ／Ｔを示し、ＭはＡ／Ｃを示し、ＫはＧ／Ｔを示し、ＳはＧ／Ｃを示し、ＷはＡ／Ｔを示すなど。 For example, R indicates A/G, Y indicates C/T, M indicates A/C, K indicates G/T, S indicates G/C, W indicates A/T, and so on.

参考グループ：ゲノムの選択において、人為感染等検測を通して取得した表現型データを有するグループは、通常表現型性状を有する大型グループから選別されるグループ全体を代表する表現型分布で、かつゲノム再配列を行い、遺伝子型データを取得し、実際のゲノム選択計算を行う個体の集合である。 Reference group: In genome selection, the group with phenotypic data acquired through testing such as artificial infection is selected from a large group with normal phenotypic characteristics. It is a set of individuals who perform the genotype data, and perform the actual genome selection calculation.

候補グループ：ゲノムの選択において、候補グループとはゲノム再配列を通し、遺伝子型データを取得したが、表現型データがないグループで、該グループは育種の潜在力を有し、後続の実際優良品種育成作業に用いる予定がある個体の集合である。 Candidate group: in genome selection, the candidate group is the group that has obtained genotype data through genome rearrangement but has no phenotypic data, and has the potential for breeding, and the subsequent actual superior varieties. This is a set of individuals scheduled to be used for breeding work.

ＧＢＬＵＰ：ＧｅｎｏｍｉｃＢｅｓｔＬｉｎｅａｒＵｎｂｉａｓｅｄＰｒｅｄｉｃｔｉｏｎの略語で、すなわちゲノム最良線形不偏推定量で、ゲノムにおける高密度な分子マーカーを用いて個体間の成因関係（Ｇマトリックス）を利用し、ゲノム育種値の推定する方法である。 GBLUP: An abbreviation for GenomicBestLinearUnbiasedPrediction, that is, a genome best linear unbiased estimator, which is a method for estimating genome breeding value by using high-density molecular markers in the genome and utilizing genetic relationships (G matrix) between individuals.

ＧＥＢＶ：ＧｅｎｏｍｉｃＥｓｔｉｍａｔｅｄＢｒｅｅｄｉｎｇＶａｌｕｅｓの略語で、すなわちゲノム推定育種価で、フルゲノムにおけるすべてのマーカーまたはハプロタイプの効果推定を加算して取得する。 GEBV: Abbreviation for Genomic Estimated Breeding Values, ie Genomic Estimated Breeding Values, obtained by adding up the effect estimates of all markers or haplotypes in the full genome.

次に実施例を組み合わせて本発明に詳細な記述を行う。 Next, the present invention will be described in detail by combining examples.

実施例１：「魚チップ１号」遺伝子チップＳＮＰローカスの選別およびチップ製造
１、ヒラメ耐エドワージエラタルダ参考グループの作成および表現型性状の測定
ヒラメゲノムが選定される参考グループと候補グループの個体はいずれも本課題チームが２００３年以来作成したヒラメ家系より由来され、多年の育成過程において、次第に韓国・日本および中国のヒラメグループの急速な成長、耐病性や耐逆性等優良な性状より由来される。 Example 1: Selection of "Fish Chip No. 1" Gene Chip SNP Locus and Chip Production 1. Creation of flounder-resistant Edwardsella tarda reference group and measurement of phenotypic characteristics Individuals of reference group and candidate group for selection of flounder genome All of them are derived from the flounder family created by this project team since 2003, and in the process of breeding for many years, the rapid growth of the flounder group in Korea, Japan and China, and excellent properties such as disease resistance and reversal resistance. be done.

特に２０１３年から、ヒラメ養殖業におけるエドワージエラタルダが日増しに深刻化する情勢に対し、ヒラメ耐エドワージエラタルダ家系選別の研究を実施する。 In particular, from 2013, in response to the situation where Edwardsiella tarda in the flounder farming industry is becoming more serious day by day, we will conduct research on flounder-resistant Edwardsiella tarda family selection.

２０１３－２０１５年、当年に作成したヒラメ家系に腹腔接種人為感染エドワージエラタルダ検測を連続的に実施し、感染した検測魚苗に鰭棘を収集し、生長と耐病性表現型を測定し、２０１３年・２０１４年と２０１５年にサンプル４５７７匹・５９４２匹と６９１９匹を採取し、ヒラメエドワージエラタルダゲノムが選択される参考グループを選択して作成するために用いられるもの。 From 2013 to 2015, peritoneal inoculation artificial infection Edwardsiella tarda inspection was continuously carried out in flounder families created in the current year, fin spines were collected from the infected inspection fish seedlings, and growth and disease resistance phenotypes were measured. , 4577, 5942 and 6919 samples taken in 2013, 2014 and 2015 and used to select and generate a reference group from which the flounder Edwardian la tarda genome was selected.

感染検測のサンプルより、９６家系（２０１３年に３２、２０１４年に１０、２０１５年に４８）を選定し、各家系は死亡率に従って等比率の死亡と生存した個体１０－１５を選定し、ゲノムが選択される参考グループを組成し、選定した個体の感染検測の結果（死亡または生存）を参考グループの表現型性状（表２）とする。

From the infection test samples, 96 families (32 in 2013, 10 in 2014, 48 in 2015) were selected, and each family selected 10-15 individuals who died and survived in an equal proportion according to mortality. , a reference group from which the genome is selected is formed, and the results of infection detection (death or survival) of the selected individuals are used as the phenotypic characteristics of the reference group (Table 2).

２、ヒラメフルゲノム再配列およびＳＮＰローカス評価
ヒラメ参考グループはＤＮＡ抽出・検測を経た後、利用可能な個体を計９３１有する（表３）。

2. Flounder full genome rearrangement and SNP locus evaluation The flounder reference group has a total of 931 individuals available after undergoing DNA extraction and testing (Table 3).

９３１の参考グループの個体のゲノムＤＮＡを抽出し、ＤＮＡ検測が合格した後、次世代配列ライブラリを作成し、ライブラリ作成類型は両端ＤＮＡライブラリ（挿入フラグメント３５０ｂｐ）で、ＩｌｌｕｍｉｎａＨｉｓｅｑＸ１０配列プラットフォームを用いて配列とデータエクスポートを完成し、品質管理で取得された平均的なデータ量は２Ｇ／個体である。本課題チームが提供されるヒラメゲノム配列（ＧｅｎＢａｎｋＩＤ：ＰＲＪＮＡ７３６７３）を参考ゲノムとして、ＢＷＡ（ｈｔｔｐ：／／ｂｉｏ－ｂｗａ．ｓｏｕｒｃｅｆｏｒｇｅ．ｎｅｔ／）ソフトウェアを用いて配列比較を行い、その後Ｓａｍｔｏｏｌｓ（ｈｔｔｐ：／／ｗｗｗ．ｈｔｓｌｉｂ．ｏｒｇ／）ソフトウェアを用いてＳＮＰ予測と評価を行い、４２．２ＭのＳＮＰ集合を取得する。 The genomic DNA of 931 reference group individuals was extracted, and after passing the DNA test, the next generation sequence library was constructed. and the average amount of data acquired in quality control is 2G/individual. Using the flounder genome sequence (GenBankID: PRJNA73673) provided by this project team as a reference genome, sequence comparison was performed using BWA (http://bio-bwa.sourceforge.net/) software, and then Samtools (http:/ /www.htslib.org/) software is used for SNP prediction and evaluation to obtain a SNP set of 42.2M.

３、ＳＮＰローカス評価と選別
（１）ステップ２で取得した４２．２ＭのヒラメＳＮＰマーカーに対し、以下の規格に選別を行う：
見逃し率＞０．１、最小対立遺伝子頻度（ＭＡＦ）＜０．０５のサイトを削除し、繰り返し配列または反復配列におけるサイトを削除し、ハドウィンバランスに適合しないサイトを削除し、３．４ＭのヒラメＳＮＰマーカーを取得する。 3. SNP locus evaluation and selection (1) Select the 42.2M flounder SNP markers obtained in step 2 according to the following standards:
Deletion of sites with a miss rate >0.1, minimum allele frequency (MAF) <0.05, deletion of sites at or in repeats, deletion of sites that do not fit the Hadwin balance, deletion of 3.4M Acquire flounder SNP markers.

（２）ステップ（１）で選別した３．４Ｍ分子マーカーに対し、ＳＮＰローカス効果値解析と推定育種値計算を実施し、
ヒラメゲノム選択計算はＢａｙｅｓＣπアルゴリズムを用い、解析モデル等式：

モデルにおいて、ｙは表現型値で、ｕはグループ平均値で、グループｑｉはマーカー効果が正規分布ｑｉ～Ｎ（０、

）で、ｍはマーカーの総数で、Ｘはｑｉに対応する関連マトリックスで、ｅは残差である。 (2) Perform SNP locus effect value analysis and estimated breeding value calculation for the 3.4M molecular markers selected in step (1),
Flounder genome selection calculation uses the Bayes C π algorithm, analysis model equation:

In the model, y is the phenotypic value, u is the group mean value, and group q is the marker effect normally distributed q ~ N(0,

), m is the total number of markers, X is the association matrix corresponding to qi, and e is the residual.

Ｒ言語パックが提供されるＢａｙｅｓＣπアルゴリズムを用い、組み合わせ済みの遺伝子型データｇｅｎｏｔｙｐｅ．ｃｓｖと表現型データｐｈｏｎｅｔｙｐｅ．ｃｓｖを組み合わせ、フルゲノム再配列の参考グループの計９３１のヒラメ個体にゲノム選択計算を行う。その後取得したＳＮＰローカス効果値を最大から最小への並べ替え、推定育種値＜１０－５のサイトを削除し、計８６４２２９のＳＮＰローカスを取得して遺伝子チップＳＮＰローカスの選定に用いられる。 Using the BayesCπ algorithm provided with the R language pack, the combined genotype data genotype. csv and phenotype data phonetype. csv and perform genome selection calculations on a total of 931 flounder individuals in the reference group for full genome rearrangement. After that, the obtained SNP locus effect values are rearranged from the largest to the smallest, sites with estimated breeding values <10-5 are deleted, and a total of 864,229 SNP loci are obtained and used for selection of gene chip SNP locus.

（３）さらにＡｆｆｙｍｅｔｒｉｘＡｘｉｏｍ遺伝子分類プローブ設計生物解析プロセスを用いてステップ（２）で選別したＳＮＰにそれぞれプローブ設計と評価を行い、プローブ変換可能性評価点数＜０．６のサイトを削除する。また、ＳＮＰがゲノム全体をカバーしかつ均等に分布され、ＳＮＰのフランキング配列３５ｂｐ内に他のＳＮＰが存在せず、ＳＮＰのフランキング配列３５ｂｐのＧＣ含有量が３０－７０％で、最終的に４８６９７のヒラメＳＮＰマーカーを選別してチップ製造に用いられることを保証し、前記４８６９７のＳＮＰ分子マーカーの配列記録は配列リストにある。
米国ＴｈｅｒｍｏＦｉｓｈｅｒ社製ＡｆｆｙｍｅｔｒｉｘＡｘｉｏｍチップの生産技術を用いてヒラメＳＮＰチップ（遺伝子チップ）を製造し、延べ４８６９７のヒラメＳＮＰローカスを含み、各チップは２４のサンプルを同時に検測することができる。 (3) Further probe design and evaluation are performed for each of the SNPs selected in step (2) using the Affymetrix Axiom genetic classification probe design bioanalysis process, and sites with probe conversion feasibility evaluation scores <0.6 are deleted. In addition, the SNPs covered and evenly distributed throughout the genome, there were no other SNPs within the 35 bp flanking sequence of the SNP, the GC content of the 35 bp flanking sequence of the SNP was 30-70%, and the final 48,697 flounder SNP markers were screened to ensure their use in chip manufacturing, and the sequence records of said 48,697 SNP molecular markers are in the sequence listing.
A flounder SNP chip (gene chip) was manufactured using Affymetrix Axiom chip production technology manufactured by ThermoFisher, USA, containing a total of 48,697 flounder SNP loci, and each chip can detect 24 samples simultaneously.

実施例２、「魚チップ１号」遺伝子チップの使用方法
１、チップ検測用サンプルの製造と交雑
少量のヒラメ鰭棘（米の粒ほどの大きさ）を採取し、ＤＮＡ抽出キット（中国・天根）を利用して鰭棘ゲノムＤＮＡを抽出し、１％のアガロースゲル電気泳動と核酸分光光度計を利用してＤＮＡ品質と濃度を検測し、最終的な合格サンプルの規格は以下のとおりである：電気泳動はＤＮＡに単一のストリップが生じることを求め、フラグメント長さが１０ｋｂより大きく、完全性に優れ、生分解は生じず、サンプル品質のＤＮＡ検測結果。紫外分光光度計を用いてＡ２６０／２８０を検測する：１．８－２．０、Ａ２６０／２３０＞１．５、濃度が２０ｎｇ／μｌより低くなく、総量が４μｇより小さくない。その後米国ＴｈｅｒｍｏＦｉｓｈｅｒ社製ＳＮＰチップ検測サンプルの製造標準操作プロセスに従って（ｈｔｔｐｓ：／／ｗｗｗ．ｔｈｅｒｍｏｆｉｓｈｅｒ．ｃｏｍ／）チップ検測用サンプルの製造を行う。１．４ｕｇ以上の高品質なＤＮＡテンプレートを２ｍｌ＊９６の深穴板に加え、変性剤を加えて常温変性を行い、変性１０ｍｉｎ後に変性を停止し、単鎖ＤＮＡを取得する。チップサイト増幅用４８６９７ペアプライマーと等温増幅酵素・ｄＮＴＰ等を深穴板に加え、深穴板を封止し、３７℃で等温増幅を行う。２４ｈ増幅した後、増幅産物を断片化し、等体積のイソプロパノールを加え、－２０℃の冷蔵庫において沈殿する。２４ｈ沈殿した後、４℃、３０００ｇを用いて遠心してＤＮＡ産物の沈殿物を取得し、３７℃で残りのイソプロパノールを除去し、沈殿物を溶解させ、交雑液を取得し、交雑液は５％のゲル電気泳動の検測結果の質量を用い、増幅産物品質の検測結果は、ストリップが明晰で、輝度が高い。交雑液は温度制御増幅装置を用いて交雑を行い、条件が９５℃１０ｍｉｎ、４８℃３ｍｉｎで、その後交雑液を持続的に４８℃に維持する。チップブロックを交雑液に浸漬し、４８℃の交雑炉において２４ｈ交雑させ、その後溶出、蛍光タンパク質を接続し、蛍光タンパク質を固定させ、交雑プローブ、蛍光信号をスキャンすることにより、各信号点は一つのプローブ交雑の結果で、各サイトの交雑結果を取得した後、チップスキャンの結果はＡｘｉｏｍＡｎａｌｙｓｉｓＳｕｉｔｅ（ＡｘＡＳ）ソフトウェア（米国ＴｈｅｒｍｏＦｉｓｈｅｒ社製）を用いて解析を行う。 Example 2, How to use "Fish Chip No. 1" gene chip 1. Manufacture and crossbreeding of samples for chip detection The fin spine genomic DNA was extracted using 1% agarose gel electrophoresis and nucleic acid spectrophotometer, and the final acceptable sample specifications were as follows: As follows: electrophoresis sought to produce a single strip of DNA, fragment length greater than 10 kb, excellent integrity, no biodegradation, sample quality DNA detection results. A260/280 is measured using UV spectrophotometer: 1.8-2.0, A260/230>1.5, concentration not lower than 20 ng/μl, total amount not lower than 4 μg. After that, according to the manufacturing standard operating process for SNP chip test samples manufactured by ThermoFisher, USA (https://www.thermofisher.com/), samples for chip test are manufactured. 1.4 ug or more of high-quality DNA template is added to a 2 ml*96 deep-well plate, a denaturant is added, denaturation is performed at room temperature, denaturation is stopped after denaturation for 10 minutes, and single-stranded DNA is obtained. 48697 pair primers for chip site amplification, isothermal amplification enzymes, dNTPs, etc. are added to the deep hole plate, the deep hole plate is sealed, and isothermal amplification is performed at 37°C. After 24 h of amplification, the amplified products are fragmented, added with an equal volume of isopropanol and precipitated in a refrigerator at -20°C. After precipitation for 24 h, centrifuge at 3000 g at 4° C. to obtain a precipitate of DNA product, remove the remaining isopropanol at 37° C., dissolve the precipitate, obtain a hybrid fluid, and the hybrid fluid is 5%. Using the mass of the gel electrophoresis detection results, the amplification product quality detection results show that the strips are clear and bright. The hybrid was hybridized using a temperature-controlled amplifier under the conditions of 95°C for 10 min and 48°C for 3 min, after which the hybrid was maintained at 48°C continuously. The chip block was immersed in a hybridization solution, hybridized for 24 hours in a hybridization oven at 48°C, then eluted, fluorescent proteins were connected, the fluorescent proteins were immobilized, hybridization probes were scanned, and each signal point was scanned. After obtaining the hybridization results for each site from the results of one probe hybridization, the chip scan results are analyzed using Axiom Analysis Suite (AxAS) software (manufactured by ThermoFisher, USA).

２、チップ検測と遺伝子分類データ解析
（１）被検測グループサンプルの採取とＤＮＡ抽出
再配列済みの一部ヒラメ個体ＤＮＡを選択し、前記チップを用いて検測を実施し、チップ遺伝子分類の正確性および再配列と遺伝子チップ分類を用いて取得した遺伝子型データにゲノム選択計算を実施する再現性に検測を行う。その後ヒラメ育種の選択過程に、家系が用いられる候補成体の個体を確立し、ゲノムＤＮＡの抽出を行いかつチップで検測を実施し、遺伝子チップがヒラメゲノム選択育種における応用効果を検証する。採用個体の情報は表４による。

2. Chip inspection and gene classification data analysis (1) Collection of samples of the group to be tested and DNA extraction A part of rearranged flounder individual DNA is selected, the chip is used for inspection, and chip gene classification is performed. The accuracy and reproducibility of performing genome selection calculations on genotypic data obtained using rearrangement and gene chip classification are tested. Then, in the selection process of flounder breeding, establish candidate adult individuals whose pedigrees are used, perform genomic DNA extraction and chip detection, and verify the application effect of gene chips in flounder genome selection breeding. Information on the adopted individuals is shown in Table 4.

（２）チップ検測
遺伝子チップ検測の標準プロセスに従い、ＡｆｆｙｍｅｔｒｉｘＧｅｎｅＴｉｔａｎ遺伝子チップ処理システムを用いてプローブ交雑・染色とチップスキャンを完成する。具体的な操作方法は以下のとおりである：４μｇの高品質なＤＮＡテンプレートを２ｍｌ＊９６の深穴板に加え、変性剤を加えて変性（２８℃）を行い、変性１０ｍｉｎ後に変性停止液（反応時間が１０ｍｉｎより長くない）変性を停止し、単鎖ＤＮＡを取得する。チップサイト増幅用４８６９７ペアプライマーと等温増幅酵素・ｄＮＴＰと反応液等を深穴板に加え、深穴板を封止し、３７℃で等温増幅を２２－３６ｈ行う。好ましくは２４ｈ増幅した後、高温６５℃で２０－３０ｍｉｎで反応液を不活性化させ、その後３７℃の培養箱に移転して４０ｍｉｎ培養し、断片化酵素と反応液［４１｝を加え、断片化増幅産物を断片化し、既存の反応液と等体積のイソプロパノールを加え、反応液が澄むまで反応液を均一に混合させ、その後－２０℃の冷蔵庫において産物を沈殿させる。２４ｈ沈殿した後、４℃、３、０００ｇで４０－６０ｍｉｎ遠心してＤＮＡ産物の沈殿物を取得し、上澄液を除去し、沈殿物を保留し、３７℃で残りのイソプロパノールを完全に除去し、沈殿物を溶解させ、交雑液を取得する。交雑液は温度制御増幅装置を用いて交雑を行い、条件が９５℃１０ｍｉｎ、４８℃３ｍｉｎで、その後交雑液を持続的に４８℃に維持する。チップブロックを交雑液に浸漬し、４８℃の交雑炉において２４ｈ交雑させる。その後溶出、蛍光タンパク質を接続し、蛍光タンパク質を固定させ、交雑プローブ、蛍光信号をスキャンすることにより、各サイトの交雑結果を取得し、チップスキャン結果はＡｘｉｏｍＡｎａｌｙｓｉｓＳｕｉｔｅ（ＡｘＡＳ）ソフトウェア（米国ＴｈｅｒｍｏＦｉｓｈｅｒ社製）を用いて解析を行う。 (2) Chip detection Following the standard process of gene chip detection, Affymetrix GeneTitan gene chip processing system is used to complete probe hybridization/staining and chip scanning. The specific operating method is as follows: add 4 μg of high-quality DNA template to a 2 ml*96 deep-well plate, add a denaturant to denature (28° C.), denature for 10 min, then denature stop solution ( (Reaction time not longer than 10 min) Stop denaturation and obtain single-stranded DNA. 48697 pair primers for chip site amplification, isothermal amplification enzyme/dNTP, reaction solution, etc. are added to the deep hole plate, the deep hole plate is sealed, and isothermal amplification is performed at 37° C. for 22 to 36 hours. Preferably, after amplification for 24 hours, the reaction mixture is inactivated at a high temperature of 65°C for 20-30 minutes, then transferred to a culture box at 37°C and cultured for 40 minutes. Fragment the amplified product, add an equal volume of isopropanol to the existing reaction mixture, mix the reaction mixture evenly until the reaction mixture is clear, and then place the product in a refrigerator at -20°C to precipitate. After 24 h of precipitation, the DNA product was precipitated by centrifugation at 3,000 g for 40-60 min at 4°C, the supernatant was removed, the precipitate was retained, and the remaining isopropanol was completely removed at 37°C. , to dissolve the precipitate and obtain the hybridization fluid. The hybrid was hybridized using a temperature-controlled amplifier under the conditions of 95°C for 10 min and 48°C for 3 min, after which the hybrid was maintained at 48°C continuously. The chip blocks are immersed in the hybridization solution and hybridized for 24 h in a hybridization oven at 48°C. After that, elution, fluorescent protein is attached, fluorescent protein is immobilized, hybridization probe and fluorescence signal are scanned to obtain the hybridization result of each site, and the chip scan result is obtained with Axiom Analysis Suite (AxAS) software (manufactured by ThermoFisher, USA). Analysis is performed using

（３）データ解析
ＡｘＡＳソフトウェア（米国ＴｈｅｒｍｏＦｉｓｈｅｒ社製）を利用してチップスキャンの結果を解析し、各サンプルの遺伝子分類結果を取得する。解析結果によると、チップの平均的な分類率が９８．７７％で、分類効果が抜群である。そのうち、高品質なＳＮＰ比率が７４．６１％で、各サンプルはいずれも高品質な分類情報を生成することができる。 (3) Data Analysis The chip scan results are analyzed using AxAS software (manufactured by ThermoFisher, USA) to obtain gene classification results for each sample. According to the analysis results, the average classification rate of the chips is 98.77%, which has excellent classification effect. Among them, the high-quality SNP ratio is 74.61%, and each sample can generate high-quality classification information.

実施例３「魚チップ１号」遺伝子チップがヒラメ耐病性育種における応用
１、「魚チップ１号」遺伝子チップ分類の効果検証
参考グループから一部の個体を選定して遺伝子チップ分類の信頼性を検証するために用いられるもの、これらの選定される個体は再配列の遺伝子型でもあり、「魚チップ１号」遺伝子チップ分類を用いて取得される遺伝子型でもある。発明者の既存のヒラメ参考グループから一部の個体遺伝子チップを選定して分類・統計を行う。チップを用いて取得した遺伝子型（０／１／２はＡＡ／Ａａ／ａａを示す）と再配列して取得した遺伝子型の一致性および再配列とチップデータを利用して推定したＧＥＢＶの関連係数を統計することによりチップ分類の効果を評価する。分類結果の一致性が８８％以上に達しかつＧＥＢＶの間の関連係数が０．９以上に達する場合、チップが優れた分類結果を有すると見なす。 Example 3 Application of “Fish Chip No. 1” Gene Chip to Disease Resistance Breeding of Flounder Used for validation, these selected individuals were both genotyped for rearrangement and genotyped using the "Fish Chip No. 1" gene chip classification. Classification and statistics are performed by selecting some individual gene chips from the inventor's existing flounder reference group. Consistency between genotypes obtained using a chip (0/1/2 indicates AA/Aa/aa) and genotypes obtained by rearrangement, and association between rearrangement and GEBV estimated using chip data Evaluate the effect of chip classification by statistical coefficients. A chip is considered to have a good classification result if the consistency of the classification result reaches 88% or more and the correlation coefficient between GEBV reaches 0.9 or more.

解析結果によると、「魚チップ１号」遺伝子チップ分類を利用して取得したサイト情報の９０．０８％は再配列と同様で、２組のＧＥＢＶの間の関連係数は０．９５８である。そのため、本発明が開発されるヒラメ遺伝子チップの分類結果は再配列と基本的に一致し、ヒラメに正確な遺伝子分類を行うことができる。 The analysis results show that 90.08% of the site information obtained using the “fish chip No. 1” gene chip classification is similar to the rearrangement, and the association coefficient between the two sets of GEBV is 0.958. Therefore, the classification result of the flounder gene chip developed by the present invention is basically consistent with the rearrangement, and accurate gene classification can be performed for flounder.

具体的な操作方法は以下のとおりである：
ＰＬＩＮＫソフトウェアを利用してチップデータを読み取り、サーバーにおいて以下のコマンドを入力して上記データに処理を行う：
ｐｌｉｎｋ－－ｖｃｆｏｐ２－１．ｖｃｆ－－ｍａｋｅ－ｂｅｄ－－ｏｕｔｏｐ＿Ｖａｌ＿１
ｐｌｉｎｋ－－ｖｃｆｃｓ２－２．ｖｃｆ－－ｍａｋｅ－ｂｅｄ－－ｏｕｔｏｐ＿Ｖａｌ＿２
ｐｌｉｎｋ－－ｖｃｆｏｐ２－３．ｖｃｆ－－ｍａｋｅ－ｂｅｄ－－ｏｕｔｏｐ＿Ｖａｌ＿３
ｐｌｉｎｋ－－ｖｃｆｏｐ２－４．ｖｃｆ－－ｍａｋｅ－ｂｅｄ－－ｏｕｔｏｐ＿Ｖａｌ＿４ The specific operation method is as follows:
Use PLINK software to read the chip data, and enter the following command on the server to process the above data:
plink --vcf op2-1. vcf --make-bed --out op_Val_1
plink --vcf cs2-2. vcf --make-bed --out op_Val_2
plink --vcf op2-3. vcf --make-bed --out op_Val_3
plink --vcf op2-4. vcf --make-bed --out op_Val_4

読み取りを経て、４つのｖｃｆにおける情報は表５による：

After reading, the information in the 4 vcf is according to Table 5:

ａ）ＲにおいてＳＮＰを再命名しかつ４つのファイルにおける共通となるマーカー情報を抽出し、コマンドは以下のとおりである：
＃必要なＲパッケージをアンロードする
ｌｉｂｒａｒｙ（ｄａｔａ．ｔａｂｌｅ）
＃ｃｓ＿Ｖａｌ＿１とｃｓ＿Ｖａｌ＿２のサイト情報を読み取る
ｖａｌ＿１＜－ｆｒｅａｄ（“ｏｐ＿Ｖａｌ＿１．ｂｉｍ”，ｈｅａｄｅｒ＝Ｆ）
ｖａｌ＿２＜－ｆｒｅａｄ（“ｏｐ＿Ｖａｌ＿２．ｂｉｍ”，ｈｅａｄｅｒ＝Ｆ）
ｖａｌ＿３＜－ｆｒｅａｄ（“ｏｐ＿Ｖａｌ＿３．ｂｉｍ”，ｈｅａｄｅｒ＝Ｆ）
ｖａｌ＿４＜－ｆｒｅａｄ（“ｏｐ＿Ｖａｌ＿４．ｂｉｍ”，ｈｅａｄｅｒ＝Ｆ）
＃ＳＮＰ命名方式を統一しかつ再命名したファイルを出力する
ｖａｌ＿１＄Ｖ２＜－ｐａｓｔｅ（ｐａｓｔｅ（ｒｅｐ（“ｒｓ”，ｎｒｏｗ（ｖａｌ＿１）），ｖａｌ＿１＄Ｖ１，ｓｅｐ＝“ ”），ｖａｌ＿１＄Ｖ４，ｓｅｐ＝ “：”）
ｖａｌ＿２＄Ｖ２＜－ｐａｓｔｅ（ｐａｓｔｅ（ｒｅｐ（“ｒｓ”，ｎｒｏｗ（ｖａｌ＿２）），ｖａｌ＿２＄Ｖ１，ｓｅｐ＝ “ ”），ｖａｌ＿２＄Ｖ４，ｓｅｐ＝ “：”）
ｖａｌ＿３＄Ｖ２＜－ｐａｓｔｅ（ｐａｓｔｅ（ｒｅｐ（“ｒｓ”，ｎｒｏｗ（ｖａｌ＿３）），ｖａｌ＿３＄Ｖ１，ｓｅｐ＝“ ”），ｖａｌ＿３＄Ｖ４，ｓｅｐ＝ “：”）
ｖａｌ＿４＄Ｖ２＜－ｐａｓｔｅ（ｐａｓｔｅ（ｒｅｐ（“ｒｓ”，ｎｒｏｗ（ｖａｌ＿４）），ｖａｌ＿４＄Ｖ１，ｓｅｐ＝“ ”），ｖａｌ＿４＄Ｖ４，ｓｅｐ＝ “：”）
ｗｒｉｔｅ．ｔａｂｌｅ（ｖａｌ＿１， “ｏｐ＿Ｖａｌ＿１．ｂｉｍ”，ｓｅｐ＝ “￥ｔ”，ｃｏｌ．ｎａｍｅｓ＝Ｆ，ｒｏｗ．ｎａｍｅｓ＝Ｆ，ｑｕｏｔｅ＝Ｆ）
ｗｒｉｔｅ．ｔａｂｌｅ（ｖａｌ＿２， “ｏｐ＿Ｖａｌ＿２．ｂｉｍ”，ｓｅｐ＝ “￥ｔ”，ｃｏｌ．ｎａｍｅｓ＝Ｆ，ｒｏｗ．ｎａｍｅｓ＝Ｆ，ｑｕｏｔｅ＝Ｆ）
ｗｒｉｔｅ．ｔａｂｌｅ（ｖａｌ＿３， “ｏｐ＿Ｖａｌ＿３．ｂｉｍ”，ｓｅｐ＝ “￥ｔ”，ｃｏｌ．ｎａｍｅｓ＝Ｆ，ｒｏｗ．ｎａｍｅｓ＝Ｆ，ｑｕｏｔｅ＝Ｆ）
ｗｒｉｔｅ．ｔａｂｌｅ（ｖａｌ＿４， “ｏｐ＿Ｖａｌ＿４．ｂｉｍ”，ｓｅｐ＝ “￥ｔ”，ｃｏｌ．ｎａｍｅｓ＝Ｆ，ｒｏｗ．ｎａｍｅｓ＝Ｆ，ｑｕｏｔｅ＝Ｆ）
＃共通となるサイト情報を抽出しかつ出力する
ｃｏｍｍ＜－Ｒｅｄｕｃｅ（ｉｎｔｅｒｓｅｃｔ，ｌｉｓｔ（ａ＝ｖａｌ＿１＄Ｖ２，ｂ＝ｖａｌ＿２＄Ｖ２，ｃ＝ｖａｌ＿３＄Ｖ２，ｄ＝ｖａｌ＿４＄Ｖ２））
ｗｒｉｔｅ．ｔａｂｌｅ（ｃｏｍｍ， “ｃｏｍｍｏｎ＿ｓｎｐｓ．ｔｘｔ”，ｓｅｐ＝ “￥ｔ”，ｃｏｌ．ｎａｍｅｓ＝Ｆ，ｒｏｗ．ｎａｍｅｓ＝Ｆ，ｑｕｏｔｅ＝Ｆ） a) Rename the SNPs in R and extract the common marker information in the four files, the commands are:
# library (data.table) to unload the required R packages
# Read site info for cs_Val_1 and cs_Val_2 val_1 <- fread("op_Val_1.bim", header = F)
val_2 <- fread("op_Val_2.bim", header = F)
val_3 <- fread("op_Val_3.bim", header = F)
val_4 <-fread("op_Val_4.bim", header = F)
# Unify the SNP naming scheme and output a renamed file val_1$V2 <- paste (paste(rep(“rs”, nrow(val_1)), val_1$V1, sep = “ ”), val_1$V4, sep = “:”)
val_2$V2 <- paste(paste(rep("rs", nrow(val_2)), val_2$V1, sep = ""), val_2$V4, sep = ":")
val_3$V2 <- paste(paste(rep("rs", nrow(val_3)), val_3$V1, sep=""), val_3$V4, sep=":")
val_4$V2 <- paste(paste(rep("rs", nrow(val_4)), val_4$V1, sep=""), val_4$V4, sep=":")
write. table(val_1, "op_Val_1.bim", sep = "\t", col.names = F, row.names = F, quote = F)
write. table(val_2, "op_Val_2.bim", sep = "\t", col.names = F, row.names = F, quote = F)
write. table(val_3, "op_Val_3.bim", sep = "\t", col.names = F, row.names = F, quote = F)
write. table(val_4, "op_Val_4.bim", sep = "\t", col.names = F, row.names = F, quote = F)
# Extract and output common site information comm <- Reduce(intersect, list (a = val_1$V2, b = val_2$V2, c = val_3$V2, d = val_4$V2))
write. table(comm, "common_snps.txt", sep = "\t", col.names = F, row.names = F, quote = F)

ｂ）ＰＬＩＮＫソフトウェアを利用して４つのファイルを組み合わせ、かつ共通となるマーカーを保留し、コマンドは以下のとおりである：
ｐｌｉｎｋ－－ｂｆｉｌｅｏｐ＿Ｖａｌ＿１－－ｍｅｒｇｅ－ｌｉｓｔｍｅｒｇｅ＿ｏｐ．ｔｘｔ－－ｅｘｔｒａｃｔｃｏｍｍｏｎ＿ｓｎｐｓ．ｔｘｔ－－ｒｅｃｏｄｅＡ－－ｏｕｔｏｐ＿ｃｈｉｐ
ファイル「ｍｅｒｇｅ＿ｏｐ．ｔｘｔ」に以下の情報が貯蔵されている：
ｏｐ＿Ｖａｌ＿２．ｂｅｄｏｐ＿Ｖａｌ＿２．ｂｉｍｏｐ＿Ｖａｌ＿２．ｆａｍ
ｏｐ＿Ｖａｌ＿３．ｂｅｄｏｐ＿Ｖａｌ＿３．ｂｉｍｏｐ＿Ｖａｌ＿３．ｆａｍ
ｏｐ＿Ｖａｌ＿４．ｂｅｄｏｐ＿Ｖａｌ＿４．ｂｉｍｏｐ＿Ｖａｌ＿４．ｆａｍ b) Combine the 4 files using PLINK software and retain the common markers, the command is:
plink --bfile op_Val_1 --merge-list merge_op. txt--extract common_snps. txt --recode A --out op_chip
The following information is stored in the file "merge_op.txt":
op_Val_2. bed op_Val_2. bim op_Val_2. fam
op_Val_3. bed op_Val_3. bim op_Val_3. fam
op_Val_4. bed op_Val_4. bim op_Val_4. fam

ｃ）ＰＬＩＮＫソフトウェアを利用して参考グループから同様である個体とサイトを抽出し、上記の４つのファイルにおける．ｆａｍ情報をファイルに整理しかつ「ｏｐ＿ｃｈｉｐ＿ｉｎｄｉ．ｔｘｔ」と命名し、「…」はファイルカタログを表し、コマンドは以下のとおりである：
ｐｌｉｎｋ－－ｂｆｉｌｅ …／Ｖａｌ＿ｒｅｆ－－ｋｅｅｐｏｐ＿ｃｈｉｐ＿ｉｎｄｉ．ｔｘｔ－－ｅｘｔｒａｃｔｃｏｍｍｏｎ＿ｓｎｐｓ．ｔｘｔ－－ｒｅｃｏｄｅＡ－－ｏｕｔｏｐ＿ｒｓｅｑ
処理を経て、上記４つのファイルが共通となるマーカー数が１１、７１９で、参考グループにおいて検索可能な個体数が９５である。 c) Using PLINK software to extract similar individuals and sites from the reference group, . Organize the fam information into a file and name it "op_chip_indi.txt", where "..." represents the file catalog, the command is:
plink --bfile .../Val_ref --keep op_chip_indi. txt--extract common_snps. txt --recode A --out op_rseq
After processing, the number of markers that are common to the above four files is 11,719, and the number of individuals that can be retrieved in the reference group is 95.

ｄ）Ｒ統計チップと再配列分類の一致性を利用し、その方法は以下のとおりである：
＃必要なＲパッケージをアンロードする
ｌｉｂｒａｒｙ（ｄａｔａ．ｔａｂｌｅ）
＃それぞれチップと再配列が取得した分類情報を読み取る
ｃｈｉｐ＜－ｆｒｅａｄ（“ｏｐ＿ｃｈｉｐ．ｒａｗ”）
ｒｓｅｑ＜－ｆｒｅａｄ（“ｏｐ＿ｒｓｅｑ．ｒａｗ”）
＃ファイルにおける個体配列を統一する
ｆｉｄ＜－ｄａｔａ．ｆｒａｍｅ（ｒｓｅｑ＄ＦＩＤ）
ｃｏｌｎａｍｅｓ（ｆｉｄ）＜－ “ＦＩＤ”
ｃｈｉｐ＜－ｄａｔａ．ｔａｂｌｅ（ｍｅｒｇｅ（ｆｉｄ，ｃｈｉｐ，ｓｏｒｔ＝Ｆ））
＃ファイルにおける最初の６列を削除し、かつ遺伝子型を出力する
ｃｈｉｐ［，ｃ（１：６）：＝ＮＵＬＬ］
ｒｓｅｑ［，ｃ（１：６）：＝ＮＵＬＬ］
ｆｗｒｉｔｅ（ｃｈｉｐ， “ｇｅｎｏ＿ｏｐ＿ｃｈｉｐ．ｃｓｖ”，ｓｅｐ＝ “ ，”，ｒｏｗ．ｎａｍｅｓ＝Ｆ，ｑｕｏｔｅ＝Ｆ）
ｆｗｒｉｔｅ（ｒｓｅｑ， “ｇｅｎｏ＿ｏｐ＿ｒｓｅｑ．ｃｓｖ”，ｓｅｐ＝ “ ，”，ｒｏｗ．ｎａｍｅｓ＝Ｆ，ｑｕｏｔｅ＝Ｆ）
＃一致性を統計する
ｓｕｍ（ｃｈｉｐ＝＝ｒｓｅｑ）／（ｎｒｏｗ（ｃｈｉｐ）＊ｎｃｏｌ（ｃｈｉｐ））＊１００
統計によると、上記９５の個体は計１、１１３、３０５のマーカーを有し、完全に同様であるマーカー数が１、００２、８２９であることがわかった。そのため、チップと再配列分類の結果は９０．０８％完全に一致する。 d) Using the agreement of R statistic tip and rearrangement classification, the method is as follows:
# library (data.table) to unload the required R packages
# chip <- fread("op_chip.raw") to read the classification information obtained by chip and rearrangement respectively
rseq <-fread("op_rseq.raw")
# Unify the individual arrays in the file fid <- data. frame(rseq$FID)
colnames(fid) <- “FID”
chip <- data. table(merge(fid, chip, sort=F))
# chip[, c(1:6) := NULL] to delete the first 6 columns in the file and output the genotype
rseq[,c(1:6):= NULL]
fwrite(chip, "geno_op_chip.csv", sep = ",", row.names = F, quote = F)
fwrite(rseq, "geno_op_rseq.csv", sep = ",", row.names = F, quote = F)
# Sum(chip == rseq) / (nrow(chip) * ncol(chip)) * 100 to stat match
Statistics showed that the 95 individuals had a total of 1,113,305 markers and 1,002,829 markers that were completely similar. Therefore, the results of chip and rearrangement classification are 90.08% perfect match.

ｅ）ＧＥＢＶ推定の正確性を保証するために、ＰＬＩＮＫソフトウェアを利用して参考グループにおける残りの個体の遺伝子型を抽出し、コマンドは以下のとおりである：
ｐｌｉｎｋ－－ｂｆｉｌｅ …／Ｖａｌ＿ｒｅｆ－－ｒｅｍｏｖｅｏｐ＿ｃｈｉｐ＿ｉｎｄｉ．ｔｘｔ－－ｅｘｔｒａｃｔｃｏｍｍｏｎ＿ｓｎｐｓ．ｔｘｔ－－ｒｅｃｏｄｅＡ－－ｏｕｔｒｅｆ e) To ensure the accuracy of the GEBV estimation, genotype the remaining individuals in the reference group using PLINK software, the command is:
plink --bfile .../Val_ref --remove op_chip_indi. txt--extract common_snps. txt --recode A --out ref

ｆ）Ｒを利用して参考グループを合併するおよび個体の遺伝子型を検証し、その方法は以下のとおりである：
＃必要なＲパッケージをアンロードする
ｌｉｂｒａｒｙ（ｄａｔａ．ｔａｂｌｅ）
＃それぞれチップと再配列が取得した分類情報を読み取る
ｃｈｉｐ＜－ａｓ．ｍａｔｒｉｘ（ｆｒｅａｄ（“ｇｅｎｏ＿ｏｐ＿ｃｈｉｐ．ｃｓｖ”））
ｒｓｅｑ＜－ａｓ．ｍａｔｒｉｘ（ｆｒｅａｄ（“ｇｅｎｏ＿ｏｐ＿ｒｓｅｑ．ｃｓｖ”））
ｒｅｆ＜－ｆｒｅａｄ（“ｒｅｆ．ｒａｗ”）
＃ファイルにおける最初の６列を削除する
ｒｅｆ［，ｃ（１：６）：＝ＮＵＬＬ］
ｒｅｆ＜－ａｓ．ｍａｔｒｉｘ（ｒｅｆ）
＃遺伝子型ファイルを組み合わせかつ組み合わせた後の遺伝子型ファイルを出力する
ｇｅｎｏ＿ｃｈｉｐ＜－ｒｂｉｎｄ（ｃｈｉｐ，ｒｅｆ）
ｇｅｎｏ＿ｒｓｅｑ＜－ｒｂｉｎｄ（ｒｓｅｑ，ｒｅｆ）
ｗｒｉｔｅ．ｔａｂｌｅ（ｇｅｎｏ＿ｃｈｉｐ， “ｇｅｎｏ＿Ｖａｌ＿Ｃｈｉｐ．ｃｓｖ”，ｓｅｐ＝ “ ，”，ｒｏｗ．ｎａｍｅｓ＝Ｆ，ｑｕｏｔｅ＝Ｆ）
ｗｒｉｔｅ．ｔａｂｌｅ（ｇｅｎｏ＿ｒｓｅｑ， “ｇｅｎｏ＿Ｖａｌ＿Ｒｓｅｑ．ｃｓｖ”，ｓｅｐ＝ “ ，”，ｒｏｗ．ｎａｍｅｓ＝Ｆ，ｑｕｏｔｅ＝Ｆ） f) Utilizing R to merge reference groups and verify individual genotypes, the method is as follows:
# library (data.table) to unload the required R packages
# read the classification information obtained by chip and rearrange respectively chip <- as. matrix(fread(“geno_op_chip.csv”))
rseq <- as. matrix(fread(“geno_op_rseq.csv”))
ref <-fread("ref.raw")
# Remove the first 6 columns in the file ref[, c(1:6) := NULL]
ref <- as. matrix (ref)
# Combine genotype files and output genotype file after combination geno_chip <- rbind(chip, ref)
geno_rseq <-rbind(rseq, ref)
write. table(geno_chip, "geno_Val_Chip.csv", sep = ",", row.names = F, quote = F)
write. table(geno_rseq, "geno_Val_Rseq.csv", sep = ",", row.names = F, quote = F)

ｇ）ｇ）において取得した２つのｘｘｘ．ｃｓｖファイルがＲにおいて加重ＧＢＬＵＰ方法を用いてＧＥＢＶを推定する。具体的な操作方法は以下のとおりである（Ｌｉｎｕｘ環境）：
＃必要なＲパッケージと関数をアンロードする
ｌｉｂｒａｒｙ（ｐａｒａｌｌｅｌ）
ｌｉｂｒａｒｙ（ｄａｔａ．ｔａｂｌｅ）
ｌｉｂｒａｒｙ（ａｓｒｅｍｌ）
ｌｉｂｒａｒｙ（ｐＲＯＣ）
ｓｏｕｒｃｅ（“ｇｉｎｖ．Ｒ”）
関数ｇｉｎｖの定義は以下のとおりである：
ｇｉｎｖ＜－ｆｕｎｃｔｉｏｎ（ｉｎｖＧ）｛
Ｇｉｎｖ＜－ｄａｔａ．ｆｒａｍｅ（ｒｏｗ＝ｒｅｐ（１：ｎｒｏｗ（ｉｎｖＧ），ｎｒｏｗ（ｉｎｖＧ）），ｃｏｌｕｍｎ＝ｒｅｐ（１：ｎｒｏｗ（ｉｎｖＧ），ｅａｃｈ＝ｎｒｏｗ（ｉｎｖＧ）），ｖａｌｕｅ＝ａｓ．ｎｕｍｅｒｉｃ（ｉｎｖＧ），ｌｏｗｅｒ．ｍａｔ＝ａｓ．ｌｏｇｉｃａｌ（ｌｏｗｅｒ．ｔｒｉ（ｉｎｖＧ，ｄｉａｇ＝Ｔ）））
Ｇｉｎｖ＜－Ｇｉｎｖ［Ｇｉｎｖ＄ｌｏｗｅｒ．ｍａｔ＝＝Ｔ，ｃ（“ｒｏｗ”， “ｃｏｌｕｍｎ”， “ｖａｌｕｅ”）］
Ｇｉｎｖ＜－Ｇｉｎｖ［ｏｒｄｅｒ（Ｇｉｎｖ＄ｒｏｗ，Ｇｉｎｖ＄ｃｏｌｕｍｎ），］
ｒｅｔｕｒｎ（Ｇｉｎｖ）
｝
＃遺伝子型、表現型情報を読み取る
ｇｅｎｏ＿ｃｈｉｐ＜－ａｓ．ｍａｔｒｉｘ（ｆｒｅａｄ（“ｇｅｎｏ＿Ｖａｌ＿Ｃｈｉｐ．ｃｓｖ”，ｎＴｈｒｅａｄ＝１０））
ｇｅｎｏ＿ｒｓｅｑ＜－ａｓ．ｍａｔｒｉｘ（ｆｒｅａｄ（“ｇｅｎｏ＿Ｖａｌ＿Ｒｓｅｑ．ｃｓｖ”，ｎＴｈｒｅａｄ＝１０））
ｐｈｅｎｏ＜－ａｓｒｅｍｌ．ｒｅａｄ．ｔａｂｌｅ（“ｐｈｅｎｏ＿ｏｐ＿Ｖａｌ．ｃｓｖ”，ｈｅａｄｅｒ＝Ｔ，ｓｅｐ＝“ ，”）
ｒｎａｍｅｓ＜－ａｓ．ｍａｔｒｉｘ（ｆｒｅａｄ（“ｐｈｅｎｏ＿ｏｐ＿Ｖａｌ．ｃｓｖ”））［，１］
Ｍ＿１＜－ｇｅｎｏ＿ｃｈｉｐ
Ｍ＿２＜－ｇｅｎｏ＿ｒｓｅｑ
＃各サイトの二次等位遺伝子頻度を計算する
ｐｉ＿１＜－ｒｏｕｎｄ（ｃｏｌＳｕｍｓ（Ｍ＿１）／（２＊ｎｒｏｗ（Ｍ＿１）），３）
ｐｉ＿２＜－ｒｏｕｎｄ（ｃｏｌＳｕｍｓ（Ｍ＿２）／（２＊ｎｒｏｗ（Ｍ＿２）），３）
＃Ｐマトリックスを構築する
Ｐ＿１＜－ｍａｔｒｉｘ（２＊ｐｉ＿１，ｂｙｒｏｗ＝Ｔ，ｎｒｏｗ＝ｎｒｏｗ（Ｍ＿１），ｎｃｏｌ＝ｎｃｏｌ（Ｍ＿１））
Ｐ＿２＜－ｍａｔｒｉｘ（２＊ｐｉ＿２，ｂｙｒｏｗ＝Ｔ，ｎｒｏｗ＝ｎｒｏｗ（Ｍ＿２），ｎｃｏｌ＝ｎｃｏｌ（Ｍ＿２））
＃Ｚマトリックスを構築する
Ｚ＿１＜－ａｓ．ｍａｔｒｉｘ（Ｍ＿１－Ｐ＿１）
Ｚ＿２＜－ａｓ．ｍａｔｒｉｘ（Ｍ＿２－Ｐ＿２）
＃等式分子の項を構築する
ＺＺｔ＿１＜－ｄｏ．ｃａｌｌ（‘ｒｂｉｎｄ’，ｍｃｌａｐｐｌｙ（１：ｎｒｏｗ（Ｚ＿１），ＦＵＮ＝ｆｕｎｃｔｉｏｎ（ｘ）｛ｔｃｒｏｓｓｐｒｏｄ（Ｚ＿１［ｘ，］，Ｚｔ＿１）｝，ｍｃ．ｃｏｒｅｓ＝２０））
ＺＺｔ＿２＜－ｄｏ．ｃａｌｌ（‘ｒｂｉｎｄ’，ｍｃｌａｐｐｌｙ（１：ｎｒｏｗ（Ｚ＿２），ＦＵＮ＝ｆｕｎｃｔｉｏｎ（ｘ）｛ｔｃｒｏｓｓｐｒｏｄ（Ｚ＿２［ｘ，］，Ｚｔ＿２）｝，ｍｃ．ｃｏｒｅｓ＝２０））
＃等式分母の項を構築する
ｄｅｎｏｍ＿１＜－２＊（ｓｕｍ（ｐｉ＿１＊（１－ｐｉ＿１）））
ｄｅｎｏｍ＿２＜－２＊（ｓｕｍ（ｐｉ＿２＊（１－ｐｉ＿２）））
＃Ｇマトリックスを構築する
Ｇ＿ｃｈｉｐ＜－ＺＺｔ＿１／ｄｅｎｏｍ＿１
Ｇ＿ｒｓｅｑ＜－ＺＺｔ＿２／ｄｅｎｏｍ＿２
ｄｉａｇ（Ｇ＿ｃｈｉｐ）＜－ｄｉａｇ（Ｇ＿ｃｈｉｐ）＋０．０１
ｄｉａｇ（Ｇ＿ｒｓｅｑ）＜－ｄｉａｇ（Ｇ＿ｒｓｅｑ）＋０．０１
＃Ｇマトリックスインバージョン
ｉｎｖＧ＿ｃｈｉｐ０＜－ｓｏｌｖｅ（Ｇ＿ｃｈｉｐ）
ｉｎｖＧ＿ｒｓｅｑ０＜－ｓｏｌｖｅ（Ｇ＿ｒｓｅｑ）
＃ＧマトリックスインバージョンをＡＳＲｅｍｌが利用可能な三列の形式に変換し、かつ出力する
Ｇｉｎｖ＿ｃｈｉｐ＜－ｇｉｎｖ（ｉｎｖＧ＿ｃｈｉｐ）
Ｇｉｎｖ＿ｒｓｅｑ＜－ｇｉｎｖ（ｉｎｖＧ＿ｒｓｅｑ）
ｗｒｉｔｅ．ｔａｂｌｅ（Ｇｉｎｖ＿ｃｈｉｐ， “Ｇｉｎｖ＿ｏｐ＿Ｖａｌ＿Ｃｈｉｐ＿ａｔ＿ｉｔｅｒ＿０．ｃｓｖ”，ｓｅｐ＝ “ ，”，ｒｏｗ．ｎａｍｅｓ＝Ｆ，ｑｕｏｔｅ＝Ｆ）
ｗｒｉｔｅ．ｔａｂｌｅ（Ｇｉｎｖ＿ｒｓｅｑ， “Ｇｉｎｖ＿ｏｐ＿Ｖａｌ＿Ｒｓｅｑ＿ａｔ＿ｉｔｅｒ＿０．ｃｓｖ”，ｓｅｐ＝ “ ，”，ｒｏｗ．ｎａｍｅｓ＝Ｆ，ｑｕｏｔｅ＝Ｆ）
＃ＧＥＢＶを計算する
ａｔｔｒ（Ｇｉｎｖ＿ｃｈｉｐ， “ｒｏｗＮａｍｅｓ”）＜－ｐａｓｔｅ（ｒｎａｍｅｓ）
ａｔｔｒ（Ｇｉｎｖ＿ｒｓｅｑ， “ｒｏｗＮａｍｅｓ”）＜－ｐａｓｔｅ（ｒｎａｍｅｓ）
ＷＧＢＬＵＰ１＜－ａｓｒｅｍｌ（ｓｔａｔｕｓ～１，
ｒａｎｄｏｍ＝～ｇｉｖ（ＩＩＤ），
ｇｉｎｖｅｒｓｅ＝ｌｉｓｔ（ＩＩＤ＝Ｇｉｎｖ＿ｃｈｉｐ），
ｒｃｏｖ＝～ｕｎｉｔｓ，
ｆａｍｉｌｙ＝ａｓｒｅｍｌ．ｂｉｎｏｍｉａｌ（ｌｉｎｋ＝ “ｌｏｇｉｔ”）
ｄａｔａ＝ｐｈｅｎｏ，
ｍａｘｉｔｅｒ＝１００，
ｎａ．ｍｅｔｈｏｄ．Ｘ＝ ‘ｏｍｉｔ’）
ｇｅｂｖ０１＜－ｃｏｅｆ（ＷＧＢＬＵＰ１）＄ｒａｎｄｏｍ
ｗｒｉｔｅ．ｔａｂｌｅ（ｇｅｂｖ０１， “ＧＥＢＶｓ＿ａｔ＿ｉｔｅｒ＿０＿ｃｈｉｐ．ｃｓｖ”，ｓｅｐ＝ “ ，”，ｒｏｗ．ｎａｍｅｓ＝Ｆ，ｑｕｏｔｅ＝Ｆ）

ＷＧＢＬＵＰ２＜－ａｓｒｅｍｌ（ｓｔａｔｕｓ～１，
ｒａｎｄｏｍ＝～ｇｉｖ（ＩＩＤ），
ｇｉｎｖｅｒｓｅ＝ｌｉｓｔ（ＩＩＤ＝Ｇｉｎｖ＿ｒｓｅｑ），
ｒｃｏｖ＝～ｕｎｉｔｓ，
ｆａｍｉｌｙ＝ａｓｒｅｍｌ．ｂｉｎｏｍｉａｌ（ｌｉｎｋ＝ “ｌｏｇｉｔ”），
ｄａｔａ＝ｐｈｅｎｏ，
ｍａｘｉｔｅｒ＝１００，
ｎａ．ｍｅｔｈｏｄ．Ｘ＝ ‘ｏｍｉｔ’）
ｇｅｂｖ０２＜－ｃｏｅｆ（ＷＧＢＬＵＰ２）＄ｒａｎｄｏｍ
ｗｒｉｔｅ．ｔａｂｌｅ（ｇｅｂｖ０２， “ＧＥＢＶｓ＿ａｔ＿ｉｔｅｒ＿０＿ｒｓｅｑ．ｃｓｖ”，ｓｅｐ＝ “ ，”，ｒｏｗ．ｎａｍｅｓ＝Ｆ，ｑｕｏｔｅ＝Ｆ）

＃ＧＢＬＵＰプロセス・チップ部分を加重し、ｆｏｒサイクルを用い、６回の加重反復過程を実行する
＃マーカーの効果値を推定する
Ｚｔ１＜－ｔ（Ｚ＿１）
ｐ１＜－ｄｏ．ｃａｌｌ（‘ｒｂｉｎｄ’，ｍｃｌａｐｐｌｙ（１：ｎｒｏｗ（Ｚｔ１），ＦＵＮ＝ｆｕｎｃｔｉｏｎ（ｘ）｛Ｚｔ１［ｘ，］％＊％ｉｎｖＧ０１｝，ｍｃ．ｃｏｒｅｓ＝５））
ｐ２＜－ｐ１％＊％ｇｅｂｖ０１
ｕ０１＜－ｐ２／ｐｐ＿１
Ｖａｒｕ０１＜－ｕ０１＊ｕ０１＊２＊ｐｉ＿１＊（１－ｐｉ＿１）
ｗｒｉｔｅ．ｔａｂｌｅ（ｕ０１， “ ｍａｒｋｅｒＥｆｆ＿ｃｈｉｐ＿ａｔ＿ｉｔｅｒ＿０．ｃｓｖ”，ｓｅｐ＝ “ ，”，ｒｏｗ．ｎａｍｅｓ＝Ｆ，ｑｕｏｔｅ＝Ｆ）
ｗｒｉｔｅ．ｔａｂｌｅ（Ｖａｒｕ０１， “ｍａｒｋｅｒＶａｒ＿ｃｈｉｐ＿ａｔ＿ｉｔｅｒ＿０．ｃｓｖ”，ｓｅｐ＝ “ ，”，ｒｏｗ．ｎａｍｅｓ＝Ｆ，ｑｕｏｔｅ＝Ｆ）
＃各マーカーの重みを推定する
Ｄ＜－ｒｅｐ（１，ｎｃｏｌ（Ｍ＿１））
Ｄ０＜－ｒｅｐ（１，ｎｃｏｌ（Ｍ＿１））
ｐｒｅ＿Ｄ＜－Ｄ０
ｕ＜－ｕ０１
ｃａｌ＜－６
ｉｎｔｅｒ＜－１
ｆｏｒ（ｔｉｎ１：ｃａｌ）｛
ｄ＜－ａｓ．ｖｅｃｔｏｒ（ｕ＊ｕ＊２＊ｐｉ＿１＊（１－ｐｉ＿１））
ｆｏｒ（ｉｉｎ１：（ｌｅｎｇｔｈ（ｄ）／ｉｎｔｅｒ））｛
ｄｉ＜－ｍｅａｎ（ｄ［（ｉｎｔｅｒ＊（ｉ－１）＋１）：（ｉｎｔｅｒ＊ｉ）］）
ｐｒｅ＿Ｄ［（ｉｎｔｅｒ＊（ｉ－１）＋１）：（ｉｎｔｅｒ＊ｉ）］＜－ｄｉ
｝
Ｄ＜－ｓｕｍ（Ｄ０）／ｓｕｍ（ｐｒｅ＿Ｄ）＊ｐｒｅ＿Ｄ
ｗｒｉｔｅ．ｔａｂｌｅ（Ｄ，ｆｉｌｅ＝ｐａｓｔｅ（“ｗｅｉｇｈｔｓ＿ｃｈｉｐ＿ａｔ＿ｉｔｅｒ＿”，ｔ， “ ．ｃｓｖ”，ｓｅｐ＝“ ”），ｓｅｐ＝“ ，”，ｒｏｗ．ｎａｍｅｓ＝Ｆ，ｑｕｏｔｅ＝Ｆ）
＃加重Ｇマトリックスを推定する
ｐ１＜－ｄｏ．ｃａｌｌ（‘ｃｂｉｎｄ’，ｍｃｌａｐｐｌｙ（１：ｎｃｏｌ（Ｚ＿１），ＦＵＮ＝ｆｕｎｃｔｉｏｎ（ｘ）｛Ｚ＿１［，ｘ］＊Ｄ［ｘ］｝，ｍｃ．ｃｏｒｅｓ＝５））
ｐ２＜－ｄｏ．ｃａｌｌ（‘ｒｂｉｎｄ’，ｍｃｌａｐｐｌｙ（１：ｎｒｏｗ（ｐ１），ＦＵＮ＝ｆｕｎｃｔｉｏｎ（ｘ）｛ｐ１［ｘ，］％＊％Ｚｔ１｝，ｍｃ．ｃｏｒｅｓ＝５））
Ｇ＜－ｐ２／ｐｐ＿１
ｗｒｉｔｅ．ｔａｂｌｅ（Ｇ，ｆｉｌｅ＝ｐａｓｔｅ（“Ｇ＿ｃｈｉｐ＿ａｔ＿ｉｔｅｒ＿”，ｔ， “．ｃｓｖ”，ｓｅｐ＝“ ”），ｓｅｐ＝“ ，”，ｒｏｗ．ｎａｍｅｓ＝Ｆ，ｑｕｏｔｅ＝Ｆ）
ｄｉａｇ（Ｇ）＜－ｄｉａｇ（Ｇ）＋０．０１
ｉｎｖＧ＜－ｓｏｌｖｅ（Ｇ）
ｗｒｉｔｅ．ｔａｂｌｅ（ｉｎｖＧ，ｆｉｌｅ＝ｐａｓｔｅ（“ｉｎｖＧ＿ｃｈｉｐ＿ａｔ＿ｉｔｅｒ＿”，ｔ， “．ｃｓｖ”，ｓｅｐ＝“ ”），ｓｅｐ＝“，”，ｒｏｗ．ｎａｍｅｓ＝Ｆ，ｑｕｏｔｅ＝Ｆ）
Ｇｉｎｖ＜－ｇｉｎｖ（ｉｎｖＧ）
ｗｒｉｔｅ．ｔａｂｌｅ（Ｇｉｎｖ，ｆｉｌｅ＝ｐａｓｔｅ（“Ｇｉｎｖ＿ｃｈｉｐ＿ａｔ＿ｉｔｅｒ＿”，ｔ， “．ｃｓｖ”，ｓｅｐ＝“ ”），ｓｅｐ＝“ ，”，ｒｏｗ．ｎａｍｅｓ＝Ｆ，ｑｕｏｔｅ＝Ｆ）
ａｔｔｒ（Ｇｉｎｖ，“ｒｏｗＮａｍｅｓ”）＜－ｐａｓｔｅ（ｒｎａｍｅｓ）
ＷＧＢＬＵＰ＜－ａｓｒｅｍｌ（ｓｔａｔｕｓ～１，
ｒａｎｄｏｍ＝～ｇｉｖ（ＩＩＤ），
ｇｉｎｖｅｒｓｅ＝ｌｉｓｔ（ＩＩＤ＝Ｇｉｎｖ），
ｒｃｏｖ＝～ｕｎｉｔｓ，
ｆａｍｉｌｙ＝ａｓｒｅｍｌ．ｂｉｎｏｍｉａｌ（ｌｉｎｋ＝ “ｌｏｇｉｔ”），
ｄａｔａ＝ｐｈｅｎｏ，
ｍａｘｉｔｅｒ＝１００，
ｎａ．ｍｅｔｈｏｄ．Ｘ＝‘ｏｍｉｔ’）
ｇｅｂｖ＜－ｃｏｅｆ（ＷＧＢＬＵＰ）＄ｒａｎｄｏｍ
ｗｒｉｔｅ．ｔａｂｌｅ（ｇｅｂｖ，ｆｉｌｅ＝ｐａｓｔｅ（“ＧＥＢＶｓ＿ｃｈｉｐ＿ａｔ＿ｉｔｅｒ＿”，ｔ， “．ｃｓｖ”，ｓｅｐ＝“ ”），ｓｅｐ＝ “ ，”，ｒｏｗ．ｎａｍｅｓ＝Ｆ，ｑｕｏｔｅ＝Ｆ）
＃加重後マーカーの効果値を推定する
ｐ１＜－ｄｏ．ｃａｌｌ（‘ｒｂｉｎｄ’，ｍｃｌａｐｐｌｙ（１：ｎｒｏｗ（Ｚｔ１），ＦＵＮ＝ｆｕｎｃｔｉｏｎ（ｘ）｛Ｄ［ｘ］＊Ｚｔ１［ｘ，］｝，ｍｃ．ｃｏｒｅｓ＝５））
ｐ２＜－ｄｏ．ｃａｌｌ（‘ｒｂｉｎｄ’，ｍｃｌａｐｐｌｙ（１：ｎｒｏｗ（ｐ１），ＦＵＮ＝ｆｕｎｃｔｉｏｎ（ｘ）｛ｐ１［ｘ，］％＊％ｉｎｖＧ｝，ｍｃ．ｃｏｒｅｓ＝５））
ｐ３＜－ｄｏ．ｃａｌｌ（‘ｒｂｉｎｄ’，ｍｃｌａｐｐｌｙ（１：ｎｒｏｗ（ｐ２），ＦＵＮ＝ｆｕｎｃｔｉｏｎ（ｘ）｛ｐ２［ｘ，］％＊％ｇｅｂｖ０１｝，ｍｃ．ｃｏｒｅｓ＝５））
ｕ＜－ｐ３／ｐｐ＿１
Ｖａｒｕ＜－ｕ＊ｕ＊２＊ｐｉ＿１＊（１－ｐｉ＿１）
ｗｒｉｔｅ．ｔａｂｌｅ（ｕ，ｆｉｌｅ＝ｐａｓｔｅ（“ｍａｒｋｅｒＥｆｆ＿ｃｈｉｐ＿ａｔ＿ｉｔｅｒ＿”，ｔ， “．ｃｓｖ”，ｓｅｐ＝“ ”），ｓｅｐ＝ “ ，”，ｒｏｗ．ｎａｍｅｓ＝Ｆ，ｑｕｏｔｅ＝Ｆ）
ｗｒｉｔｅ．ｔａｂｌｅ（Ｖａｒｕ，ｆｉｌｅ＝ｐａｓｔｅ（“ ｍａｒｋｅｒＶａｒ＿ｃｈｉｐ＿ａｔ＿ｉｔｅｒ＿”，ｔ， “．ｃｓｖ”，ｓｅｐ＝“ ”），ｓｅｐ＝ “ ，”，ｒｏｗ．ｎａｍｅｓ＝Ｆ，ｑｕｏｔｅ＝Ｆ）
｝
＃ＧＢＬＵＰプロセス・再配列部分を加重し、ｆｏｒサイクルを用い、６回の加重反復過程を実行する
Ｚｔ２＜－ｔ（Ｚ＿２）
ｐ１＜－ｄｏ．ｃａｌｌ（‘ｒｂｉｎｄ’，ｍｃｌａｐｐｌｙ（１：ｎｒｏｗ（Ｚｔ２），ＦＵＮ＝ｆｕｎｃｔｉｏｎ（ｘ）｛Ｚｔ２［ｘ，］％＊％ｉｎｖＧ０２｝，ｍｃ．ｃｏｒｅｓ＝５））
ｐ２＜－ｐ１％＊％ｇｅｂｖ０２
ｕ０２＜－ｐ２／ｐｐ＿２
Ｖａｒｕ０２＜－ｕ０２＊ｕ０２＊２＊ｐｉ＿２＊（１－ｐｉ＿２）
ｗｒｉｔｅ．ｔａｂｌｅ（ｕ０２， “ ｍａｒｋｅｒＥｆｆ＿ｒｓｅｑ＿ａｔ＿ｉｔｅｒ＿０．ｃｓｖ”，ｓｅｐ＝ “ ，”，ｒｏｗ．ｎａｍｅｓ＝Ｆ，ｑｕｏｔｅ＝Ｆ）
ｗｒｉｔｅ．ｔａｂｌｅ（Ｖａｒｕ０２， “ ｍａｒｋｅｒＶａｒ＿ｒｓｅｑ＿ａｔ＿ｉｔｅｒ＿０．ｃｓｖ”，ｓｅｐ＝ “ ，”，ｒｏｗ．ｎａｍｅｓ＝Ｆ，ｑｕｏｔｅ＝Ｆ）
＃各マーカーの重みを推定する
Ｄ＜－ｒｅｐ（１，ｎｃｏｌ（Ｍ＿２））
Ｄ０＜－ｒｅｐ（１，ｎｃｏｌ（Ｍ＿２））
ｐｒｅ＿Ｄ＜－Ｄ０
ｕ＜－ｕ０２
ｃａｌ＜－６
ｉｎｔｅｒ＜－１
ｆｏｒ（ｔｉｎ１：ｃａｌ）｛
ｄ＜－ａｓ．ｖｅｃｔｏｒ（ｕ＊ｕ＊２＊ｐｉ＿２＊（１－ｐｉ＿２））
ｆｏｒ（ｉｉｎ１：（ｌｅｎｇｔｈ（ｄ）／ｉｎｔｅｒ））｛
ｄｉ＜－ｍｅａｎ（ｄ［（ｉｎｔｅｒ＊（ｉ－１）＋１）：（ｉｎｔｅｒ＊ｉ）］）
ｐｒｅ＿Ｄ［（ｉｎｔｅｒ＊（ｉ－１）＋１）：（ｉｎｔｅｒ＊ｉ）］＜－ｄｉ
｝
Ｄ＜－ｓｕｍ（Ｄ０）／ｓｕｍ（ｐｒｅ＿Ｄ）＊ｐｒｅ＿Ｄ
ｗｒｉｔｅ．ｔａｂｌｅ（Ｄ，ｆｉｌｅ＝ｐａｓｔｅ（“ ｗｅｉｇｈｔｓ＿ｒｓｅｑ＿ａｔ＿ｉｔｅｒ＿”，ｔ， “ ．ｃｓｖ”，ｓｅｐ＝“ ”），ｓｅｐ＝“ ，”，ｒｏｗ．ｎａｍｅｓ＝Ｆ，ｑｕｏｔｅ＝Ｆ）
＃加重Ｇマトリックスを推定する
ｐ１＜－ｄｏ．ｃａｌｌ（‘ｃｂｉｎｄ’，ｍｃｌａｐｐｌｙ（１：ｎｃｏｌ（Ｚ＿２），ＦＵＮ＝ｆｕｎｃｔｉｏｎ（ｘ）｛Ｚ＿２［，ｘ］＊Ｄ［ｘ］｝，ｍｃ．ｃｏｒｅｓ＝５））
ｐ２＜－ｄｏ．ｃａｌｌ（‘ｒｂｉｎｄ’，ｍｃｌａｐｐｌｙ（１：ｎｒｏｗ（ｐ１），ＦＵＮ＝ｆｕｎｃｔｉｏｎ（ｘ）｛ｐ１［ｘ，］％＊％Ｚｔ２｝，ｍｃ．ｃｏｒｅｓ＝５））
Ｇ＜－ｐ２／ｐｐ＿２
ｗｒｉｔｅ．ｔａｂｌｅ（Ｇ，ｆｉｌｅ＝ｐａｓｔｅ（“Ｇ＿ｒｓｅｑ＿ａｔ＿ｉｔｅｒ＿”，ｔ， “．ｃｓｖ”，ｓｅｐ＝“ ”），ｓｅｐ＝“ ，”，ｒｏｗ．ｎａｍｅｓ＝Ｆ，ｑｕｏｔｅ＝Ｆ）
ｄｉａｇ（Ｇ）＜－ｄｉａｇ（Ｇ）＋０．０１
ｉｎｖＧ＜－ｓｏｌｖｅ（Ｇ）
ｗｒｉｔｅ．ｔａｂｌｅ（ｉｎｖＧ，ｆｉｌｅ＝ｐａｓｔｅ（“ｉｎｖＧ＿ｒｓｅｑ＿ａｔ＿ｉｔｅｒ＿”，ｔ， “ ．ｃｓｖ”，ｓｅｐ＝“ ”），ｓｅｐ＝“ ，”，ｒｏｗ．ｎａｍｅｓ＝Ｆ，ｑｕｏｔｅ＝Ｆ）
Ｇｉｎｖ＜－ｇｉｎｖ（ｉｎｖＧ）
ｗｒｉｔｅ．ｔａｂｌｅ（Ｇｉｎｖ，ｆｉｌｅ＝ｐａｓｔｅ（“Ｇｉｎｖ＿ｒｓｅｑ＿ａｔ＿ｉｔｅｒ＿”，ｔ， “ ．ｃｓｖ”，ｓｅｐ＝“ ”），ｓｅｐ＝“ ，”，ｒｏｗ．ｎａｍｅｓ＝Ｆ，ｑｕｏｔｅ＝Ｆ）
ａｔｔｒ（Ｇｉｎｖ，“ ｒｏｗＮａｍｅｓ”）＜－ｐａｓｔｅ（ｒｎａｍｅｓ）
ＷＧＢＬＵＰ＜－ａｓｒｅｍｌ（ｓｔａｔｕｓ～１，
ｒａｎｄｏｍ＝～ｇｉｖ（ＩＩＤ），
ｇｉｎｖｅｒｓｅ＝ｌｉｓｔ（ＩＩＤ＝Ｇｉｎｖ），
ｒｃｏｖ＝～ｕｎｉｔｓ，
ｆａｍｉｌｙ＝ａｓｒｅｍｌ．ｂｉｎｏｍｉａｌ（ｌｉｎｋ＝ ”ｌｏｇｉｔ”），
ｄａｔａ＝ｐｈｅｎｏ，
ｍａｘｉｔｅｒ＝１００，
ｎａ．ｍｅｔｈｏｄ．Ｘ＝ ‘ｏｍｉｔ’）
ｇｅｂｖ＜－ｃｏｅｆ（ＷＧＢＬＵＰ）＄ｒａｎｄｏｍ
ｗｒｉｔｅ．ｔａｂｌｅ（ｇｅｂｖ，ｆｉｌｅ＝ｐａｓｔｅ（“ ＧＥＢＶｓ＿ｒｓｅｑ＿ａｔ＿ｉｔｅｒ＿”，ｔ， “ ．ｃｓｖ”，ｓｅｐ＝“ ”），ｓｅｐ＝ “ ，”，ｒｏｗ．ｎａｍｅｓ＝Ｆ，ｑｕｏｔｅ＝Ｆ）
＃加重後マーカーの効果値を推定する
ｐ１＜－ｄｏ．ｃａｌｌ（’ｒｂｉｎｄ’，ｍｃｌａｐｐｌｙ（１：ｎｒｏｗ（Ｚｔ２），ＦＵＮ＝ｆｕｎｃｔｉｏｎ（ｘ）｛Ｄ［ｘ］＊Ｚｔ２［ｘ，］｝，ｍｃ．ｃｏｒｅｓ＝５））
ｐ２＜－ｄｏ．ｃａｌｌ（‘ｒｂｉｎｄ’，ｍｃｌａｐｐｌｙ（１：ｎｒｏｗ（ｐ１），ＦＵＮ＝ｆｕｎｃｔｉｏｎ（ｘ）｛ｐ１［ｘ，］％＊％ｉｎｖＧ｝，ｍｃ．ｃｏｒｅｓ＝５））
ｐ３＜－ｄｏ．ｃａｌｌ（‘ｒｂｉｎｄ’，ｍｃｌａｐｐｌｙ（１：ｎｒｏｗ（ｐ２），ＦＵＮ＝ｆｕｎｃｔｉｏｎ（ｘ）｛ｐ２［ｘ，］％＊％ｇｅｂｖ０２｝，ｍｃ．ｃｏｒｅｓ＝５））
ｕ＜－ｐ３／ｐｐ＿２
Ｖａｒｕ＜－ｕ＊ｕ＊２＊ｐｉ＿２＊（１－ｐｉ＿２）
ｗｒｉｔｅ．ｔａｂｌｅ（ｕ，ｆｉｌｅ＝ｐａｓｔｅ（“ｍａｒｋｅｒＥｆｆ＿ｒｓｅｑ＿ａｔ＿ｉｔｅｒ＿”，ｔ， “．ｃｓｖ”，ｓｅｐ＝“ ”），ｓｅｐ＝ “ ，”，ｒｏｗ．ｎａｍｅｓ＝Ｆ，ｑｕｏｔｅ＝Ｆ）
ｗｒｉｔｅ．ｔａｂｌｅ（Ｖａｒｕ，ｆｉｌｅ＝ｐａｓｔｅ（“ ｍａｒｋｅｒＶａｒ＿ｒｓｅｑ＿ａｔ＿ｉｔｅｒ＿”，ｔ， “ ．ｃｓｖ”，ｓｅｐ＝“ ”），ｓｅｐ＝ “ ，”，ｒｏｗ．ｎａｍｅｓ＝Ｆ，ｑｕｏｔｅ＝Ｆ）
｝
計算を経て、第４回反復した時に加重ＧＢＬＵＰ方法は安定に近づき、そのためこの時の反復結果について後続の研究を実施する。検証に用いられる９５の個体ＧＥＢＶの間における関連係数は０．９５８で、これらの個体はＧＥＢＶは表６による： g) the two xxx. A csv file estimates the GEBV using the weighted GBLUP method in R. The specific operation method is as follows (Linux environment):
# library (parallel) to unload the required R packages and functions
library(data.table)
library (asreml)
library (pROC)
source("ginv.R")
The definition of the function ginv is:
ginv <- function(invG) {
Ginv <- data. frame(row = rep(1: nrow(invG), nrow(invG)), column = rep(1: nrow(invG), each = nrow(invG)), value = as.numeric(invG), lower.mat = as.logical(lower.tri(invG,diag=T)))
Ginv <- Ginv [Ginv$lower. mat == T, c(“row”, “column”, “value”)]
Ginv <- Ginv [order(Ginv$row, Ginv$column), ]
return (Ginv)
}
# geno_chip to read genotype and phenotype information <- as. matrix(fread("geno_Val_Chip.csv", nThread=10))
geno_rseq <- as. matrix(fread("geno_Val_Rseq.csv", nThread=10))
pheno <- asreml. read. table("pheno_op_Val.csv", header=T, sep=",")
rnames <- as. matrix(fread(“pheno_op_Val.csv”)) [, 1]
M_1 <- geno_chip
M_2 <- geno_rseq
# Calculate the secondary coordinate gene frequency of each site pi_1 <- round(colSums(M_1)/(2*nrow(M_1)), 3)
pi_2 <- round(colSums(M_2)/(2*nrow(M_2)), 3)
# Construct a P matrix P_1 <- matrix(2*pi_1, byrow = T, nrow = nrow(M_1), ncol = ncol(M_1))
P_2<−matrix(2*pi_2, byrow=T, nrow=nrow(M_2), ncol=ncol(M_2))
# Construct Z matrix Z_1 <- as. matrix(M_1 - P_1)
Z_2 <- as. matrix(M_2-P_2)
# Build the terms of the equation numerator ZZt_1 <- do. call('rbind', mclapply(1: nrow(Z_1), FUN = function(x) {tcrossprod(Z_1[x, ], Zt_1)}, mc.cores = 20))
ZZt_2 <- do. call('rbind', mclapply(1: nrow(Z_2), FUN = function(x) {tcrossprod(Z_2[x, ], Zt_2)}, mc. cores = 20))
# Build the terms of the equation denominator denom_1 <- 2 * (sum(pi_1 * (1 - pi_1)))
denom_2<−2*(sum(pi_2*(1−pi_2)))
# Build the G matrix G_chip <- ZZt_1/denom_1
G_ rseq <- ZZt_2 / denom_2
diag(G_chip) <− diag(G_chip) + 0.01
diag(G_rseq) <− diag(G_rseq) + 0.01
# G matrix inversion invG_chip0 <- solve(G_chip)
invG_rseq0 <- solve(G_rseq)
# Ginv_chip <- ginv(invG_chip) which converts and outputs the G matrix inversion to a three-column format that ASReml can use
Ginv_rseq <- ginv(invG_rseq)
write. table(Ginv_chip, "Ginv_op_Val_Chip_at_iter_0.csv", sep = ",", row.names = F, quote = F)
write. table(Ginv_rseq, "Ginv_op_Val_Rseq_at_iter_0.csv", sep = ",", row.names = F, quote = F)
# Calculate GEBV attr(Ginv_chip, "rowNames") <- paste(rnames)
attr(Ginv_rseq, "rowNames") <- paste(rnames)
WGBLUP1 <-asreml(status ~ 1,
random = ~ giv(IID),
ginverse = list(IID = Ginv_chip),
rcov = ~units,
family = asreml. binomial (link = “logit”)
data = pheno,
maxiter = 100,
na. method. X = 'omit')
gebv01 <- coef(WGBLUP1)$random
write. table(gebv01, "GEBVs_at_iter_0_chip.csv", sep = ",", row.names = F, quote = F)

WGBLUP2 <-asreml(status ~ 1,
random = ~ giv(IID),
ginverse = list(IID = Ginv_rseq),
rcov = ~units,
family = asreml. binomial(link = “logit”),
data = pheno,
maxiter = 100,
na. method. X = 'omit')
gebv02 <- coef(WGBLUP2)$random
write. table(gebv02, "GEBVs_at_iter_0_rseq.csv", sep = ",", row.names = F, quote = F)

# Weight the GBLUP process chip part and use the for cycle and run 6 weighted iterations # Estimate the effect value of the marker Zt1 <- t(Z_1)
p1 <- do. call('rbind', mclapply(1: nrow(Zt1), FUN = function(x) {Zt1[x, ] %*% invG01}, mc. cores = 5))
p2 <- p1 %*% gebv01
u01 <- p2/pp_1
Varu01<-u01*u01*2*pi_1*(1-pi_1)
write. table(u01, "markerEff_chip_at_iter_0.csv", sep = ",", row.names = F, quote = F)
write. table(Varu01, "markerVar_chip_at_iter_0.csv", sep = ",", row.names = F, quote = F)
# Estimate the weight of each marker D <- rep(1, ncol(M_1))
D0 <- rep(1, ncol(M_1))
pre_D <- D0
u <-u01
cal <-6
inter <- 1
for(t in 1:cal) {
d <-as. vector(u*u*2*pi_1*(1-pi_1))
for(i in 1: (length(d)/inter)) {
di <−mean(d[(inter*(i−1)+1):(inter*i)])
pre_D[(inter * (i - 1) + 1): (inter * i)] <- di
}
D<−sum(D0)/sum(pre_D)*pre_D
write. table(D, file = paste("weights_chip_at_iter_", t, ".csv", sep = ""), sep = ",", row.names = F, quote = F)
# Estimate the weighted G matrix p1 <- do. call('cbind', mclapply(1: ncol(Z_1), FUN = function(x) {Z_1[, x]*D[x]}, mc. cores = 5))
p2 <- do. call('rbind', mclapply(1: nrow(p1), FUN = function(x) {p1[x, ] %*% Zt1}, mc. cores = 5))
G<-p2/pp_1
write. table(G, file = paste("G_chip_at_iter_", t, ".csv", sep = ""), sep = ",", row.names = F, quote = F)
diag(G) <− diag(G) + 0.01
invG <- solve(G)
write. table(invG, file = paste("invG_chip_at_iter_", t, ".csv", sep = ""), sep = ",", row.names = F, quote = F)
Ginv<-ginv(invG)
write. table(Ginv, file = paste("Ginv_chip_at_iter_", t, ".csv", sep = ""), sep = ",", row.names = F, quote = F)
attr(Ginv, "rowNames") <- paste(rnames)
WGBLUP <-asreml(status ~ 1,
random = ~ giv(IID),
ginverse = list(IID = Ginv),
rcov = ~units,
family = asreml. binomial(link = “logit”),
data = pheno,
maxiter = 100,
na. method. X = 'omit')
gebv <- coef(WGBLUP)$random
write. table(gebv, file = paste("GEBVs_chip_at_iter_", t, ".csv", sep = ""), sep = ",", row.names = F, quote = F)
# Estimate the effect value of the weighted marker p1 <- do. call('rbind', mclapply(1: nrow(Zt1), FUN = function(x) {D[x] * Zt1[x, ]}, mc. cores = 5))
p2 <- do. call('rbind', mclapply(1: nrow(p1), FUN = function(x) {p1[x, ] %*% invG}, mc. cores = 5))
p3 <- do. call('rbind', mclapply(1: nrow(p2), FUN = function(x) {p2[x, ] %*% gebv01}, mc. cores = 5))
u<−p3/pp_1
Varu<−u*u*2*pi_1*(1−pi_1)
write. table(u, file = paste("markerEff_chip_at_iter_", t, ".csv", sep = ""), sep = ",", row.names = F, quote = F)
write. table(Varu, file = paste("markerVar_chip_at_iter_", t, ".csv", sep = ""), sep = ",", row.names = F, quote = F)
}
# GBLUP process - weight the rearrangement part, use the for cycle and perform 6 weighted iterations Zt2 <- t(Z_2)
p1 <- do. call('rbind', mclapply(1: nrow(Zt2), FUN = function(x) {Zt2[x, ] %*% invG02}, mc. cores = 5))
p2 <- p1 %*% gebv02
u02 <- p2/pp_2
Varu02<-u02*u02*2*pi_2*(1-pi_2)
write. table(u02, "markerEff_rseq_at_iter_0.csv", sep = ",", row.names = F, quote = F)
write. table(Varu02, "markerVar_rseq_at_iter_0.csv", sep = ",", row.names = F, quote = F)
# Estimate the weight of each marker D <- rep(1, ncol(M_2))
D0 <- rep(1, ncol(M_2))
pre_D <- D0
u <-u02
cal <-6
inter <- 1
for(t in 1:cal) {
d <-as. vector(u*u*2*pi_2*(1-pi_2))
for(i in 1: (length(d)/inter)) {
di <−mean(d[(inter*(i−1)+1):(inter*i)])
pre_D[(inter * (i - 1) + 1): (inter * i)] <- di
}
D<−sum(D0)/sum(pre_D)*pre_D
write. table(D, file = paste("weights_rseq_at_iter_", t, ".csv", sep = ""), sep = ",", row.names = F, quote = F)
# Estimate the weighted G matrix p1 <- do. call('cbind', mclapply(1: ncol(Z_2), FUN = function(x) {Z_2[, x]*D[x]}, mc.cores = 5))
p2 <- do. call('rbind', mclapply(1: nrow(p1), FUN = function(x) {p1[x, ] %*% Zt2}, mc. cores = 5))
G<-p2/pp_2
write. table(G, file = paste("G_rseq_at_iter_", t, ".csv", sep = ""), sep = ",", row.names = F, quote = F)
diag(G) <− diag(G) + 0.01
invG <- solve(G)
write. table(invG, file = paste("invG_rseq_at_iter_", t, ".csv", sep = ""), sep = ",", row.names = F, quote = F)
Ginv<-ginv(invG)
write. table(Ginv, file = paste("Ginv_rseq_at_iter_", t, ".csv", sep = ""), sep = ",", row.names = F, quote = F)
attr(Ginv, "rowNames") <- paste(rnames)
WGBLUP <-asreml(status ~ 1,
random = ~ giv(IID),
ginverse = list(IID = Ginv),
rcov = ~units,
family = asreml. binomial (link = "logit"),
data = pheno,
maxiter = 100,
na. method. X = 'omit')
gebv <- coef(WGBLUP)$random
write. table(gebv, file = paste("GEBVs_rseq_at_iter_", t, ".csv", sep = ""), sep = ",", row.names = F, quote = F)
# Estimate the effect value of the weighted marker p1 <- do. call('rbind', mclapply(1: nrow(Zt2), FUN = function(x) {D[x]*Zt2[x, ]}, mc. cores = 5))
p2 <- do. call('rbind', mclapply(1: nrow(p1), FUN = function(x) {p1[x, ] %*% invG}, mc. cores = 5))
p3 <- do. call('rbind', mclapply(1: nrow(p2), FUN = function(x) {p2[x, ] %*% gebv02}, mc. cores = 5))
u<−p3/pp_2
Varu<−u*u*2*pi_2*(1−pi_2)
write. table(u, file = paste("markerEff_rseq_at_iter_", t, ".csv", sep = ""), sep = ",", row.names = F, quote = F)
write. table(Varu, file = paste("markerVar_rseq_at_iter_", t, ".csv", sep = ""), sep = ",", row.names = F, quote = F)
}
Through computation, the weighted GBLUP method approaches stability at the fourth iteration, so subsequent studies are performed on the iteration results at this time. The association coefficient among the 95 individuals GEBV used for validation was 0.958, and these individuals had GEBV according to Table 6:

２、「魚チップ１号」遺伝子チップサイトが参考グループにおける検証
発明者の既存の参考グループから「魚チップ１号」遺伝子チップの設計サイトを抽出し、これらのサイト情報を利用して加重ＧＢＬＵＰを実施し、かつ５倍の交差検証方法のランダムな組分けを用いて加重ＧＢＬＵＰ予測正確性の評価方法として、被検測者の操作特徴曲線下面積（ＡＵＣ）を加重ＧＢＬＵＰの評価方法で正確性の指標とする。解析モデルは一般化線形混合モデルを用いる。組分けのランダム誤差を減らすために、データセットに１０回の組分けを行い、各組は５回計算する。そのため、延べ５０回計算し、５０回のＡＵＣの平均値を最終的な評価結果とする。 2. "Fish chip No. 1" gene chip site is verified in the reference group The design site of "fish chip No. 1" gene chip is extracted from the existing reference group of the inventor, and weighted GBLUP is performed using these site information. and using a random grouping of a 5-fold cross-validation method as an assessment method for weighted GBLUP predictive accuracy, the subject's area under the operating characteristic curve (AUC) was measured as a weighted GBLUP assessment method for accuracy. as an indicator of The analytical model uses a generalized linear mixed model. To reduce the random error of grouping, the data set is grouped 10 times and each group is calculated 5 times. Therefore, a total of 50 calculations are performed, and the average value of the 50 AUCs is used as the final evaluation result.

解析結果によると、ヒラメ参考グループにおいてＳＮＰチップと同様であるマーカーを用いてゲノム選択を実施し、ＡＵＣ（正確性）値が０．８８５で、従来のＢＬＵＰ方法によるＡＵＣ（０．５７９）値より高い。そのため、発明者が設計されるチップサイトを用いてはゲノム選択を順調かつ高効率に実施することができる。 According to the analysis results, the genome selection was performed using the same markers as the SNP chip in the flounder reference group, and the AUC (accuracy) value was 0.885, which was higher than the AUC (0.579) value by the conventional BLUP method. high. Therefore, using the chip site designed by the inventor, genome selection can be carried out smoothly and highly efficiently.

具体的な操作方法は以下のとおりである：
ｆｃＧＥＮＥ、ＢＥＡＧＬＥとＰＬＩＮＫソフトウェアを利用してヒラメ参考グループから抽出されるチップ設計サイト：欠損サイトを充填しかつ遺伝子型ファイルを出力し、コマンドは以下のとおりである：
ｆｃｇｅｎｅ－－ｐｅｄｇｅｎｏ＿ｏｐ＿Ｒｓｅｑ＿ｃｈｉｐ．ｐｅｄ－－ｍａｐｇｅｎｏ＿ｏｐ＿Ｒｓｅｑ＿ｃｈｉｐ．ｍａｐ－－ｏｆｏｒｍａｔｂｅａｇｌｅ－－ｏｕｔｐｌｉｎｋ２ｂｅａｇｌｅ

ｊａｖａ－Ｘｍｘ５１２０ｍ－ｊａｒｂｅａｇｌｅ．ｊａｒｕｎｐｈａｓｅｄ＝ｐｌｉｎｋ２ｂｅａｇｌｅ．ｂｇｌｍｉｓｓｉｎｇ＝０ｎｉｔｅｒａｔｉｏｎｓ＝２０ｇｐｒｏｂｓ＝ｔｒｕｅｏｕｔ＝ｉｍｐｕｔｅｄ＿ｇｅｎｏ

ｇｕｎｚｉｐｉｍｐｕｔｅｄ＿ｇｅｎｏ．ｐｌｉｎｋ２ｂｅａｇｌｅ．ｂｇｌ．ｐｈａｓｅｄ．ｇｚ

ｆｃｇｅｎｅ－－ｂｇｌｉｍｐｕｔｅｄ＿ｇｅｎｏ．ｐｌｉｎｋ２ｂｅａｇｌｅ．ｂｇｌ．ｐｈａｓｅｄ－－ｐｅｄｉｎｆｏｐｌｉｎｋ２ｂｅａｇｌｅ＿ｐｅｄｉｎｆｏ．ｔｘｔ－－ｓｎｐｉｎｆｏｐｌｉｎｋ２ｂｅａｇｌｅ＿ｓｎｐｉｎｆｏ．ｔｘｔ－－ｏｆｏｒｍａｔｐｌｉｎｋ－－ｏｕｔｂｅａｇｌｅ２ｐｌｉｎｋ

ｐｌｉｎｋ－－ｆｉｌｅｂｅａｇｌｅ２ｐｌｉｎｋ－－ｒｅｃｏｄｅＡ－－ｏｕｔｇｅｎｏｔｙｐｅ＿ｏｐ＿ｃｈｉｐ＿Ｒｓｅｑ
さらにＲに以下のコマンドを入力する：
ｌｉｂｒａｒｙ（ｄａｔａ．ｔａｂｌｅ）
ｇｅｎｏ＜－ｆｒｅａｄ（“ｇｅｎｏｔｙｐｅ＿ｏｐ＿ｃｈｉｐ＿Ｒｓｅｑ．ｒａｗ”）
ｇｅｎｏ［，ｃ（１：６）：＝ＮＵＬＬ］
ｆｗｒｉｔｅ（ｇｅｎｏ， “ｇｅｎｏｔｙｐｅ＿ｏｐ＿ｃｈｉｐ＿Ｒｓｅｑ．ｃｓｖ”，ｓｅｐ＝ “ ，”，ｒｏｗ．ｎａｍｅｓ＝Ｆ，ｑｕｏｔｅ＝Ｆ） The specific operation method is as follows:
Chip design sites extracted from the flounder reference group using fcGENE, BEAGLE and PLINK software: fill in the missing sites and output the genotype file, the commands are as follows:
fcgene --ped geno_op_Rseq_chip. ped --map geno_op_Rseq_chip. map --format beagle --out plink2beagle

java -Xmx5120m -jar beagle. jar unphased=plink2beagle. bgl missing=0 niterations=20 gprobs=true out=imputed_geno

gunzip imputed_geno. plink2beagle. bgl. phased. gz

fcgene--bgl imputed_geno. plink2beagle. bgl. phased --pedinfo plink2beagle_pedinfo. txt --snpinfo plink2beagle_snpinfo. txt --format plink --out beagle2plink

plink --file beagle2plink --record A --out genotype_op_chip_Rseq
Then enter the following command in R:
library(data.table)
geno <-fread("genotype_op_chip_Rseq.raw")
geno[, c(1:6) := NULL]
fwrite(geno, "genotype_op_chip_Rseq.csv", sep = ",", row.names = F, quote = F)

ａ）Ｒにおいてａ）で取得した遺伝子型ファイルを利用してＲにおいて加重ＧＢＬＵＰを実施する。加重ＧＢＬＵＰの具体的な方法は１）におけるｈ）部分を参照して実施する。 a) Perform a weighted GBLUP in R using the genotype file obtained in a). A specific method of weighted GBLUP is performed with reference to part h) in 1).

ｂ）構築済みの加重ＧマトリックスをＡＳＲｅｍｌ－Ｒに代入して交差検証方法を行う。交差検証方法前に組分けを実施する必要がある：Ｒにおいて関数ｓａｍｐｌｅ（１：９３１、９３１）を用いてすべての個体にランダムソートを行い、さらにソート後のデジタルを５列に分け、各列に含まれる要素個数はそれぞれ１８６・１８６・１８６・１８６と１８７である。上記の過程を１０回繰り返し、延べ１０のファイルを取得する。この１０のファイルを同一のフォルダに入れて使用に備える。解析は一般化線形混合モデルを用い、異なる検測ロットと個体齢を固定効果として、各個体はランダム効果として適合を実施する。５倍の交差検証方法の具体的な実施方法は以下のとおりである：
＃必要なＲパッケージと関数をアンロードする
ｌｉｂｒａｒｙ（ｐａｒａｌｌｅｌ）
ｌｉｂｒａｒｙ（ａｓｒｅｍｌ）
ｌｉｂｒａｒｙ（ｐＲＯＣ）
＃表現型、Ｇマトリックスインバージョンの三列形式を読み取る
ｐｈｅｎｏ＜－ａｓｒｅｍｌ．ｒｅａｄ．ｔａｂｌｅ（“ｐｈｅｎｏｔｙｐｅ＿９３１．ｃｓｖ”，ｈｅａｄｅｒ＝Ｔ，ｓｅｐ＝“ ，”，ｎａ．ｓｔｒｉｎｇ＝ＮＡ）
ｐｅｄ＜－ａｓｒｅｍｌ．ｒｅａｄ．ｔａｂｌｅ（“ｐｅｄｉｇｒｅｅ＿９３１．ｃｓｖ”，ｈｅａｄｅｒ＝Ｔ，ｓｅｐ＝“ ，”，ｎａ．ｓｔｒｉｎｇ＝ＮＡ）
ａｉｎｖ＜－ａｓｒｅｍｌ．Ａｉｎｖｅｒｓｅ（ｐｅｄ）＄ｇｉｎｖ
Ｇｉｎｖ＜－ｆｒｅａｄ（“…／Ｇｉｎｖ＿ａｔ＿ｉｔｅｒ＿４．ｃｓｖ ”）
ａｔｔｒ（Ｇｉｎｖ，“ｒｏｗＮａｍｅｓ”）＜－ｐａｓｔｅ（ｐｈｅｎｏ［，１］）
＃外部サイクルの回数を設定する
Ｎ＜－１０
＃結果変数を設定する
ｒｅｓ＜－ｍａｔｒｉｘ（ＮＡ，ｎｒｏｗ＝５＊Ｎ，ｎｃｏｌ＝１）
ｃｏｌｎａｍｅｓ（ｒｅｓ）＜－ｃ（”ａｕｃ”）
＃交差検証方法を行いかつ検証結果を出力する
ｆｏｒ（ｉｉｎ１：Ｎ）｛
ｃｏｏｒ＜－ｒｅａｄ．ｔａｂｌｅ（ｆｉｌｅ＝ｐａｓｔｅ（“．／ｃｏｏｒ／ｃｏｏｒ＿”，ｉ， “．ｃｓｖ”，ｓｅｐ＝“ ”），ｓｅｐ＝ “ ，”，ｈｅａｄｅｒ＝Ｆ）
ｃｙｃ＜－ｎｃｏｌ（ｃｏｏｒ）
ｇｅｂｖ＜－ｍａｔｒｉｘ（ＮＡ，ｎｒｏｗ＝ｎｒｏｗ（ｐｈｅｎｏ），ｎｃｏｌ＝ｃｙｃ）
ｆｏｒ（ｊｉｎ１：ｃｙｃ）｛
ｙ＜－ｐｈｅｎｏ
ｙ＄ｓｔａｔｕｓ［ｃｏｏｒ［（１：ｓｕｍ（ｃｏｏｒ［，ｊ］＞０，ｎａ．ｒｍ＝Ｔ）），ｊ］］＜－ＮＡ
ＣＶ＜－ａｓｒｅｍｌ（ｓｔａｔｕｓ～Ｂａｔｃｈ＋Ａｇｅ，
ｒａｎｄｏｍ＝～ｇｉｖ（ＡｎｉｍａｌＩＤ），
ｇｉｎｖｅｒｓｅ＝ｌｉｓｔ（ＡｎｉｍａｌＩＤ＝Ｇｉｎｖ），
ｒｃｏｖ＝～ｕｎｉｔｓ，
ｆａｍｉｌｙ＝ａｓｒｅｍｌ．ｂｉｎｏｍｉａｌ（ｌｉｎｋ＝ “ｌｏｇｉｔ”），
ｄａｔａ＝ｙ，
ｍａｘｉｔｅｒ＝５０）

ｇｅｂｖ［，ｊ］＜－ｃｏｅｆ（ＣＶ）＄ｒａｎｄｏｍ
ｗｒｉｔｅ．ｔａｂｌｅ（ｇｅｂｖ，ｆｉｌｅ＝ｐａｓｔｅ（“ＧＥＢＶｓ＿ｃｈｉｐ＿ｒｅｆ＿ｃｏｏｒ＿”，ｉ， “ ．ｃｓｖ”，ｓｅｐ＝“ ”），ｓｅｐ＝“ ，”，ｒｏｗ．ｎａｍｅｓ＝Ｆ，ｑｕｏｔｅ＝Ｆ）
ｒｅｓ［（ｊ＋（ｉ－１）＊ｃｙｃ），］＜－ｒｏｃ（ａｓ．ｖｅｃｔｏｒ（ｐｈｅｎｏ＄ｓｔａｔｕｓ［ｃｏｏｒ［（１：ｓｕｍ（ｃｏｏｒ［，ｊ］＞０，ｎａ．ｒｍ＝Ｔ）），ｊ］］），ａｓ．ｖｅｃｔｏｒ（ｇｅｂｖ［ｃｏｏｒ［（１：ｓｕｍ（ｃｏｏｒ［，ｊ］＞０，ｎａ．ｒｍ＝Ｔ）），ｊ］，ｊ］））＄ａｕｃ
ｗｒｉｔｅ．ｔａｂｌｅ（ｒｅｓ， “ｒｅｓｕｌｔｓ＿ｏｆ＿ｒｏｃ＿ｏｐ＿ｃｈｉｐ＿ｒｅｆ．ｃｓｖ”，ｓｅｐ＝ “ ，”，ｒｏｗ．ｎａｍｅｓ＝Ｆ，ｑｕｏｔｅ＝Ｆ）
｝
｝
＃従来のＢＬＵＰの推定方法は加重ＧＢＬＵＰと同様で、モデルにおいてａｉｎｖを用いてＧｉｎｖを代替すればよい
上記のコードを実行した後、加重ＧＢＬＵＰ方法によるＡＵＣ平均値は０．８８５で、従来のＢＬＵＰ方法によるＡＵＣ平均値は０．５７９である。５０回の交差検証方法の結果は表７による： b) Substitute the pre-constructed weighted G matrix into ASReml-R to perform the cross-validation method. Before the cross-validation method, it is necessary to perform grouping: in R, use the function sample(1:931, 931) to perform random sorting on all individuals, and further divide the digital after sorting into 5 columns, each column are 186, 186, 186, 186 and 187, respectively. The above process is repeated 10 times to obtain a total of 10 files. Place these 10 files in the same folder and prepare for use. The analysis uses a generalized linear mixed model, with different test lots and individual ages as fixed effects and each individual as a random effect for fitting. A specific implementation of the 5-fold cross-validation method is as follows:
# library (parallel) to unload the required R packages and functions
library (asreml)
library (pROC)
# Phenotype, pheno <- asreml. read. table("phenotype_931.csv", header=T, sep=",", na.string=NA)
ped <- asreml. read. table("pedigree_931.csv", header=T, sep=",", na.string=NA)
ainv <- asreml. Ainverse (ped) $ginv
Ginv<-fread(".../Ginv_at_iter_4.csv")
attr(Ginv, "rowNames") <- paste(pheno[, 1])
# Set the number of external cycles N <- 10
# Set result variables res <- matrix(NA, nrow = 5*N, ncol = 1)
colnames(res) <- c("auc")
# Perform cross-validation method and output validation result for (i in 1: N) {
coor <- read. table(file = paste("./coor/coor_", i, ".csv", sep = ""), sep = ",", header = F)
cyc <- ncol(coor)
gebv <- matrix(NA, nrow = nrow(pheno), ncol = cyc)
for (j in 1: cyc) {
y <- pheno
y$status[coor[(1: sum(coor[, j] > 0, na.rm = T)), j]] <- NA
CV<−asreml(status ~ Batch + Age,
random = ~ giv(AnimalID),
ginverse = list(AnimalID = Ginv),
rcov = ~units,
family = asreml. binomial(link = “logit”),
data = y,
maxiter = 50)

gebv[,j] <- coef(CV)$random
write. table(gebv, file = paste("GEBVs_chip_ref_coor_", i, ".csv", sep = ""), sep = ",", row.names = F, quote = F)
res[(j+(i-1)*cyc), ] <- roc(as.vector(pheno$status[coor[(1:sum(coor[, j]>0, na.rm=T)), j]]), as.vector(gebv[coor[(1: sum(coor[, j] > 0, na.rm = T)), j], j])) $auc
write. table(res, "results_of_roc_op_chip_ref.csv", sep = ",", row.names = F, quote = F)
}
}
# The conventional BLUP estimation method is similar to the weighted GBLUP, and we can use ainv to replace Ginv in the model The mean AUC by method is 0.579. The results of the 50-fold cross-validation method are according to Table 7:

３、「魚チップ１号」遺伝子チップがヒラメ耐病性育種における応用
候補個体のゲノム推定育種価（ＧＥＢＶ）を推定するために、まず候補個体遺伝子型（「魚チップ１号」遺伝子チップ分類から取得される）を発明者の既存の参考グループ遺伝子型と組み合わせ、さらにＲを用いて加重Ｇマトリックスを構築し、最後に用意済みの加重Ｇマトリックスと表現型データをＡＳＲｅｍｌ－Ｒに代入し、加重ＧＢＬＵＰ方法を用いて各家系成体のＧＥＢＶを推定し、さらに成体ＧＥＢＶの平均値を相応の家系のＧＥＢＶとする。各家系を感染生存率に従って高生存率家系（生存率が５５％より高い）と低生存率家系（生存率が５５％より低い）、かつ各家系ＧＥＢＶと感染生存率の間のＡＵＣ値を計算し、さらに該ＡＵＣ値を２で取得したＡＵＣ値と比較し、近いひいては２で取得したＡＵＣより高い場合、発明者が設計される遺伝子チップがゲノム選択技術の要件を満たしかつヒラメ耐病性選別において優れた応用効果を有することを示す。各家系ＧＥＢＶと感染生存率の間のＡＵＣ値を計算しかつ該ＡＵＣ値を２で取得したＡＵＣ値と比較し、それにより発明者が設計される遺伝子チップとゲノム選択技術がヒラメ耐病性選別における実際の応用効果を検証するために用いられるもの。ＡＵＣ値を推定する前、子の代の感染生存率を

で変換を実施する。変換後、平均値より高い家系の生存率を１と記載し、平均値より低いものを０と記載する。遺伝子チップ方法がマーカーに対する要件を満たし、すべての候補個体の分類結果に組み合わせを行い、各チップの分類結果を単独で参考グループ遺伝子型と合併する必要がある。 3. Application of "fish chip No. 1" gene chip in flounder disease resistance breeding ) is combined with the inventor's existing reference group genotypes, and R is used to construct a weighted G matrix, and finally the prepared weighted G matrix and phenotypic data are substituted into ASReml-R, weighted GBLUP The method is used to estimate the GEBV of each pedigree adult, and the mean of the adult GEBV is taken as the GEBV of the corresponding pedigree. Each family was classified according to infection survival rate, high survival rate families (>55% survival rate) and low survival rate families (less than 55% survival rate), and AUC values between each family's GEBV and infection survival rate. and further compare the AUC value with the AUC value obtained in 2, and if it is close or even higher than the AUC obtained in 2, the gene chip designed by the inventor meets the requirements of genome selection technology and flounder disease resistance It shows that it has excellent application effect in sorting. Calculate the AUC value between each pedigree GEBV and the infection survival rate and compare the AUC value with the AUC value obtained in 2, whereby the gene chip and genome selection technology designed by the inventors is used for flounder disease resistance screening. It is used to verify the actual application effect in Before estimating the AUC value, the infection survival rate of offspring was

to perform the conversion. After transformation, family survival rates above the mean are described as 1 and those below the mean as 0. The gene chip method should meet the requirements for markers, combine the classification results of all candidate individuals, and merge the classification results of each chip alone with the reference group genotype.

解析結果によると１６の候補群の子の代家系における６の高生存率家系の平均生存率が６２．４％であるおよび１０の低生存率家系の平均的な生存率が３３．４７％（表８）であることがわかる。そのうち、高生存率家系成体の平均的なＧＥＢＶが２．１０で、低生存率家系成体の平均的なＧＥＢＶが１．３４である。計算はよると、これらのヒラメ家系のＧＥＢＶ値を利用してはその感染生存率が予測される正確性が０．７９４に達することができ、理論値に近い。そのため、発明者が設計される遺伝子チップはヒラメ耐病性性状の選別に効果的に応用することができる。 The results of the analysis showed that in the 16 candidate groups of progeny of offspring, 6 high survival families had a mean survival rate of 62.4% and 10 low survival families had a mean survival rate of 33.47% ( Table 8). Of which, the average GEBV for adults in high survival families is 2.10, and the average GEBV for adults in low survival families is 1.34. Calculations show that using the GEBV values of these flounder families, the accuracy of predicting their infection survival rate can reach 0.794, which is close to the theoretical value. Therefore, the gene chip designed by the inventors can be effectively applied to the screening of flounder for disease-resistant traits.

具体的な操作方法は以下のとおりである：
ＰＬＩＮＫとＲを利用して候補個体分類ファイルを利用し、さらに候補個体から後続検証に用いられる個体を選定し、これらの個体情報を一つのテキストファイルに貯蔵し、家系番号・個体番号・父本番号・母本番号・性別と表現型値の配列に従ってテキストファイルを用意し、各行に一つの個体で、各個体の各項目情報はｔａｂｌｅ区切り記号を用いて区切りを実施する。発明を実施するための形態：
＃各ＳＮＰチップの分類結果を読み取り、結果はｘｘｘ．ｂｅｄ、ｘｘｘ．ｂｉｍとｘｘｘ．ｆａｍに貯蔵される。
ｐｌｉｎｋ－－ｖｃｆｏｐ１．ｖｃｆ－－ｍａｋｅ－ｂｅｄ－－ｏｕｔｏｐ１
ｐｌｉｎｋ－－ｖｃｆｏｐ２．ｖｃｆ－－ｍａｋｅ－ｂｅｄ－－ｏｕｔｏｐ２
ｐｌｉｎｋ－－ｖｃｆｏｐ３．ｖｃｆ－－ｍａｋｅ－ｂｅｄ－－ｏｕｔｏｐ３

Ｒにおいて各ファイルにおけるＳＮＰの命名方法を変換し、具体的な操作方法は以下のとおりである：
＃Ｒパッケージをアンロードする
ｌｉｂｒａｒｙ（ｄａｔａ．ｔａｂｌｅ）
＃データを読み取る
ｏｐ１＜－ｆｒｅａｄ（“ｏｐ１．ｂｉｍ”，ｈｅａｄｅｒ＝Ｆ）
ｏｐ２＜－ｆｒｅａｄ（“ｏｐ２．ｂｉｍ”，ｈｅａｄｅｒ＝Ｆ）
ｏｐ３＜－ｆｒｅａｄ（“ｏｐ３．ｂｉｍ”，ｈｅａｄｅｒ＝Ｆ）
＃ＳＮＰ名称を変換する
ｏｐ１＄Ｖ２＜－ｐａｓｔｅ（ｐａｓｔｅ（ｒｅｐ（“ｒｓ”，ｎｒｏｗ（ｏｐ１）），ｏｐ１＄Ｖ１，ｓｅｐ＝“ ”），ｏｐ１＄Ｖ４，ｓｅｐ＝ “：”）
ｏｐ２＄Ｖ２＜－ｐａｓｔｅ（ｐａｓｔｅ（ｒｅｐ（“ｒｓ”，ｎｒｏｗ（ｏｐ２）），ｏｐ２＄Ｖ１，ｓｅｐ＝“ ”），ｏｐ２＄Ｖ４，ｓｅｐ＝ “：”）
ｏｐ３＄Ｖ２＜－ｐａｓｔｅ（ｐａｓｔｅ（ｒｅｐ（“ｒｓ”，ｎｒｏｗ（ｏｐ３）），ｏｐ３＄Ｖ１，ｓｅｐ＝“ ”），ｏｐ３＄Ｖ４，ｓｅｐ＝ “：”）
＃各チップが取得した分類情報を出力する
ｗｒｉｔｅ．ｔａｂｌｅ（ｏｐ１＄Ｖ２， “ｓｎｐｓ＿ｏｐ１．ｔｘｔ”，ｓｅｐ＝ “￥ｔ”，ｃｏｌ．ｎａｍｅｓ＝Ｆ，ｒｏｗ．ｎａｍｅｓ＝Ｆ，ｑｕｏｔｅ＝Ｆ）
ｗｒｉｔｅ．ｔａｂｌｅ（ｏｐ２＄Ｖ２， “ｓｎｐｓ＿ｏｐ２．ｔｘｔ”，ｓｅｐ＝ “￥ｔ”，ｃｏｌ．ｎａｍｅｓ＝Ｆ，ｒｏｗ．ｎａｍｅｓ＝Ｆ，ｑｕｏｔｅ＝Ｆ）
ｗｒｉｔｅ．ｔａｂｌｅ（ｏｐ３＄Ｖ２， “ｓｎｐｓ＿ｏｐ３．ｔｘｔ”，ｓｅｐ＝ “￥ｔ”，ｃｏｌ．ｎａｍｅｓ＝Ｆ，ｒｏｗ．ｎａｍｅｓ＝Ｆ，ｑｕｏｔｅ＝Ｆ）
＃整理後のｂｉｍファイルを出力する
ｆｗｒｉｔｅ（ｏｐ１， “ｏｐ１．ｂｉｍ”，ｓｅｐ＝ “￥ｔ”，ｃｏｌ．ｎａｍｅｓ＝Ｆ，ｒｏｗ．ｎａｍｅｓ＝Ｆ，ｑｕｏｔｅ＝Ｆ）
ｆｗｒｉｔｅ（ｏｐ２， “ｏｐ２．ｂｉｍ”，ｓｅｐ＝ “￥ｔ”，ｃｏｌ．ｎａｍｅｓ＝Ｆ，ｒｏｗ．ｎａｍｅｓ＝Ｆ，ｑｕｏｔｅ＝Ｆ）
ｆｗｒｉｔｅ（ｏｐ３， “ｏｐ３．ｂｉｍ”，ｓｅｐ＝ “￥ｔ”，ｃｏｌ．ｎａｍｅｓ＝Ｆ，ｒｏｗ．ｎａｍｅｓ＝Ｆ，ｑｕｏｔｅ＝Ｆ）
＃Ｒを終了する

検証個体情報を用意し、かつタイトルが「ｓｅｌｅｃｔｅｄ＿ｉｎｄｉ．ｔｘｔ」のファイルに入れ、ファイルは以下の方式に従って整理する：
１７０１．ＣＥＬ１７０１．ＣＥＬ０００－９
１７０２．ＣＥＬ１７０２．ＣＥＬ０００－９
＃ＰＬＩＮＫソフトウェアを利用して各ＳＮＰチップに含まれる個体を抽出し、その方式は以下のとおりである：
ｐｌｉｎｋ－－ｂｆｉｌｅｏｐ１－－ｋｅｅｐｓｅｌｅｃｔｅｄ＿ｉｎｄｉ．ｔｘｔ－－ｍａｋｅ－ｂｅｄ－－ｏｕｔｏｐ＿ｃａｎ＿１
ｐｌｉｎｋ－－ｂｆｉｌｅｏｐ２－－ｋｅｅｐｓｅｌｅｃｔｅｄ＿ｉｎｄｉ．ｔｘｔ－－ｍａｋｅ－ｂｅｄ－－ｏｕｔｏｐ＿ｃａｎ＿２
ｐｌｉｎｋ－－ｂｆｉｌｅｏｐ３－－ｋｅｅｐｓｅｌｅｃｔｅｄ＿ｉｎｄｉ．ｔｘｔ－－ｍａｋｅ－ｂｅｄ－－ｏｕｔｏｐ＿ｃａｎ＿３

＃検証に用いられる個体の遺伝子型ファイルを出力する
ｐｌｉｎｋ－－ｂｆｉｌｅｏｐ＿ｃａｎ＿１－－ｒｅｃｏｄｅＡ－－ｏｕｔｏｐ１
ｐｌｉｎｋ－－ｂｆｉｌｅｏｐ＿ｃａｎ＿２－－ｒｅｃｏｄｅＡ－－ｏｕｔｏｐ２
ｐｌｉｎｋ－－ｂｆｉｌｅｏｐ＿ｃａｎ＿３－－ｒｅｃｏｄｅＡ－－ｏｕｔｏｐ３
さらにＲにおいて以下のコマンドを用いて遺伝子型ファイルを出力する：
ｌｉｂｒａｒｙ（ｄａｔａ．ｔａｂｌｅ）
ｏｐ１＜－ｆｒｅａｄ（“ｏｐ１．ｒａｗ”）
ｏｐ２＜－ｆｒｅａｄ（“ｏｐ２．ｒａｗ”）
ｏｐ３＜－ｆｒｅａｄ（“ｏｐ３．ｒａｗ”）
ｏｐ１［，ｃ（１：６）：＝ＮＵＬＬ］
ｏｐ２［，ｃ（１：６）：＝ＮＵＬＬ］
ｏｐ３［，ｃ（１：６）：＝ＮＵＬＬ］
ｆｗｒｉｔｅ（ｏｐ１， “ｇｅｎｏ＿ｏｐ１＿ｃａｎ．ｃｓｖ”，ｓｅｐ＝ “ ，”，ｒｏｗ．ｎａｍｅｓ＝Ｆ，ｑｕｏｔｅ＝Ｆ）
ｆｗｒｉｔｅ（ｏｐ２， “ｇｅｎｏ＿ｏｐ２＿ｃａｎ．ｃｓｖ”，ｓｅｐ＝ “ ，”，ｒｏｗ．ｎａｍｅｓ＝Ｆ，ｑｕｏｔｅ＝Ｆ）
ｆｗｒｉｔｅ（ｏｐ３， “ｇｅｎｏ＿ｏｐ３＿ｃａｎ．ｃｓｖ”，ｓｅｐ＝ “ ，”，ｒｏｗ．ｎａｍｅｓ＝Ｆ，ｑｕｏｔｅ＝Ｆ）
ａ）ＰＬＩＮＫとＲを利用して参考グループ遺伝子型をそれぞれ各チップ候補個体遺伝子型と組み合わせを実施し、具体的な操作方法は以下のとおりである：
ｐｌｉｎｋ－－ｂｆｉｌｅ …／ｏｐ＿Ｒｅｆ－－ｅｘｔｒａｃｔｓｎｐｓ＿ｏｐ１．ｔｘｔ－－ｒｅｃｏｄｅＡ－－ｏｕｔｏｐ＿Ｒｅｆ＿ｏｐ１
ｐｌｉｎｋ－－ｂｆｉｌｅ …／ｏｐ＿Ｒｅｆ－－ｅｘｔｒａｃｔｓｎｐｓ＿ｏｐ２．ｔｘｔ－－ｒｅｃｏｄｅＡ－－ｏｕｔｏｐ＿Ｒｅｆ＿ｏｐ２
ｐｌｉｎｋ－－ｂｆｉｌｅ …／ｏｐ＿Ｒｅｆ－－ｅｘｔｒａｃｔｓｎｐｓ＿ｏｐ３．ｔｘｔ－－ｒｅｃｏｄｅＡ－－ｏｕｔｏｐ＿Ｒｅｆ＿ｏｐ３
さらにＲにおいて以下のコマンドを用い、遺伝子型ファイルを処理し、出力する：
ｌｉｂｒａｒｙ（ｄａｔａ．ｔａｂｌｅ）
＃参考グループ遺伝子型を読み取りかつそれに処理を実施する
ｏｐ１＿ｒｅｆ＜－ｆｒｅａｄ（“ｏｐ＿Ｒｅｆ＿ｏｐ１．ｒａｗ”）
ｏｐ２＿ｒｅｆ＜－ｆｒｅａｄ（“ｏｐ＿Ｒｅｆ＿ｏｐ２．ｒａｗ”）
ｏｐ３＿ｒｅｆ＜－ｆｒｅａｄ（“ｏｐ＿Ｒｅｆ＿ｏｐ３．ｒａｗ”）
ｏｐ１＿ｒｅｆ［，ｃ（１：６）：＝ＮＵＬＬ］
ｏｐ２＿ｒｅｆ［，ｃ（１：６）：＝ＮＵＬＬ］
ｏｐ３＿ｒｅｆ［，ｃ（１：６）：＝ＮＵＬＬ］
ｏｐ１＿ｒｅｆ＜－ａｓ．ｍａｔｒｉｘ（ｏｐ１＿ｒｅｆ）
ｏｐ２＿ｒｅｆ＜－ａｓ．ｍａｔｒｉｘ（ｏｐ２＿ｒｅｆ）
ｏｐ３＿ｒｅｆ＜－ａｓ．ｍａｔｒｉｘ（ｏｐ３＿ｒｅｆ）
＃候補個体遺伝子型を処理する：読み取り、組み合わせ
ｏｐ１＿ｃａｎ＜－ａｓ．ｍａｔｒｉｘ（ｆｒｅａｄ（“ｇｅｎｏ＿ｏｐ１＿ｃａｎ．ｃｓｖ”））
ｏｐ２＿ｃａｎ＜－ａｓ．ｍａｔｒｉｘ（ｆｒｅａｄ（“ｇｅｎｏ＿ｏｐ２＿ｃａｎ．ｃｓｖ”））
ｏｐ３＿ｃａｎ＜－ａｓ．ｍａｔｒｉｘ（ｆｒｅａｄ（“ｇｅｎｏ＿ｏｐ３＿ｃａｎ．ｃｓｖ”））
ｇｅｎｏ＿ｏｐ１＜－ｒｂｉｎｄ（ｏｐ１＿ｃａｎ，ｏｐ１＿ｒｅｆ）
ｇｅｎｏ＿ｏｐ２＜－ｒｂｉｎｄ（ｏｐ２＿ｃａｎ，ｏｐ２＿ｒｅｆ）
ｇｅｎｏ＿ｏｐ３＜－ｒｂｉｎｄ（ｏｐ３＿ｃａｎ，ｏｐ３＿ｒｅｆ）
ｗｒｉｔｅ．ｔａｂｌｅ（ｇｅｎｏ＿ｏｐ１， “ｇｅｎｏ＿ｏｐ１．ｃｓｖ”，ｓｅｐ＝ “ ，”，ｒｏｗ．ｎａｍｅｓ＝Ｆ，ｑｕｏｔｅ＝Ｆ）
ｗｒｉｔｅ．ｔａｂｌｅ（ｇｅｎｏ＿ｏｐ２， “ｇｅｎｏ＿ｏｐ２．ｃｓｖ”，ｓｅｐ＝“ ，”，ｒｏｗ．ｎａｍｅｓ＝Ｆ，ｑｕｏｔｅ＝Ｆ）
ｗｒｉｔｅ．ｔａｂｌｅ（ｇｅｎｏ＿ｏｐ３， “ｇｅｎｏ＿ｏｐ３．ｃｓｖ”，ｓｅｐ＝ “ ，”，ｒｏｗ．ｎａｍｅｓ＝Ｆ，ｑｕｏｔｅ＝Ｆ） The specific operation method is as follows:
PLINK and R are used to use the candidate individual classification file, select the individual to be used for subsequent verification from the candidate individual, store this individual information in a single text file, and store the family number, individual number, and father book. A text file is prepared according to the sequence of number, parental book number, sex and phenotype value, one individual per line, and each item information of each individual is separated using a table delimiter. MODES FOR CARRYING OUT THE INVENTION:
# Read the classification result of each SNP chip, the result is xxx. bed, xxx. bim and xxx. Stored in fam.
plink --vcf op1. vcf --make-bed --out op1
plink --vcf op2. vcf --make-bed --out op2
plink --vcf op3. vcf --make-bed --out op3

Convert the SNP naming method in each file in R, the specific operation method is as follows:
# unload R package library(data.table)
# read data op1 <- fread("op1.bim", header = F)
op2 <-fread("op2.bim", header = F)
op3 <-fread("op3.bim", header = F)
# Convert SNP names op1$V2 <- paste (paste(rep(“rs”, nrow(op1)), op1$V1, sep = “ ”), op1$V4, sep = “:”)
op2$V2<-paste(paste(rep("rs", nrow(op2)), op2$V1, sep=""), op2$V4, sep=":")
op3$V2<-paste(paste(rep("rs", nrow(op3)), op3$V1, sep=""), op3$V4, sep=":")
# output the classification information acquired by each chip write. table (op1$V2, "snps_op1.txt", sep = "\t", col.names = F, row.names = F, quote = F)
write. table (op2$V2, "snps_op2.txt", sep = "\t", col.names = F, row.names = F, quote = F)
write. table (op3$V2, "snps_op3.txt", sep = "\t", col.names = F, row.names = F, quote = F)
# fwrite (op1, “op1.bim”, sep = “\t”, col.names = F, row.names = F, quote = F) to output the bim file after organization
fwrite(op2, "op2.bim", sep = "\t", col.names = F, row.names = F, quote = F)
fwrite(op3, "op3.bim", sep = "\t", col.names = F, row.names = F, quote = F)
# Exit R

Prepare verification individual information and put it in a file with the title "selected_indi.txt", and organize the file according to the following method:
1701. CEL 1701. CEL 0 0 0 -9
1702. CEL 1702. CEL 0 0 0 -9
# Using PLINK software to extract the individuals contained in each SNP chip, the method is as follows:
plink --bfile op1 --keep selected_indi. txt --make-bed --out op_can_1
plink --bfile op2 --keep selected_indi. txt --make-bed --out op_can_2
plink --bfile op3 --keep selected_indi. txt --make-bed --out op_can_3

# Output the individual genotype file used for verification plink --bfile op_can_1 --recode A --out op1
plink --bfile op_can_2 --recode A --out op2
plink --bfile op_can_3 --recode A --out op3
In addition, output the genotype file using the following command in R:
library(data.table)
op1 <-fread("op1.raw")
op2 <-fread("op2.raw")
op3 <-fread("op3.raw")
op1[, c(1:6) := NULL]
op2[, c(1:6) := NULL]
op3[, c(1:6) := NULL]
fwrite(op1, "geno_op1_can.csv", sep = ",", row.names = F, quote = F)
fwrite(op2, "geno_op2_can.csv", sep = ",", row.names = F, quote = F)
fwrite(op3, "geno_op3_can.csv", sep = ",", row.names = F, quote = F)
a) Use PLINK and R to combine the genotypes of the reference group with the genotypes of each chip candidate individual, and the specific operation method is as follows:
plink --bfile .../op_Ref --extract snps_op1. txt --recode A --out op_Ref_op1
plink --bfile .../op_Ref --extract snps_op2. txt --recode A --out op_Ref_op2
plink --bfile .../op_Ref --extract snps_op3. txt --recode A --out op_Ref_op3
Further process and output the genotype file in R using the following command:
library(data.table)
# Read the reference group genotype and perform processing on it op1_ref <- fread("op_Ref_op1.raw")
op2_ref <-fread(“op_Ref_op2.raw”)
op3_ref <-fread(“op_Ref_op3.raw”)
op1_ref[, c(1:6) := NULL]
op2_ref[, c(1:6) := NULL]
op3_ref[, c(1:6) := NULL]
op1_ref <- as. matrix (op1_ref)
op2_ref <- as. matrix(op2_ref)
op3_ref <- as. matrix(op3_ref)
# Process candidate individual genotypes: read, combine op1_can <- as. matrix(fread(“geno_op1_can.csv”))
op2_can <- as. matrix(fread(“geno_op2_can.csv”))
op3_can <- as. matrix(fread(“geno_op3_can.csv”))
geno_op1 <-rbind(op1_can, op1_ref)
geno_op2 <- rbind(op2_can, op2_ref)
geno_op3 <-rbind(op3_can, op3_ref)
write. table(geno_op1, "geno_op1.csv", sep = ",", row.names = F, quote = F)
write. table(geno_op2, "geno_op2.csv", sep = ",", row.names = F, quote = F)
write. table(geno_op3, "geno_op3.csv", sep = ",", row.names = F, quote = F)

ｂ）ｂ）における処理済みの４つの遺伝子型ファイルを利用し、Ｒにおいてそれぞれ加重Ｇマトリックスを構築し、加重Ｇマトリックスの構築方法は１）における記述と同様である b) Use the four genotype files processed in b) to construct a weighted G matrix respectively in R, and the construction method of the weighted G matrix is the same as described in 1).

ｃ）ＡＳＲｅｍｌ－Ｒを用いて候補個体のＧＥＢＶを推定し、コードは以下のとおりである：
＃必要なＲパッケージと関数をアンロードする
ｌｉｂｒａｒｙ（ｄａｔａ．ｔａｂｌｅ）
ｌｉｂｒａｒｙ（ａｓｒｅｍｌ）
＃＃＃ｏｐ１＃＃＃
ｐｈｅｎｏ＜－ａｓｒｅｍｌ．ｒｅａｄ．ｔａｂｌｅ（“ｐｈｅｎｏｔｙｐｅ＿ｏｐ１．ｃｓｖ”，ｓｅｐ＝ “ ，”，ｈｅａｄｅｒ＝Ｔ）
Ｇｉｎｖ＜－ｆｒｅａｄ（“…／Ｇｉｎｖ＿ｏｐ１．ｃｓｖ”）
ａｔｔｒ（Ｇｉｎｖ， “ｒｏｗＮａｍｅｓ”）＜－ｐａｓｔｅ（ｐｈｅｎｏ［，１］）
ｏｐ１＜－ａｓｒｅｍｌ（ｓｔａｔｕｓ～Ｂａｔｃｈ＋Ａｇｅ，ｒａｎｄｏｍ＝～ｇｉｖ（ＡｎｉｍａｌＩＤ），ｇｉｎｖｅｒｓｅ＝ｌｉｓｔ（ＡｎｉｍａｌＩＤ＝Ｇｉｎｖ），ｒｃｏｖ＝～ｕｎｉｔｓ，ｆａｍｉｌｙ＝ａｓｒｅｍｌ．ｂｉｎｏｍｉａｌ（ｌｉｎｋ＝ “ｌｏｇｉｔ”），ｎａ．ｍｅｔｈｏｄ．Ｘ＝ “ｏｍｉｔ”，ｄａｔａ＝ｐｈｅｎｏ，ｍａｘｉｔｅｒ＝５０）
ｗｒｉｔｅ．ｔａｂｌｅ（ｃｏｅｆ（ｏｐ１）＄ｒａｎｄｏｍ， “ｇｅｂｖ＿ｏｐ１．ｃｓｖ”，ｓｅｐ＝ “ ，”，ｃｏｌ．ｎａｍｅｓ＝Ｆ，ｑｕｏｔｅ＝Ｆ）

＃＃＃ｏｐ２＃＃＃
ｐｈｅｎｏ＜－ａｓｒｅｍｌ．ｒｅａｄ．ｔａｂｌｅ（“ ｐｈｅｎｏｔｙｐｅ＿ｏｐ２．ｃｓｖ”，ｓｅｐ＝ “ ，”，ｈｅａｄｅｒ＝Ｔ）
Ｇｉｎｖ＜－ｆｒｅａｄ（“ …／Ｇｉｎｖ＿ｏｐ２．ｃｓｖ”）
ａｔｔｒ（Ｇｉｎｖ， “ ｒｏｗＮａｍｅｓ”）＜－ｐａｓｔｅ（ｐｈｅｎｏ［，１］）
ｏｐ２＜－ａｓｒｅｍｌ（ｓｔａｔｕｓ～Ｂａｔｃｈ＋Ａｇｅ，ｒａｎｄｏｍ＝～ｇｉｖ（ＡｎｉｍａｌＩＤ），ｇｉｎｖｅｒｓｅ＝ｌｉｓｔ（ＡｎｉｍａｌＩＤ＝Ｇｉｎｖ），ｒｃｏｖ＝～ｕｎｉｔｓ，ｆａｍｉｌｙ＝ａｓｒｅｍｌ．ｂｉｎｏｍｉａｌ（ｌｉｎｋ＝ “ｌｏｇｉｔ”），ｎａ．ｍｅｔｈｏｄ．Ｘ＝ “ ｏｍｉｔ”，ｄａｔａ＝ｐｈｅｎｏ，ｍａｘｉｔｅｒ＝５０）
ｗｒｉｔｅ．ｔａｂｌｅ（ｃｏｅｆ（ｏｐ２）＄ｒａｎｄｏｍ， “ ｇｅｂｖ＿ｏｐ２．ｃｓｖ”，ｓｅｐ＝ “ ，”，ｃｏｌ．ｎａｍｅｓ＝Ｆ，ｑｕｏｔｅ＝Ｆ）
＃＃＃ｏｐ３＃＃＃
ｐｈｅｎｏ＜－ａｓｒｅｍｌ．ｒｅａｄ．ｔａｂｌｅ（“ ｐｈｅｎｏｔｙｐｅ＿ｏｐ３．ｃｓｖ”，ｓｅｐ＝ “ ，”，ｈｅａｄｅｒ＝Ｔ）
Ｇｉｎｖ＜－ｆｒｅａｄ（“ …／Ｇｉｎｖ＿ｏｐ３．ｃｓｖ”）
ａｔｔｒ（Ｇｉｎｖ， “ｒｏｗＮａｍｅｓ”）＜－ｐａｓｔｅ（ｐｈｅｎｏ［，１］）
ｏｐ３＜－ａｓｒｅｍｌ（ｓｔａｔｕｓ～Ｂａｔｃｈ＋Ａｇｅ，ｒａｎｄｏｍ＝～ｇｉｖ（ＡｎｉｍａｌＩＤ），ｇｉｎｖｅｒｓｅ＝ｌｉｓｔ（ＡｎｉｍａｌＩＤ＝Ｇｉｎｖ），ｒｃｏｖ＝～ｕｎｉｔｓ，ｆａｍｉｌｙ＝ａｓｒｅｍｌ．ｂｉｎｏｍｉａｌ（ｌｉｎｋ＝ “ｌｏｇｉｔ”），ｎａ．ｍｅｔｈｏｄ．Ｘ＝ “ｏｍｉｔ”，ｄａｔａ＝ｐｈｅｎｏ，ｍａｘｉｔｅｒ＝５０）
ｗｒｉｔｅ．ｔａｂｌｅ（ｃｏｅｆ（ｏｐ３）＄ｒａｎｄｏｍ， “ｇｅｂｖ＿ｏｐ３．ｃｓｖ”，ｓｅｐ＝ “ ，”，ｃｏｌ．ｎａｍｅｓ＝Ｆ，ｑｕｏｔｅ＝Ｆ） c) Estimate the candidate individual's GEBV using ASReml-R, the code is:
# library (data.table) to unload the required R packages and functions
library (asreml)
###op1###
pheno <- asreml. read. table("phenotype_op1.csv", sep = ",", header = T)
Ginv<-fread(".../Ginv_op1.csv")
attr(Ginv, "rowNames") <- paste(pheno[, 1])
op1 <- asreml(status ~ Batch + Age, random = ~ giv (AnimalID), ginverse = list (AnimalID = Ginv), rcov = ~ units, family = asreml.binomial (link = "logit"), na.method. X = “omit”, data = pheno, maxiter = 50)
write. table(coef(op1)$random, "gebv_op1.csv", sep = ",", col.names = F, quote = F)

###op2###
pheno <- asreml. read. table("phenotype_op2.csv", sep = ",", header = T)
Ginv<-fread(".../Ginv_op2.csv")
attr(Ginv, "rowNames") <- paste(pheno[, 1])
op2 <- asreml(status ~ Batch + Age, random = ~ giv (AnimalID), ginverse = list (AnimalID = Ginv), rcov = ~ units, family = asreml.binomial (link = "logit"), na.method. X = “omit”, data = pheno, maxiter = 50)
write. table(coef(op2)$random, "gebv_op2.csv", sep = ",", col.names = F, quote = F)
###op3###
pheno <- asreml. read. table("phenotype_op3.csv", sep = ",", header = T)
Ginv<-fread(".../Ginv_op3.csv")
attr(Ginv, "rowNames") <- paste(pheno[, 1])
op3 <- asreml(status ~ Batch + Age, random = ~ giv (AnimalID), ginverse = list (AnimalID = Ginv), rcov = ~ units, family = asreml.binomial (link = "logit"), na.method. X = “omit”, data = pheno, maxiter = 50)
write. table(coef(op3)$random, "gebv_op3.csv", sep = ",", col.names = F, quote = F)

ｄ）推定されるすべての候補個体のＧＥＢＶに従って相応の家系ＧＥＢＶを計算し、かつ公式

を用いて各家系の感染生存率に変換を行い、変換後の平均値より高い家系の生存率を１に設定し、平均値より低いものを０に設定する。最後に、各家系ＧＥＢＶと変換後の生存率の間のＡＵＣ値を計算し、ＡＵＣ値の計算方法は以下のとおりである：
＃必要なＲパッケージをアンロードする
ｌｉｂｒａｒｙ（ｄａｔａ．ｔａｂｌｅ）
ｌｉｂｒａｒｙ（ｐＲＯＣ）
＃各家系ＧＥＢＶおよび相応の感染生存率を一つのファイルに整理し、家系番号・ＧＥＢＶと変換後の感染生存率の配列に従って配列し、各行は一つの家系情報のみを含み、整理後のファイルを読み取る
ｒｅｓ＜－ｆｒｅａｄ（“…／ｇｅｂｖ＿ａｎｄ＿ｓｒ＿ｏｐ＿ｃａｎ．ｃｓｖ”）
＃子の代の家系生存率を変換する
ｒｅｓ＄ＳＲ＿ｔｒａｎｓ＜－ｅｘｐ（ｒｅｓ＄ＳＲ）／（１＋ｅｘｐ（ｒｅｓ＄ＳＲ））
＃変換後の生存率を平均値に従ってさらに０と１に変換する
ＳＲ＿ｂｉｎａｒｙ＜－ｍａｔｒｉｘ（ＮＡ，ｎｒｏｗ＝ｎｒｏｗ（ｒｅｓ），ｎｃｏｌ＝１）
ＳＲ＿ｂｉｎａｒｙ［ｗｈｉｃｈ（ｒｅｓ＄ＳＲ＿ｔｒａｎｓ＞ｍｅａｎ（ｒｅｓ＄ＳＲ＿ｔｒａｎｓ）），］＜－１
ＳＲ＿ｂｉｎａｒｙ［ｗｈｉｃｈ（ｒｅｓ＄ＳＲ＿ｔｒａｎｓ＜ｍｅａｎ（ｒｅｓ＄ＳＲ＿ｔｒａｎｓ）），］＜－０
＃ＡＵＣ値を計算する
ｒｏｃ（ＳＲ＿ｂｉｎａｒｙ［，１］，ｒｅｓ＄ＧＥＢＶ）
計算を経て、１６のヒラメ家系のＧＥＢＶを取得し、計算は各家系ＧＥＢＶと相応の感染生存率の間のＡＵＣ（正確性）が０．７９４であることを示す。各家系ＧＥＢＶおよび感染生存率は表８による。 d) Calculate the corresponding pedigree GEBV according to the estimated GEBV of all candidate individuals, and formula

is used to convert the infection survival rate of each family, and the survival rate of families higher than the mean after conversion is set to 1, and the survival rate lower than the mean is set to 0. Finally, the AUC value between each pedigree GEBV and post-conversion survival rate was calculated, and the method for calculating the AUC value is as follows:
# library (data.table) to unload the required R packages
library (pROC)
# Organize each family GEBV and corresponding infection survival rate into one file, arranged according to the order of family number/GEBV and infection survival rate after conversion, each line contains only one family information, file after sorting read res <- fread(".../gebv_and_sr_op_can.csv")
# Translate pedigree viability res$SR_trans <- exp(res$SR) / (1 + exp(res$SR))
# Further transform the post-transformed viability to 0 and 1 according to the mean SR_binary <- matrix(NA, nrow = nrow(res), ncol = 1)
SR_binary[which (res$SR_trans > mean(res$SR_trans)), ] <- 1
SR_binary[which (res$SR_trans < mean(res$SR_trans)), ] <- 0
# Calculate AUC value roc(SR_binary[, 1], res$GEBV)
Through calculation, the GEBV of 16 flounder families are obtained, and the calculation shows that the AUC (accuracy) between each family's GEBV and the corresponding infection survival rate is 0.794. Each kindred GEBV and infection survival rate are according to Table 8.

候補群における１６の子の代家系を感染生存率に従って６の高生存率家系（平均的な生存率が６２．４％である）と１０の低生存率家系（平均的な生存率が３３．４７％である）の二大類（表８）に分け、高生存率と低生存率家系の成体のＧＥＢＶを比較し、高生存率家系成体の平均的なＧＥＢＶが２．１０で、低生存率家系成体の平均的なＧＥＢＶが１．３４であることがわかる。計算はよると、これらのヒラメ家系のＧＥＢＶ値を使用してはその感染生存率が予測される正確性が０．７９４に達することができ、理論値に近い。そのため、発明者が設計される遺伝子チップはヒラメ耐病性性状の選別に効果的に応用することができる。

The 16 offspring in the candidate group were divided into 6 high-survival families (with an average survival rate of 62.4%) and 10 low-survival families (with an average survival rate of 33.4%) according to infection survival rate. 47%) were divided into two categories (Table 8) and compared the GEBV of adults in high-survival and low-survival families. It can be seen that the average GEBV for an adult family is 1.34. Calculations show that using the GEBV values of these flounder families, the accuracy of predicting their infection survival rate can reach 0.794, which is close to the theoretical value. Therefore, the gene chip designed by the inventors can be effectively applied to the screening of flounder for disease-resistant traits.

上記の結果によると、「魚チップ１号」遺伝子チップを用いてヒラメ候補グループの個体に遺伝子分類を行い、加重ＧＢＬＵＰを用いてゲノム育種値（ＧＥＢＶ）を計算し、ＧＥＢＶ数値の大きさに従ってヒラメ耐病性親魚の選別を行い、これらの親魚を用いて育成される後代種苗の耐感染生存率が顕著に高まり、それにより「魚チップ１号」遺伝子チップがヒラメ耐病性優良品種の育成において普及応用を実行することができることがわかる。 According to the above results, the “Fish Chip No. 1” gene chip was used to perform genetic classification on the individuals of the flounder candidate group, the weighted GBLUP was used to calculate the genomic breeding value (GEBV), and the flounder according to the magnitude of the GEBV number Disease-resistant parent fish are selected, and the infection-resistant survival rate of progeny seedlings grown using these parent fish is remarkably increased. It turns out that it is possible to execute

産業中の実施可能性
本発明より提供されるヒラメ耐病性に関連するＳＮＰローカスの遺伝子チップはヒラメ耐病性個体の選別に利用可能であり、かつ実際の選択正確性は理論値に近いため、ヒラメ耐病性優良品種の選択正確性を高め、育種期間を短縮させ、これにより、ヒラメ耐病性優良品種の選別のための遺伝子チップ技術を提供し、魚類耐病性優良品種の選別のための遺伝子チップ育種の新しい道を切り開くことができる。 Feasibility in Industry The gene chip of the SNP locus associated with flounder disease resistance provided by the present invention can be used for selection of flounder disease-resistant individuals, and the actual selection accuracy is close to the theoretical value. Increase the accuracy of selection of excellent disease-resistant varieties and shorten the breeding period, thereby providing gene chip technology for selecting excellent disease-resistant flounder varieties and gene-chip breeding for selecting excellent disease-resistant fish varieties. can open up new avenues for

Claims

A method for selecting disease-resistant individuals of flounder, wherein the disease related to disease resistance is Edwardziellasis, the method is detected using a gene chip, and the gene chip is a set of SNP locus associated with disease resistance of flounder and the set of SNP loci includes 48697 SNP loci, and the 48697 SNP loci are in 48697 sequences whose sequences are SEQ ID NO: 1-SEQ ID NO: 48697, A method for selecting a disease-resistant individual of Japanese flounder, characterized in that it is the 36th base of each sequence .

The method for selecting disease-resistant flounder individuals according to claim 1 ,
The method includes:
1) A step of extracting individual genomic DNA in the candidate group and performing detection using the gene chip to obtain results of genotyping of SNP markers;
2) From the SNP set of the reference group, the results of genotyping of SNP loci similar to those of the gene chip are extracted, and the results of genotyping of the SNPs of the reference group and genotypes of the candidate group obtained using the chip are obtained. merging the result of the determination;
3) Using the combined genotype and the type represented by the reference group, the weighted GBLUP method is used to estimate the estimated breeding value GEBV of the candidate group, and the disease resistance potential of the tested individuals based on the GEBV value. determining capabilities;
including
and using the genotype of the reference group to estimate the prediction accuracy using the weighted best linear unbiased estimation method,
Among them, a 5-fold cross-validation method is used in estimating prediction accuracy, and the area under the characteristic curve AUC is used as an index for determining prediction accuracy, and the closer the AUC is to 1, the higher the prediction accuracy. A method for selecting disease-resistant flounder individuals.