JP2022518304A - Flounder disease-resistant breeding gene chip and its applications - Google Patents

Flounder disease-resistant breeding gene chip and its applications Download PDF

Info

Publication number
JP2022518304A
JP2022518304A JP2020556756A JP2020556756A JP2022518304A JP 2022518304 A JP2022518304 A JP 2022518304A JP 2020556756 A JP2020556756 A JP 2020556756A JP 2020556756 A JP2020556756 A JP 2020556756A JP 2022518304 A JP2022518304 A JP 2022518304A
Authority
JP
Japan
Prior art keywords
chip
flounder
gene chip
breeding
disease
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
JP2020556756A
Other languages
Japanese (ja)
Other versions
JP7158496B2 (en
Inventor
松林 陳
茜 周
昇 盧
亜東 陳
洋 劉
文騰 徐
仰真 李
磊 王
英明 楊
娜 王
希紅 李
Original Assignee
中国水産科学研究院黄海水産研究所
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 中国水産科学研究院黄海水産研究所 filed Critical 中国水産科学研究院黄海水産研究所
Publication of JP2022518304A publication Critical patent/JP2022518304A/en
Application granted granted Critical
Publication of JP7158496B2 publication Critical patent/JP7158496B2/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/68Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
    • C12Q1/6876Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes
    • C12Q1/6888Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes for detection or identification of organisms
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/68Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
    • C12Q1/6813Hybridisation assays
    • C12Q1/6834Enzymatic or biochemical coupling of nucleic acids to a solid phase
    • C12Q1/6837Enzymatic or biochemical coupling of nucleic acids to a solid phase using probe arrays or probe chips
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q2600/00Oligonucleotides characterized by their use
    • C12Q2600/124Animal traits, i.e. production traits, including athletic performance or the like
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q2600/00Oligonucleotides characterized by their use
    • C12Q2600/156Polymorphic or mutational markers
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02ATECHNOLOGIES FOR ADAPTATION TO CLIMATE CHANGE
    • Y02A40/00Adaptation technologies in agriculture, forestry, livestock or agroalimentary production
    • Y02A40/80Adaptation technologies in agriculture, forestry, livestock or agroalimentary production in fisheries management
    • Y02A40/81Aquaculture, e.g. of fish

Landscapes

  • Chemical & Material Sciences (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Organic Chemistry (AREA)
  • Proteomics, Peptides & Aminoacids (AREA)
  • Analytical Chemistry (AREA)
  • Zoology (AREA)
  • Health & Medical Sciences (AREA)
  • Engineering & Computer Science (AREA)
  • Wood Science & Technology (AREA)
  • Microbiology (AREA)
  • Physics & Mathematics (AREA)
  • Molecular Biology (AREA)
  • Immunology (AREA)
  • Biotechnology (AREA)
  • Biophysics (AREA)
  • Biochemistry (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Health & Medical Sciences (AREA)
  • Genetics & Genomics (AREA)
  • Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)

Abstract

本発明はヒラメの耐病性優良品種の選別に用いられる遺伝子チップを提供し、魚類優良品種の育成において遺伝子チップが欠乏するという問題を解決し、従来の育種技術の欠点を補完し、魚類の耐病性と高生産性を兼ね備える優れた優良品種育成のために、新たな分子育種方法を提供し、魚類育種技術の世代交代を実現させ、魚類育種産業の高速な発展を推進することを目的とする。本発明が提供するラメ耐病性に関連するSNPローカスの遺伝子チップはヒラメ耐病性個体の選別に利用可能であり、かつ実際の選択正確性は理論値に近いため、ヒラメ耐病性優良品種の選択正確性を高め、育種期間を短縮させ、これにより、ヒラメ耐病性優良品種の選別のために遺伝子チップ技術を提供し、魚類耐病性優良品種の選別のための遺伝子チップ育種の新しい道を切り開くことができる。The present invention provides a gene chip used for selecting excellent disease-resistant varieties of flatfish, solves the problem of lack of gene chips in breeding excellent fish varieties, complements the shortcomings of conventional breeding techniques, and makes fish disease-resistant. The purpose is to provide a new molecular breeding method, realize a generational change in fish breeding technology, and promote the rapid development of the fish breeding industry in order to cultivate excellent excellent varieties that have both sex and high productivity. .. The gene chip of SNP locus related to flounder disease resistance provided by the present invention can be used for selection of flounder disease resistant individuals, and the actual selection accuracy is close to the theoretical value. By increasing sex and shortening the breeding period, it is possible to provide gene chip technology for the selection of excellent flounder disease resistant varieties and open up a new path for gene chip breeding for the selection of excellent fish disease resistant varieties. can.

Description

本発明は水産遺伝育種の技術分野に属し、具体的にヒラメ耐病性優良品種選別に用いられる遺伝子チップの製造方法および応用に関する。 The present invention belongs to the technical field of fishery genetic breeding, and specifically relates to a method and application for producing a gene chip used for selection of excellent flounder disease-resistant varieties.

水産養殖業は中国食品の重要なソースであり、魚類養殖業はまた水産養殖業の基幹産業でもあり、2015年に魚類養殖の生産量は28457万トンで、水産養殖生産量全体の57.6%である。養殖魚類はすでに中国タンパク質の重要な由来となった。 Aquaculture is an important source of Chinese food, and fish farming is also a key industry of fish farming. In 2015, fish farming produced 284.57 million tons, 57.6 of the total fish farming production. %. Farmed fish have already become an important source of Chinese protein.

しかし、魚類養殖業の急速な発展につれて、優良な品種が欠乏し、養殖種類の品質が劣化する。養殖規模が拡大し、集約化水準の向上および養殖環境の悪化は水産養殖病害の頻繁な発生を引き起こし、養殖製品の薬物残留の深刻化等問題も魚類養殖業の持続可能な発展を深刻に制約する。魚類のみにとって、高密度な養殖で形成した免疫抑制のため、養殖魚類の耐病性の低下を引き起こす。魚類の免疫耐病性メカニズムおよび耐病性の分子遺伝に対する研究はいまだに進まないため、分子レベルで魚類病害の予防案を提出しにくい。さらに、耐病性機能遺伝子と耐病性分子マーカーが欠乏し、耐病性優良品種の育成を行いにくく、そのため現在養殖生産は耐病性が低下する野生または人工繁殖した多世代の種苗のみに依存し、流行病が魚類養殖における頻繁な発生を引き起こす。不完全な統計によると、中国における魚類養殖業では毎年病害による直接的経済損失は100億人民元にも達する。病害はすでに中国における魚類養殖業の持続可能な発展を制約するボトルネックとなった。 However, with the rapid development of the fish farming industry, there is a shortage of good varieties and the quality of the farmed varieties deteriorates. The scale of aquaculture has expanded, the level of aquaculture has improved, and the aquaculture environment has deteriorated, causing frequent outbreaks of aquaculture diseases. do. For fish only, the immunosuppression formed by high-density farming causes a decrease in disease resistance of farmed fish. Since research on the immune disease resistance mechanism of fish and the molecular genetics of disease resistance has not progressed yet, it is difficult to submit a preventive plan for fish diseases at the molecular level. In addition, disease-resistant functional genes and disease-resistant molecular markers are deficient, making it difficult to breed excellent disease-resistant varieties, and therefore aquaculture production currently depends only on wild or artificially propagated multi-generational seedlings with reduced disease resistance and is endemic. The disease causes frequent outbreaks in fish farming. According to incomplete statistics, the direct economic loss due to disease in the fish farming industry in China reaches RMB 10 billion every year. Diseases have already become a bottleneck that constrains the sustainable development of fish farming in China.

ヒラメは世界的な海水養殖魚類で、中国における海水養殖の主導的な魚類の一つでもある。但し、ヒラメ養殖業においても病害が頻発し、死亡率が高いという問題が存在する。ヒラメ等養殖魚類を脅かす主要な病害は細菌性疾患・ウイルス性疾患を含む。その中危害が大きい疾患はそれぞれエドワジエラ症・ビブリオ症とリンホシスチス病が挙げられる。抗生物質系薬物またはワクチン等疾患予防措置は一定の効果があるが、水産養殖における病害問題を根本的に解決できない。かつ抗生物質系薬物に魚体内に蓄積しやすく、養殖魚類の商品品質を低下させ、消費者の健康に潜在的な危害を有し、病原菌に薬剤耐性および養殖環境を深刻に汚染するという問題を引き起こしやすいため、水産養殖業における応用はますます制約される。同時に、抗生物質の使用はまた人々日増しに伸びる無残留の無公害水産品の需要を満たすことができない。そのため、魚類耐病性優良品種の選別は中国の水産領域において解決が急がれる重大な課題の一つである。 Flounder is a world-class seawater-cultured fish and is one of the leading fish in seawater-cultured fish in China. However, there is a problem that diseases frequently occur in the flatfish farming industry and the mortality rate is high. Major diseases that threaten farmed fish such as flatfish include bacterial and viral diseases. Among them, the most harmful diseases are edwardsiellosis, vibrio disease and lymphocystis disease, respectively. Although disease preventive measures such as antibiotic drugs or vaccines have certain effects, they cannot fundamentally solve the disease problems in aquaculture. In addition, antibiotic-based drugs tend to accumulate in the body of fish, degrading the quality of farmed fish products, causing potential harm to consumer health, and causing serious drug resistance to pathogens and seriously contaminating the farming environment. Its propensity to cause is increasingly constraining its application in the aquaculture industry. At the same time, the use of antibiotics also fails to meet the ever-growing demand for non-residual, pollution-free marine products. Therefore, selection of excellent fish disease-resistant varieties is one of the urgent issues to be solved in the fishery field of China.

今まで、魚類優良品種の選別は主に表現型性状の選別に基づき、グループ選別・家系選別・交雑選別とBLUP選別等を含み、主に体長・体重等測定しやすい表現型値に基づいて算出される育種値に基づいて選択を行う。分子マーカーができた後、重要な経済性状に関連する分子マーカーを特定することにより経済性状に選択を行い、従来の分子マーカー補助選別に用いられる分子マーカーの数量が非常に限られ、単一遺伝子性状または品質性状の選択効果に優れたが、多重遺伝子が決定される数量性状に対する選択効果がかんばしくない。耐病性性状は複数の遺伝子が制御される数量性状であり、直接測定しにくく、選択正確性が非常に低く、そのため耐病性優良品種に対する選別は長い間伸び悩み、魚類耐病性新品種の育成を制約し、新たしい育種技術でこの課題を解決する必要が急がれる。 Until now, the selection of excellent fish varieties has been calculated mainly based on phenotypic properties, including group selection, family line selection, crossbreeding selection, BLUP selection, etc., and mainly based on phenotypic values that are easy to measure such as body length and weight. Make selections based on the breeding values that are given. After the molecular marker is created, the selection is made economically by identifying the molecular marker related to the important economic property, and the number of molecular markers used in the conventional molecular marker auxiliary selection is very limited, and a single gene is used. The selection effect of the property or quality property is excellent, but the selection effect on the quantity property in which the multiplex gene is determined is not good. Disease resistance is a quantitative property in which multiple genes are controlled, it is difficult to measure directly, and selection accuracy is very low. Therefore, selection for excellent disease resistance varieties has been sluggish for a long time, limiting the breeding of new fish disease resistance varieties. However, there is an urgent need to solve this problem with new breeding technologies.

遺伝子チップは、DNAチップ・DNAマイクロアレイともいい、フォトエッチング技術を用い、シリコンウェハを固相担体とし、選択最適化を経た大量のDNA配列をオリゴヌクレオチドに合成し、特殊な処理を経たスライドにつけ、変性・固定を経た後にDNAマイクロアレイを形成する。核酸分子交雑技術に基づき、遺伝子チップは数万ひいては数十万のDNAフラグメントに並行化交雑と解析を同時に実行でき、高スループット・並行性・高効率・サンプルの数量が少ないという利点を有する。現在、遺伝子チップはすでに人類の疾患・腫瘍の診断と、動植物の遺伝学的解析および遺伝育種に広範に応用される。動物育種において、遺伝子チップはすでに牧畜、特に乳牛・豚等種の優良品種の選別に応用されたことに成功した。例えば、乳牛において、すでにBovine3Kchip・Bovine25K・SNPchip・BovineHD700K・BovinLD7K等多くの遺伝子チップが相次いで開発される。今まで、北米・欧州・オーストラリア等はBovineSNP50Beadchipチップ(54KSNP)を用いてゲノムSNPマーカー分類検測の汎用プラットフォームとして、かつ大規模参考グループが取得したSNP分類結果に基づき、産乳量・繁殖力・耐病性等多種の経済性状のフルゲノム関連解析を行い、乳牛ゲノムの選択体系を構築する。ゲノム選択を通して新生雄牛の初期選択を実現させ、乳牛育種の世代間隔を短縮させ、遺伝的進歩を促進させ、種雄牛の選択効率を大いに向上させ、養殖と育種コストを顕著に節約する。 Gene chips, also called DNA chips and DNA microarrays, use photoetching technology to synthesize a large amount of DNA sequences that have undergone selective optimization into oligonucleotides using a silicon wafer as a solid phase carrier, and attach them to slides that have undergone special treatment. After undergoing denaturation and fixation, a DNA microarray is formed. Based on the nucleic acid molecular crossing technique, the gene chip can simultaneously perform parallel crossing and analysis on tens of thousands or even hundreds of thousands of DNA fragments, and has the advantages of high throughput, concurrency, high efficiency, and a small number of samples. Currently, gene chips are already widely applied to the diagnosis of human diseases and tumors, genetic analysis of animals and plants, and genetic breeding. In animal breeding, gene chips have already been successfully applied to livestock farming, especially to select excellent breeds such as dairy cows and pigs. For example, in dairy cows, many gene chips such as Bovine3Kchip, Bovine25K, SNPchip, BovineHD700K, and BovinLD7K have already been developed one after another. Until now, North America, Europe, Australia, etc. have used the Bovine SNP50Bedchip chip (54KSNP) as a general-purpose platform for genomic SNP marker classification and inspection, and based on the SNP classification results acquired by a large-scale reference group, milk production, fertility, etc. Perform full genome-wide association studies of various economic properties such as disease resistance, and construct a selection system for dairy cow genomes. Through genome selection, early selection of newborn bulls is achieved, intergenerational intervals of dairy cow breeding are shortened, genetic progress is promoted, breeding bull selection efficiency is greatly improved, and aquaculture and breeding costs are significantly saved.

但し、水産養殖動物に、今まで育種用遺伝子チップ、特に耐病性育種遺伝子チップの報道がいまだに見られない。 However, there have been no reports of breeding gene chips, especially disease-resistant breeding gene chips, in aquaculture animals.

本発明はヒラメの耐病性優良品種の選別に用いられる遺伝子チップを提供し、魚類優良品種の育成において遺伝子チップが欠乏するという問題を解決し、従来育種技術の欠点を補完し、魚類の耐病性と高生産性を兼ね備える優れた優良品種育成のために、新たな分子育種方法を提供し、魚類育種技術の世代交代を実現させ、魚類育種産業の高速な発展を推進することを目的とする。 The present invention provides a gene chip used for selecting excellent disease-resistant varieties of flatfish, solves the problem of lack of gene chips in breeding excellent fish varieties, complements the shortcomings of conventional breeding techniques, and provides fish disease resistance. The purpose is to provide a new molecular breeding method, realize a generational change in fish breeding technology, and promote the rapid development of the fish breeding industry in order to cultivate excellent excellent varieties that have both high productivity and high productivity.

本発明はまずヒラメ耐病性に関連するSNPローカスを提供し、前記SNPローカスは配列がSEQNO:1SEQIDNO:48697である中のいずれか一つの配列の第36位の塩基である。 The present invention first provides an SNP locus related to flounder disease resistance, and the SNP locus is the base at position 36 of any one of the sequences SEQNO: 1SEQIDNO: 48697.

本発明のSNPローカスはヒラメ耐病性優良品種の選別に利用できる。 The SNP locus of the present invention can be used for selecting excellent varieties of flounder disease resistance.

本発明が提供されるSNPローカスはヒラメ耐病性優良品種の選別用の検測製品の製造にも用いられることができる。 The SNP locus provided by the present invention can also be used in the production of a inspection product for selecting excellent varieties of flounder disease resistance.

前記検測製品は、好ましくは遺伝子チップである。 The inspection product is preferably a gene chip.

本発明のさらなる方面は、ヒラメ耐病性優良品種の選別に用いられる遺伝子チップを提供し、それはヒラメ耐病性に関連するSNPローカスを検測することができる。 A further aspect of the present invention provides a genetic chip used for selection of flounder disease resistant excellent varieties, which can detect SNP locus associated with flounder disease resistance.

本発明のさらなる方面はヒラメ耐病性個体の選別方法を提供し、この方法は、上記の遺伝子チップを用いて実行することができる。 Further aspects of the present invention provide a method for selecting flounder disease resistant individuals, which can be performed using the gene chips described above.

前記方法は、以下のステップを含む:
1)候補グループにおける個体ゲノムDNAを抽出し、かつ上記の遺伝子チップを利用して検測してSNPマーカーの遺伝子型判定の結果を取得する。
2)参考グループにおけるSNPグループより遺伝子チップと同様であるSNPローカスの遺伝子型判定の結果を抽出し、さらに参考グループのSNPの遺伝子型判定の結果と候補グループからチップを利用して取得した遺伝子型判定の結果とを合併する。
3)合併された遺伝子型と参考グループが表現する型とを利用し、加重GBLUP方法を用いて候補グループの推定育種値(GEBV)を推定し、さらにGEBV値に基づいて被検測個体の耐病性潜在能力を決定する。
The method comprises the following steps:
1) The individual genomic DNA in the candidate group is extracted, and the above gene chip is used for inspection to obtain the result of genotyping of the SNP marker.
2) The result of genotyping of SNP locus similar to the gene chip was extracted from the SNP group in the reference group, and the result of genotyping of SNP in the reference group and the genotype obtained from the candidate group using the chip. Combine with the judgment result.
3) Using the combined genotype and the type expressed by the reference group, the estimated breeding value (GEBV) of the candidate group is estimated using the weighted GBLUP method, and the disease resistance of the test-tested individual is further estimated based on the GEBV value. Determine sexual potential.

参考グループの遺伝子型を利用し、加重最良線形不偏推定量(加重GBLUP)を用いて予測正確性を推定する。そのうち、5倍交差検証方法を予測正確性の判定方法に利用し、特徴曲線下面積(AUC)を予測正確性の判定指標とある。AUCは1に近いほど、予測正確性は高い。 The genotype of the reference group is used to estimate the prediction accuracy using the weighted best linear biased estimator (weighted GBLUP). Among them, the 5-fold cross-validation method is used as the prediction accuracy determination method, and the area under the feature curve (AUC) is used as the prediction accuracy determination index. The closer the AUC is to 1, the higher the prediction accuracy.

本発明が提供されるヒラメ耐病性に関連するSNPローカスの遺伝子チップはヒラメ耐病性個体の選別に利用することが可能であり、かつ実際の選択正確性が理論値に近いため、ヒラメ耐病性優良品種の選択正確性が向上し、育種期間を短縮することができる。これにより、ヒラメ耐病性優良品種の選別のために遺伝子チップ技術を提供し、魚類耐病性に優れた品種を選別するための、遺伝子チップによる育種という新しい道を切り開いた。 The gene chip of SNP locus related to flounder disease resistance provided by the present invention can be used for selection of flounder disease resistant individuals, and the actual selection accuracy is close to the theoretical value, so that the flounder disease resistance is excellent. The accuracy of selecting varieties can be improved and the breeding period can be shortened. This provided gene chip technology for the selection of excellent flounder disease-resistant varieties, and opened up a new path of breeding with gene chips for selecting fish disease-resistant varieties.

本発明はヒラメ耐病性優良品種育成の遺伝子チップの製造と応用方法を確立し、ヒラメ等魚類の耐病性優良品種の育成のために新たな分子育種の技術手段を提供することを目的とする。 An object of the present invention is to establish a method for producing and applying a gene chip for breeding excellent disease-resistant varieties of flatfish, and to provide a new technical means for molecular breeding for breeding excellent disease-resistant varieties of fish such as flatfish.

次に本発明が関する専門用語に対する説明は以下のとおりである:
SNP:SingleNucleotidePolymorphismの略語で、すなわち一塩基多型で、ゲノムレベルにおいてヌクレオチド単体の変異により引き起こされるDNA配列の多型である。
Next, the explanation for the technical terms related to the present invention is as follows:
SNP: An abbreviation for Single Nucleotide Polymorphism, a single nucleotide polymorphism, a polymorphism in a DNA sequence caused by mutation of a single nucleotide at the genomic level.

遺伝子チップ:微細加工技術を通し、数万乃至百万特定のDNA配列フラグメントを、シリコンウエハー・スライド等支持物に規則的に配置して固定し、構成される二次元DNAプローブアレーで、遺伝物質(DNA等)に遺伝子分類および分子検測を行うことができる。 Gene chip: A two-dimensional DNA probe array composed of tens of thousands to millions of specific DNA sequence fragments that are regularly placed and fixed on a support such as a silicon wafer or slide through microfabrication technology. Gene classification and molecular inspection can be performed on (DNA, etc.).

縮退塩基:コドンの縮退性に基づき、一つの記号をよく用いて不特定の二つまたはそれ以上の塩基を代替する。 Nucleic acid notation: Based on the degenerate property of a codon, one symbol is often used to substitute two or more unspecified bases.

例えばRはA/Gを示し、YはC/Tを示し、MはA/Cを示し、KはG/Tを示し、SはG/Cを示し、WはA/Tを示すなど。 For example, R indicates A / G, Y indicates C / T, M indicates A / C, K indicates G / T, S indicates G / C, W indicates A / T, and so on.

参考グループ:ゲノムの選択において、人為感染等検測を通して取得した表現型データを有するグループは、通常表現型性状を有する大型グループから選別されるグループ全体を代表する表現型分布で、かつゲノム再配列を行い、遺伝子型データを取得し、実際のゲノム選択計算を行う個体の集合である。 Reference group: In the selection of the genome, the group having the phenotypic data obtained through the inspection such as anthropogenic infection is the phenotypic distribution representing the whole group selected from the large group having the normal phenotypic properties, and the genome rearrangement. Is a set of individuals who perform the above, acquire genotype data, and perform actual genome selection calculation.

候補グループ:ゲノムの選択において、候補グループとはゲノム再配列を通し、遺伝子型データを取得したが、表現型データがないグループで、該グループは育種の潜在力を有し、後続の実際優良品種育成作業に用いる予定がある個体の集合である。 Candidate group: In the selection of the genome, the candidate group is a group in which genotype data is acquired through genome rearrangement but no phenotypic data, and the group has breeding potential and is a subsequent excellent cultivar. It is a set of individuals that are planned to be used for breeding work.

GBLUP:GenomicBestLinearUnbiasedPredictionの略語で、すなわちゲノム最良線形不偏推定量で、ゲノムにおける高密度な分子マーカーを用いて個体間の成因関係(Gマトリックス)を利用し、ゲノム育種値の推定する方法である。 GBLUP: An abbreviation for GenomicBestLinearUnbiasedPrescription, that is, a method for estimating a genomic breeding value by using a genetic relationship (G matrix) between individuals using a high-density molecular marker in the genome with a genomic best linear biased estimator.

GEBV:GenomicEstimatedBreedingValuesの略語で、すなわちゲノム推定育種価で、フルゲノムにおけるすべてのマーカーまたはハプロタイプの効果推定を加算して取得する。 GEBV: An abbreviation for Genomic Estimated Breeding Values, ie, the genomic estimated breeding value, obtained by adding up the effect estimates of all markers or haplotypes in the full genome.

次に実施例を組み合わせて本発明に詳細な記述を行う。 Next, a detailed description will be given to the present invention by combining examples.

実施例1:「魚チップ1号」遺伝子チップSNPローカスの選別およびチップ製造
1、ヒラメ耐エドワージエラタルダ参考グループの作成および表現型性状の測定
ヒラメゲノムが選定される参考グループと候補グループの個体はいずれも本課題チームが2003年以来作成したヒラメ家系より由来され、多年の育成過程において、次第に韓国・日本および中国のヒラメグループの急速な成長、耐病性や耐逆性等優良な性状より由来される。
Example 1: Selection of "fish chip No. 1" gene chip SNP locus and chip production 1. Creation of a reference group for flounder-resistant Edwardie latalda and measurement of phenotypic properties Individuals in the reference group and candidate group from which the flounder genome is selected Are all derived from the flounder family created by this task team since 2003, and gradually derived from the excellent properties such as rapid growth, disease resistance and reversal resistance of the flounder groups in Korea, Japan and China during the many years of breeding process. Will be done.

特に2013年から、ヒラメ養殖業におけるエドワージエラタルダが日増しに深刻化する情勢に対し、ヒラメ耐エドワージエラタルダ家系選別の研究を実施する。 In particular, from 2013, we will carry out research on the selection of flounder-resistant Edwardie latarda families in response to the increasingly serious situation in the flounder farming industry.

2013-2015年、当年に作成したヒラメ家系に腹腔接種人為感染エドワージエラタルダ検測を連続的に実施し、感染した検測魚苗に鰭棘を収集し、生長と耐病性表現型を測定し、2013年・2014年と2015年にサンプル4577匹・5942匹と6919匹を採取し、ヒラメエドワージエラタルダゲノムが選択される参考グループを選択して作成するために用いられるもの。 In 2013-2015, the flounder family created in the same year was continuously infused with peritoneal infection, and Edwardie eratalda was continuously examined. , 2013, 2014 and 2015, samples of 4757, 5942 and 6919 were collected and used to select and create a reference group from which the flounder Edwardia talda genome is selected.

感染検測のサンプルより、96家系(2013年に32、2014年に10、2015年に48)を選定し、各家系は死亡率に従って等比率の死亡と生存した個体10-15を選定し、ゲノムが選択される参考グループを組成し、選定した個体の感染検測の結果(死亡または生存)を参考グループの表現型性状(表2)とする。

Figure 2022518304000001
From the infection detection sample, 96 families (32 in 2013, 10 in 2014, 48 in 2015) were selected, and each family selected equal proportions of mortality and surviving individuals 10-15 according to the mortality rate. , A reference group in which the genome is selected is formed, and the result of infection detection (death or survival) of the selected individual is used as the phenotypic property of the reference group (Table 2).
Figure 2022518304000001

2、ヒラメフルゲノム再配列およびSNPローカス評価
ヒラメ参考グループはDNA抽出・検測を経た後、利用可能な個体を計931有する(表3)。

Figure 2022518304000002
2. Flounder full genome rearrangement and SNP locus evaluation The flounder reference group has a total of 931 individuals that can be used after DNA extraction and inspection (Table 3).
Figure 2022518304000002

931の参考グループの個体のゲノムDNAを抽出し、DNA検測が合格した後、次世代配列ライブラリを作成し、ライブラリ作成類型は両端DNAライブラリ(挿入フラグメント350bp)で、IlluminaHiseqX10配列プラットフォームを用いて配列とデータエクスポートを完成し、品質管理で取得された平均的なデータ量は2G/個体である。本課題チームが提供されるヒラメゲノム配列(GenBankID:PRJNA73673)を参考ゲノムとして、BWA(http://bio-bwa.sourceforge.net/)ソフトウェアを用いて配列比較を行い、その後Samtools(http://www.htslib.org/)ソフトウェアを用いてSNP予測と評価を行い、42.2MのSNP集合を取得する。 After extracting the genomic DNA of the individual of the reference group of 931 and passing the DNA inspection, the next-generation sequence library is created, and the library creation type is the double-ended DNA library (insertion fragment 350 bp), which is sequenced using the IlluminaHiseqX10 sequence platform. And the data export is completed, and the average amount of data acquired by quality control is 2G / individual. Using the Hirame genome sequence (GenBankID: PRJNA73673) provided by this task team as a reference genome, sequence comparison was performed using BWA (http://bio-bwa.sourceforge.net/) software, and then Samtools (http: /). /Www.https.org/) SNP prediction and evaluation are performed using software, and an SNP set of 42.2M is obtained.

3、SNPローカス評価と選別
(1)ステップ2で取得した42.2MのヒラメSNPマーカーに対し、以下の規格に選別を行う:
見逃し率>0.1、最小対立遺伝子頻度(MAF)<0.05のサイトを削除し、繰り返し配列または反復配列におけるサイトを削除し、ハドウィンバランスに適合しないサイトを削除し、3.4MのヒラメSNPマーカーを取得する。
3. SNP locus evaluation and selection (1) The 42.2M flatfish SNP marker acquired in step 2 is selected according to the following standards:
Sites with miss rate> 0.1, minimum allele frequency (MAF) <0.05 were removed, sites in repeats or repeats were removed, sites that did not fit the hadwin balance were removed, and 3.4 M. Obtain a flatfish SNP marker.

(2)ステップ(1)で選別した3.4M分子マーカーに対し、SNPローカス効果値解析と推定育種値計算を実施し、
ヒラメゲノム選択計算はBayesCπアルゴリズムを用い、解析モデル等式:


Figure 2022518304000003
モデルにおいて、yは表現型値で、uはグループ平均値で、グループqiはマーカー効果が正規分布qi~N(0、
Figure 2022518304000004
)で、mはマーカーの総数で、Xはqiに対応する関連マトリックスで、eは残差である。 (2) For the 3.4M molecular markers selected in step (1), SNP locus effect value analysis and estimated breeding value calculation were performed.
The flatfish genome selection calculation uses the BayesCπ algorithm, and the analysis model equation:


Figure 2022518304000003
In the model, y is a phenotypic value, u is a group mean value, and group qi has a normal distribution of marker effects qi to N (0,
Figure 2022518304000004
), M is the total number of markers, X is the related matrix corresponding to qi, and e is the residual.

R言語パックが提供されるBayesCπアルゴリズムを用い、組み合わせ済みの遺伝子型データgenotype.csvと表現型データphonetype.csvを組み合わせ、フルゲノム再配列の参考グループの計931のヒラメ個体にゲノム選択計算を行う。その後取得したSNPローカス効果値を最大から最小への並べ替え、推定育種値<10-5のサイトを削除し、計864229のSNPローカスを取得して遺伝子チップSNPローカスの選定に用いられる。 Combined genotype data genotype. Using the BayesCπ algorithm provided by the R language pack. csv and phenotypic data phonetype. Combining csv, genome selection calculation is performed on a total of 931 flatfish individuals in the reference group for full genome rearrangement. After that, the acquired SNP locus effect values are rearranged from the maximum to the minimum, sites with estimated breeding values <10-5 are deleted, and a total of 864229 SNP locus are acquired and used for selection of gene chip SNP locus.

(3)さらにAffymetrixAxiom遺伝子分類プローブ設計生物解析プロセスを用いてステップ(2)で選別したSNPにそれぞれプローブ設計と評価を行い、プローブ変換可能性評価点数<0.6のサイトを削除する。また、SNPがゲノム全体をカバーしかつ均等に分布され、SNPのフランキング配列35bp内に他のSNPが存在せず、SNPのフランキング配列35bpのGC含有量が30-70%で、最終的に48697のヒラメSNPマーカーを選別してチップ製造に用いられることを保証し、前記48697のSNP分子マーカーの配列記録は配列リストにある。
米国ThermoFisher社製AffymetrixAxiomチップの生産技術を用いてヒラメSNPチップ(遺伝子チップ)を製造し、延べ48697のヒラメSNPローカスを含み、各チップは24のサンプルを同時に検測することができる。
(3) Further, probe design and evaluation are performed on each SNP selected in step (2) using the AffymetrixAxiom gene classification probe design biological analysis process, and sites having a probe conversion possibility evaluation score <0.6 are deleted. In addition, the SNP covers the entire genome and is evenly distributed, there are no other SNPs in the SNP flanking sequence 35bp, and the GC content of the SNP flanking sequence 35bp is 30-70%, which is the final. The 48697 SNP molecular markers are selected and guaranteed to be used in chip production, and the sequence record of the 48697 SNP molecular markers is in the sequence list.
A flatfish SNP chip (gene chip) is produced using the production technology of the AffymetrixAxiom chip manufactured by Thermo Fisher, USA, and contains a total of 48697 flatfish SNP locus, and each chip can simultaneously inspect 24 samples.

実施例2、「魚チップ1号」遺伝子チップの使用方法
1、チップ検測用サンプルの製造と交雑
少量のヒラメ鰭棘(米の粒ほどの大きさ)を採取し、DNA抽出キット(中国・天根)を利用して鰭棘ゲノムDNAを抽出し、1%のアガロースゲル電気泳動と核酸分光光度計を利用してDNA品質と濃度を検測し、最終的な合格サンプルの規格は以下のとおりである:電気泳動はDNAに単一のストリップが生じることを求め、フラグメント長さが10kbより大きく、完全性に優れ、生分解は生じず、サンプル品質のDNA検測結果。紫外分光光度計を用いてA260/280を検測する:1.8-2.0、A260/230>1.5、濃度が20ng/μlより低くなく、総量が4μgより小さくない。その後米国ThermoFisher社製SNPチップ検測サンプルの製造標準操作プロセスに従って(https://www.thermofisher.com/)チップ検測用サンプルの製造を行う。1.4ug以上の高品質なDNAテンプレートを2ml*96の深穴板に加え、変性剤を加えて常温変性を行い、変性10min後に変性を停止し、単鎖DNAを取得する。チップサイト増幅用48697ペアプライマーと等温増幅酵素・dNTP等を深穴板に加え、深穴板を封止し、37℃で等温増幅を行う。24h増幅した後、増幅産物を断片化し、等体積のイソプロパノールを加え、-20℃の冷蔵庫において沈殿する。24h沈殿した後、4℃、3000gを用いて遠心してDNA産物の沈殿物を取得し、37℃で残りのイソプロパノールを除去し、沈殿物を溶解させ、交雑液を取得し、交雑液は5%のゲル電気泳動の検測結果の質量を用い、増幅産物品質の検測結果は、ストリップが明晰で、輝度が高い。交雑液は温度制御増幅装置を用いて交雑を行い、条件が95℃10min、48℃3minで、その後交雑液を持続的に48℃に維持する。チップブロックを交雑液に浸漬し、48℃の交雑炉において24h交雑させ、その後溶出、蛍光タンパク質を接続し、蛍光タンパク質を固定させ、交雑プローブ、蛍光信号をスキャンすることにより、各信号点は一つのプローブ交雑の結果で、各サイトの交雑結果を取得した後、チップスキャンの結果はAxiomAnalysisSuite(AxAS)ソフトウェア(米国ThermoFisher社製)を用いて解析を行う。
Example 2, How to use the "fish chip No. 1" gene chip 1. Production and crossing of a sample for chip inspection A small amount of fin spines (about the size of a grain of rice) was collected and a DNA extraction kit (China, China). The fin spine genomic DNA was extracted using Amane), and the DNA quality and concentration were measured using 1% agarose gel electrophoresis and a nucleic acid spectrophotometer. As is: Electrophoresis requires that a single strip be formed in the DNA, the fragment length is greater than 10 kb, the completeness is excellent, no biodegradation occurs, and the sample quality DNA test results. A260 / 280 is measured using an ultraviolet spectrophotometer: 1.8-2.0, A260 / 230> 1.5, the concentration is not less than 20 ng / μl and the total amount is not less than 4 μg. After that, a sample for chip inspection is manufactured according to a standard operation process for manufacturing an SNP chip inspection sample manufactured by Thermo Fisher, USA (https://www.thermovisher.com/). A high-quality DNA template of 1.4 ug or more is added to a 2 ml * 96 deep hole plate, denaturation is performed by adding a denaturant, and denaturation is stopped after 10 minutes of denaturation to obtain single-stranded DNA. A 48697 pair primer for chip site amplification and an isothermal amplification enzyme, dNTP, etc. are added to the deep hole plate, the deep hole plate is sealed, and isothermal amplification is performed at 37 ° C. After amplification for 24 hours, the amplified product is fragmented, equal volumes of isopropanol are added and precipitated in a refrigerator at −20 ° C. After precipitating for 24 hours, centrifuge at 4 ° C. and 3000 g to obtain a precipitate of DNA product, remove the remaining isopropanol at 37 ° C., dissolve the precipitate, obtain a hybrid solution, and the hybrid solution is 5%. Using the mass of the gel electrophoresis test result, the amplification product quality test result shows that the strip is clear and the brightness is high. The crossed liquid is crossed using a temperature control amplification device, and the conditions are 95 ° C. for 10 min and 48 ° C. for 3 min, and then the crossed liquid is continuously maintained at 48 ° C. By immersing the chip block in a crossing solution and crossing it in a crossing furnace at 48 ° C. for 24 hours, then elution, connecting the fluorescent protein, fixing the fluorescent protein, and scanning the crossing probe and fluorescent signal, each signal point becomes one. After acquiring the crossing results of each site from the results of the two probe crosses, the results of the chip scan are analyzed using the AxiomAnalysisSuit (AxAS) software (manufactured by Thermo Fisher, USA).

2、チップ検測と遺伝子分類データ解析
(1)被検測グループサンプルの採取とDNA抽出
再配列済みの一部ヒラメ個体DNAを選択し、前記チップを用いて検測を実施し、チップ遺伝子分類の正確性および再配列と遺伝子チップ分類を用いて取得した遺伝子型データにゲノム選択計算を実施する再現性に検測を行う。その後ヒラメ育種の選択過程に、家系が用いられる候補成体の個体を確立し、ゲノムDNAの抽出を行いかつチップで検測を実施し、遺伝子チップがヒラメゲノム選択育種における応用効果を検証する。採用個体の情報は表4による。

Figure 2022518304000005
Figure 2022518304000006
Figure 2022518304000007
2. Chip inspection and gene classification data analysis (1) Collection of test group sample and DNA extraction Select some rearranged individual fluffy DNA, perform inspection using the chip, and classify the chip gene. The accuracy and reproducibility of performing genome selection calculations on the genotype data obtained using the rearrangement and gene chip classification will be checked. After that, in the selection process of flatfish breeding, an individual of a candidate adult whose family line is used is established, genomic DNA is extracted and inspection is performed with a chip, and the applied effect of the gene chip in flatfish genome selective breeding is verified. Information on the adopted individuals is shown in Table 4.
Figure 2022518304000005
Figure 2022518304000006
Figure 2022518304000007

(2)チップ検測
遺伝子チップ検測の標準プロセスに従い、AffymetrixGeneTitan遺伝子チップ処理システムを用いてプローブ交雑・染色とチップスキャンを完成する。具体的な操作方法は以下のとおりである:4μgの高品質なDNAテンプレートを2ml*96の深穴板に加え、変性剤を加えて変性(28℃)を行い、変性10min後に変性停止液(反応時間が10minより長くない)変性を停止し、単鎖DNAを取得する。チップサイト増幅用48697ペアプライマーと等温増幅酵素・dNTPと反応液等を深穴板に加え、深穴板を封止し、37℃で等温増幅を22-36h行う。好ましくは24h増幅した後、高温65℃で20-30minで反応液を不活性化させ、その後37℃の培養箱に移転して40min培養し、断片化酵素と反応液[41}を加え、断片化増幅産物を断片化し、既存の反応液と等体積のイソプロパノールを加え、反応液が澄むまで反応液を均一に混合させ、その後-20℃の冷蔵庫において産物を沈殿させる。24h沈殿した後、4℃、3、000gで40-60min遠心してDNA産物の沈殿物を取得し、上澄液を除去し、沈殿物を保留し、37℃で残りのイソプロパノールを完全に除去し、沈殿物を溶解させ、交雑液を取得する。交雑液は温度制御増幅装置を用いて交雑を行い、条件が95℃10min、48℃3minで、その後交雑液を持続的に48℃に維持する。チップブロックを交雑液に浸漬し、48℃の交雑炉において24h交雑させる。その後溶出、蛍光タンパク質を接続し、蛍光タンパク質を固定させ、交雑プローブ、蛍光信号をスキャンすることにより、各サイトの交雑結果を取得し、チップスキャン結果はAxiomAnalysisSuite(AxAS)ソフトウェア(米国ThermoFisher社製)を用いて解析を行う。
(2) Chip inspection According to the standard process of gene chip inspection, probe crossing / staining and chip scanning are completed using the Affymetrix GeneTitan gene chip processing system. The specific operation method is as follows: Add 4 μg of high-quality DNA template to a 2 ml * 96 deep hole plate, add a denaturing agent to perform denaturation (28 ° C.), and after 10 minutes of denaturation, a denaturation stop solution ( (Reaction time is not longer than 10 min) Stop denaturation and obtain single-stranded DNA. A 48697 pair primer for chip site amplification, an isothermal amplification enzyme, dNTP, a reaction solution, etc. are added to the deep hole plate, the deep hole plate is sealed, and isothermal amplification is performed at 37 ° C. for 22-36 hours. Preferably, after amplification for 24 hours, the reaction solution is inactivated at a high temperature of 65 ° C. for 20-30 min, then transferred to a culture box at 37 ° C. and cultured for 40 minutes, and the fragmentation enzyme and the reaction solution [41} are added to the fragment. The chemical amplification product is fragmented, the same volume of isopropanol as the existing reaction solution is added, the reaction solution is uniformly mixed until the reaction solution is clear, and then the product is precipitated in a refrigerator at −20 ° C. After precipitation for 24 hours, centrifuge at 4 ° C. at 3,000 g for 40-60 min to obtain a DNA product precipitate, remove the supernatant, retain the precipitate, and completely remove the remaining isopropanol at 37 ° C. , Dissolve the precipitate and obtain a hybrid. The crossed liquid is crossed using a temperature control amplification device, and the conditions are 95 ° C. for 10 min and 48 ° C. for 3 min, and then the crossed liquid is continuously maintained at 48 ° C. The chip block is immersed in a crossing solution and crossed in a crossing furnace at 48 ° C. for 24 hours. After that, by elution, connecting the fluorescent protein, fixing the fluorescent protein, scanning the crossing probe and the fluorescent signal, the crossing result of each site is obtained, and the chip scan result is the Axiom Analysis Suite (AxAS) software (manufactured by Thermo Fisher, USA). Is used for analysis.

(3)データ解析
AxASソフトウェア(米国ThermoFisher社製)を利用してチップスキャンの結果を解析し、各サンプルの遺伝子分類結果を取得する。解析結果によると、チップの平均的な分類率が98.77%で、分類効果が抜群である。そのうち、高品質なSNP比率が74.61%で、各サンプルはいずれも高品質な分類情報を生成することができる。
(3) Data analysis The result of chip scan is analyzed using AxAS software (manufactured by Thermo Fisher, USA), and the gene classification result of each sample is obtained. According to the analysis results, the average classification rate of chips is 98.77%, and the classification effect is outstanding. Among them, the high-quality SNP ratio is 74.61%, and each sample can generate high-quality classification information.

実施例3「魚チップ1号」遺伝子チップがヒラメ耐病性育種における応用
1、「魚チップ1号」遺伝子チップ分類の効果検証
参考グループから一部の個体を選定して遺伝子チップ分類の信頼性を検証するために用いられるもの、これらの選定される個体は再配列の遺伝子型でもあり、「魚チップ1号」遺伝子チップ分類を用いて取得される遺伝子型でもある。発明者の既存のヒラメ参考グループから一部の個体遺伝子チップを選定して分類・統計を行う。チップを用いて取得した遺伝子型(0/1/2はAA/Aa/aaを示す)と再配列して取得した遺伝子型の一致性および再配列とチップデータを利用して推定したGEBVの関連係数を統計することによりチップ分類の効果を評価する。分類結果の一致性が88%以上に達しかつGEBVの間の関連係数が0.9以上に達する場合、チップが優れた分類結果を有すると見なす。
Example 3 Application of "Fish Chip No. 1" Gene Chip to Disease-Resistant Breeding 1. Verification of Effect of "Fish Chip No. 1" Gene Chip Classification Select some individuals from the reference group to improve the reliability of gene chip classification. Used for validation, these selected individuals are also genotypes of rearrangement and genotypes obtained using the "Fish Chip No. 1" gene chip classification. Some individual gene chips are selected from the inventor's existing flounder reference group for classification and statistics. Consistency between genotypes obtained using chips (0/1/2 indicates AA / Aa / aa) and genotypes obtained by rearrangement, and the relationship between rearrangements and GEBV estimated using chip data Evaluate the effect of chip classification by statistic of coefficients. If the consistency of the classification results reaches 88% or higher and the correlation coefficient between GEBV reaches 0.9 or higher, the chip is considered to have excellent classification results.

解析結果によると、「魚チップ1号」遺伝子チップ分類を利用して取得したサイト情報の90.08%は再配列と同様で、2組のGEBVの間の関連係数は0.958である。そのため、本発明が開発されるヒラメ遺伝子チップの分類結果は再配列と基本的に一致し、ヒラメに正確な遺伝子分類を行うことができる。 According to the analysis results, 90.08% of the site information obtained using the "Fish Chip No. 1" gene chip classification is similar to the rearrangement, and the correlation coefficient between the two sets of GEBV is 0.958. Therefore, the classification result of the flounder gene chip for which the present invention is developed basically matches the rearrangement, and accurate gene classification can be performed on the flounder.

具体的な操作方法は以下のとおりである:
PLINKソフトウェアを利用してチップデータを読み取り、サーバーにおいて以下のコマンドを入力して上記データに処理を行う:
plink --vcf op2-1.vcf --make-bed --out op_Val_1
plink --vcf cs2-2.vcf --make-bed --out op_Val_2
plink --vcf op2-3.vcf --make-bed --out op_Val_3
plink --vcf op2-4.vcf --make-bed --out op_Val_4
The specific operation method is as follows:
Read the chip data using the LINK software and enter the following command on the server to process the above data:
pink --vcf op2-1. vcf --make-bed --out op_Val_1
pink --vcf cs2-2. vcf --make-bed --out op_Val_2
pink --vcf op2-3. vcf --make-bed --out op_Val_3
pink --vcf op2-4. vcf --make-bed --out op_Val_4

読み取りを経て、4つのvcfにおける情報は表5による:

Figure 2022518304000008
After reading, the information in the four vcfs is according to Table 5.
Figure 2022518304000008

a)RにおいてSNPを再命名しかつ4つのファイルにおける共通となるマーカー情報を抽出し、コマンドは以下のとおりである:
#必要なRパッケージをアンロードする
library(data.table)
#cs_Val_1とcs_Val_2のサイト情報を読み取る
val_1 <- fread(“op_Val_1.bim”, header = F)
val_2 <- fread(“op_Val_2.bim”, header = F)
val_3 <- fread(“op_Val_3.bim”, header = F)
val_4 <- fread(“op_Val_4.bim”, header = F)
#SNP命名方式を統一しかつ再命名したファイルを出力する
val_1$V2 <- paste(paste(rep(“rs”, nrow(val_1)), val_1$V1, sep =“ ”), val_1$V4, sep = “:”)
val_2$V2 <- paste(paste(rep(“rs”, nrow(val_2)), val_2$V1, sep = “ ”), val_2$V4, sep = “:”)
val_3$V2 <- paste(paste(rep(“rs”, nrow(val_3)), val_3$V1, sep =“ ”), val_3$V4, sep = “:”)
val_4$V2 <- paste(paste(rep(“rs”, nrow(val_4)), val_4$V1, sep =“ ”), val_4$V4, sep = “:”)
write.table(val_1, “op_Val_1.bim”, sep = “¥t”, col.names = F, row.names = F, quote = F)
write.table(val_2, “op_Val_2.bim”, sep = “¥t”, col.names = F, row.names = F, quote = F)
write.table(val_3, “op_Val_3.bim”, sep = “¥t”, col.names = F, row.names = F, quote = F)
write.table(val_4, “op_Val_4.bim”, sep = “¥t”, col.names = F, row.names = F, quote = F)
#共通となるサイト情報を抽出しかつ出力する
comm <- Reduce(intersect, list (a = val_1$V2, b = val_2$V2, c = val_3$V2, d = val_4$V2))
write.table(comm, “common_snps.txt”, sep = “¥t”, col.names = F, row.names = F, quote = F)
a) Rename the SNP in R and extract the common marker information in the four files, the command is:
# Unload the required R package library (data.table)
# Read the site information of cs_Val_1 and cs_Val_1 val_1 <-fread (“op_Val_1.bim”, header = F)
val_1 <-fread ("op_Val_2.bim", header = F)
val_3 <-fread (“op_Val_3.bim”, header = F)
val_4 <-fread ("op_Val_4.bim", header = F)
# Outputs a file with a unified SNP naming method and renamed. sep = “:”)
val_2 $ V2 <-paste (paste (rep ("rs", now (val_2)), val_2 $ V1, sep = ""), val_2 $ V4, sep = ":")
val_3 $ V2 <-paste (paste (rep ("rs", now (val_3)), val_3 $ V1, sep = ""), val_3 $ V4, sep = ":")
val_4 $ V2 <-paste (paste (rep ("rs", now (val_4)), val_4 $ V1, sep = ""), val_4 $ V4, sep = ":")
write. table (val_11, "op_Val_1.bim", sep = "\ t", col.names = F, low.names = F, quote = F)
write. table (val_2, "op_Val_2.bim", sep = "\ t", col.names = F, low.names = F, quote = F)
write. table (val_3, "op_Val_3.bim", sep = "\ t", col.names = F, low.names = F, quote = F)
write. table (val_4, "op_Val_4.bim", sep = "¥ t", col.names = F, low.names = F, quote = F)
# Extract and output common site information com <-Reduction (intersect, list (a = val_1 $ V2, b = val_2 $ V2, c = val_3 $ V2, d = val_4 $ V2))
write. table (com, “common_sns.txt”, sep = “¥ t”, coll.names = F, row.names = F, quote = F)

b)PLINKソフトウェアを利用して4つのファイルを組み合わせ、かつ共通となるマーカーを保留し、コマンドは以下のとおりである:
plink --bfile op_Val_1 --merge-list merge_op.txt --extract common_snps.txt --recode A --out op_chip
ファイル「merge_op.txt」に以下の情報が貯蔵されている:
op_Val_2.bed op_Val_2.bim op_Val_2.fam
op_Val_3.bed op_Val_3.bim op_Val_3.fam
op_Val_4.bed op_Val_4.bim op_Val_4.fam
b) Combine the four files using the PLLINK software and reserve the common marker, the command is:
pink --bfile op_Val_1 --merge-list merge_op. pxt --- extract common_snps. pxt --recode A --out op_chip
The following information is stored in the file "merge_op.txt":
op_Val_2. bed op_Val_2. Bim op_Val_2. fam
op_Val_3. bed op_Val_3. Bim op_Val_3. fam
op_Val_4. bed op_Val_4. Bim op_Val_4. fam

c)PLINKソフトウェアを利用して参考グループから同様である個体とサイトを抽出し、上記の4つのファイルにおける.fam情報をファイルに整理しかつ「op_chip_indi.txt」と命名し、「…」はファイルカタログを表し、コマンドは以下のとおりである:
plink --bfile …/Val_ref --keep op_chip_indi.txt --extract common_snps.txt --recode A --out op_rseq
処理を経て、上記4つのファイルが共通となるマーカー数が11、719で、参考グループにおいて検索可能な個体数が95である。
c) Extract similar individuals and sites from the reference group using the PLLINK software, and in the above four files. Organize the fam information into a file and name it "op_chip_indi.txt", where "..." represents the file catalog and the command is:
pink --- bfile ... / Val_ref --- keep op_chip_indi. pxt --- extract common_snps. pxt --recode A --out op_rseq
After the processing, the number of markers common to the above four files is 11,719, and the number of individuals that can be searched in the reference group is 95.

d)R統計チップと再配列分類の一致性を利用し、その方法は以下のとおりである:
#必要なRパッケージをアンロードする
library(data.table)
#それぞれチップと再配列が取得した分類情報を読み取る
chip <- fread(“op_chip.raw”)
rseq <- fread(“op_rseq.raw”)
#ファイルにおける個体配列を統一する
fid <- data.frame(rseq$FID)
colnames(fid) <- “FID”
chip <- data.table(merge(fid, chip, sort = F))
#ファイルにおける最初の6列を削除し、かつ遺伝子型を出力する
chip[, c(1: 6):= NULL]
rseq[, c(1: 6):= NULL]
fwrite(chip, “geno_op_chip.csv”, sep = “ ,”, row.names = F, quote = F)
fwrite(rseq, “geno_op_rseq.csv”, sep = “ ,”, row.names = F, quote = F)
#一致性を統計する
sum(chip == rseq) / (nrow(chip) * ncol(chip)) * 100
統計によると、上記95の個体は計1、113、305のマーカーを有し、完全に同様であるマーカー数が1、002、829であることがわかった。そのため、チップと再配列分類の結果は90.08%完全に一致する。
d) Utilizing the match between the R statistical chip and the rearrangement classification, the method is as follows:
# Unload the required R package library (data.table)
# Chip <-fread (“op_chip.raw”) that reads the classification information acquired by the chip and rearrangement, respectively.
rseq <-fread ("op_rseq.raw")
# Unify individual sequences in files fid <-data. frame (rseq $ FID)
colnames (fid) <-"FID"
chip <-data. table (merge (fid, chip, sort = F))
# Delete the first 6 columns in the file and output the genotype chip [, c (1: 6): = NULL]
rseq [, c (1: 6): = NULL]
file (chip, “geno_op_chip.csv”, sep = “,”, low.names = F, quote = F)
fwrite (rseq, “geno_op_rseq.csv”, sep = “,”, low.names = F, quote = F)
# Statistical consistency sum (chip == rseq) / (nlow (chip) * ncol (chip)) * 100
According to statistics, it was found that the 95 individuals had a total of 1,113,305 markers, and the number of markers that were completely similar was 1,002,829. Therefore, the results of chip and rearrangement classification are 90.08% in perfect agreement.

e)GEBV推定の正確性を保証するために、PLINKソフトウェアを利用して参考グループにおける残りの個体の遺伝子型を抽出し、コマンドは以下のとおりである:
plink --bfile …/Val_ref --remove op_chip_indi.txt --extract common_snps.txt --recode A --out ref
e) To ensure the accuracy of the GEBV estimation, genotypes of the remaining individuals in the reference group were extracted using the PLLINK software, and the commands are:
pink --bfile… / Val_ref --remove op_chip_indi. pxt --- extract common_snps. pxt --recode A --out ref

f)Rを利用して参考グループを合併するおよび個体の遺伝子型を検証し、その方法は以下のとおりである:
#必要なRパッケージをアンロードする
library(data.table)
#それぞれチップと再配列が取得した分類情報を読み取る
chip <- as.matrix(fread(“geno_op_chip.csv”))
rseq <- as.matrix(fread(“geno_op_rseq.csv”))
ref <- fread(“ref.raw”)
#ファイルにおける最初の6列を削除する
ref[, c(1: 6):= NULL]
ref <- as.matrix(ref)
#遺伝子型ファイルを組み合わせかつ組み合わせた後の遺伝子型ファイルを出力する
geno_chip <- rbind(chip, ref)
geno_rseq <- rbind(rseq, ref)
write.table(geno_chip, “geno_Val_Chip.csv”, sep = “ ,”, row.names = F, quote = F)
write.table(geno_rseq, “geno_Val_Rseq.csv”, sep = “ ,”, row.names = F, quote = F)
f) Use R to merge reference groups and verify genotypes of individuals, the method of which is:
# Unload the required R package library (data.table)
# Read the classification information acquired by the chip and rearrangement, respectively. Chip <-as. matrix (fread (“geno_op_chip.csv”))
rseq <-as. matrix (fread (“geno_op_rseq.csv”))
ref <-fread (“ref.raw”)
# Delete the first 6 columns in the file ref [, c (1: 6): = NULL]
ref <-as. matrix (ref)
# Combine genotype files and output the genotype file after the combination geno_chip <-rbind (chip, ref)
geno_rseq <-rbind (rseq, ref)
write. table (geno_chip, “geno_Val_Chip.csv”, sep = “,”, low.names = F, quote = F)
write. table (geno_rseq, “geno_Val_Rseq.csv”, sep = “,”, low.names = F, quote = F)

g)g)において取得した2つのxxx.csvファイルがRにおいて加重GBLUP方法を用いてGEBVを推定する。具体的な操作方法は以下のとおりである(Linux環境):
#必要なRパッケージと関数をアンロードする
library(parallel)
library(data.table)
library(asreml)
library(pROC)
source(“ginv.R”)
関数ginvの定義は以下のとおりである:
ginv <- function(invG) {
Ginv <- data.frame(row = rep(1: nrow(invG), nrow(invG)), column = rep(1: nrow(invG), each = nrow(invG)), value = as.numeric(invG), lower.mat = as.logical(lower.tri(invG, diag = T)))
Ginv <- Ginv[Ginv$lower.mat == T, c(“row”, “column”, “value”)]
Ginv <- Ginv[order(Ginv$row, Ginv$column), ]
return(Ginv)

#遺伝子型、表現型情報を読み取る
geno_chip <- as.matrix(fread(“geno_Val_Chip.csv”, nThread = 10))
geno_rseq <- as.matrix(fread(“geno_Val_Rseq.csv”, nThread = 10))
pheno <- asreml.read.table(“pheno_op_Val.csv”, header=T, sep=“ ,”)
rnames <- as.matrix(fread(“pheno_op_Val.csv”))[, 1]
M_1 <- geno_chip
M_2 <- geno_rseq
#各サイトの二次等位遺伝子頻度を計算する
pi_1 <- round(colSums(M_1) / (2 * nrow(M_1)), 3)
pi_2 <- round(colSums(M_2) / (2 * nrow(M_2)), 3)
#Pマトリックスを構築する
P_1 <- matrix(2 * pi_1, byrow = T, nrow = nrow(M_1), ncol = ncol(M_1))
P_2 <- matrix(2 * pi_2, byrow = T, nrow = nrow(M_2), ncol = ncol(M_2))
#Zマトリックスを構築する
Z_1 <- as.matrix(M_1 - P_1)
Z_2 <- as.matrix(M_2 - P_2)
#等式分子の項を構築する
ZZt_1 <- do.call(‘rbind’, mclapply(1: nrow(Z_1), FUN = function(x) {tcrossprod(Z_1[x, ], Zt_1)}, mc.cores = 20))
ZZt_2 <- do.call(‘rbind’, mclapply(1: nrow(Z_2), FUN = function(x) {tcrossprod(Z_2[x, ], Zt_2)}, mc.cores = 20))
#等式分母の項を構築する
denom_1 <- 2 * (sum(pi_1 * (1 - pi_1)))
denom_2 <- 2 * (sum(pi_2 * (1 - pi_2)))
#Gマトリックスを構築する
G_ chip <- ZZt_1 / denom_1
G_ rseq <- ZZt_2 / denom_2
diag(G_chip) <- diag(G_chip) + 0.01
diag(G_rseq) <- diag(G_rseq) + 0.01
#Gマトリックスインバージョン
invG_chip0 <- solve(G_chip)
invG_rseq0 <- solve(G_rseq)
#GマトリックスインバージョンをASRemlが利用可能な三列の形式に変換し、かつ出力する
Ginv_chip <- ginv(invG_chip)
Ginv_rseq <- ginv(invG_rseq)
write.table(Ginv_chip, “Ginv_op_Val_Chip_at_iter_0.csv”, sep = “ ,”, row.names = F, quote = F)
write.table(Ginv_rseq, “Ginv_op_Val_Rseq_at_iter_0.csv”, sep = “ ,”, row.names = F, quote = F)
#GEBVを計算する
attr(Ginv_chip, “rowNames”) <- paste(rnames)
attr(Ginv_rseq, “rowNames”) <- paste(rnames)
WGBLUP1 <- asreml(status ~ 1,
random = ~giv(IID),
ginverse = list(IID = Ginv_chip),
rcov = ~units,
family = asreml.binomial(link = “logit”)
data = pheno,
maxiter = 100,
na.method.X = ‘omit’)
gebv01 <- coef(WGBLUP1)$random
write.table(gebv01, “GEBVs_at_iter_0_chip.csv”, sep = “ ,”, row.names = F, quote = F)

WGBLUP2 <- asreml(status ~ 1,
random = ~giv(IID),
ginverse = list(IID = Ginv_rseq),
rcov = ~units,
family = asreml.binomial(link = “logit”),
data = pheno,
maxiter = 100,
na.method.X = ‘omit’)
gebv02 <- coef(WGBLUP2)$random
write.table(gebv02, “GEBVs_at_iter_0_rseq.csv”, sep = “ ,”, row.names = F, quote = F)

#GBLUPプロセス・チップ部分を加重し、forサイクルを用い、6回の加重反復過程を実行する
#マーカーの効果値を推定する
Zt1 <- t(Z_1)
p1 <- do.call(‘rbind’, mclapply(1: nrow(Zt1), FUN = function(x) {Zt1[x, ] %*% invG01}, mc.cores = 5))
p2 <- p1 %*% gebv01
u01 <- p2 / pp_1
Varu01 <- u01 * u01 * 2 * pi_1 * (1 - pi_1)
write.table(u01, “ markerEff_chip_at_iter_0.csv”, sep = “ ,”, row.names = F, quote = F)
write.table(Varu01, “markerVar_chip_at_iter_0.csv”, sep = “ ,”, row.names = F, quote = F)
#各マーカーの重みを推定する
D <- rep(1, ncol(M_1))
D0 <- rep(1, ncol(M_1))
pre_D <- D0
u <- u01
cal <- 6
inter <- 1
for(t in 1: cal) {
d <- as.vector(u * u * 2 * pi_1 * (1 - pi_1))
for(i in 1: (length(d) / inter)) {
di <- mean(d[(inter * (i - 1) + 1): (inter * i)])
pre_D[(inter * (i - 1) + 1): (inter * i)] <- di

D <- sum(D0) / sum(pre_D) * pre_D
write.table(D, file = paste(“weights_chip_at_iter_”, t, “ .csv”, sep =“ ”), sep=“ ,”, row.names=F, quote = F)
#加重Gマトリックスを推定する
p1 <- do.call(‘cbind’, mclapply(1: ncol(Z_1), FUN = function(x) {Z_1[, x] * D[x]}, mc.cores = 5))
p2 <- do.call(‘rbind’, mclapply(1: nrow(p1), FUN = function(x) {p1[x, ] %*% Zt1}, mc.cores = 5))
G <- p2 / pp_1
write.table(G, file = paste(“G_chip_at_iter_”, t, “.csv”, sep =“ ”), sep=“ ,”, row.names=F, quote = F)
diag(G) <- diag(G) + 0.01
invG <- solve(G)
write.table(invG, file = paste(“invG_chip_at_iter_”, t, “.csv”, sep =“ ”), sep=“,”, row.names=F, quote = F)
Ginv <- ginv(invG)
write.table(Ginv, file = paste(“Ginv_chip_at_iter_”, t, “.csv”, sep =“ ”), sep=“ ,”, row.names=F, quote = F)
attr(Ginv,“rowNames”) <- paste(rnames)
WGBLUP <- asreml(status ~ 1,
random = ~giv(IID),
ginverse = list(IID = Ginv),
rcov = ~units,
family = asreml.binomial(link = “logit”),
data = pheno,
maxiter = 100,
na.method.X =‘omit’)
gebv <- coef(WGBLUP)$random
write.table(gebv, file = paste(“GEBVs_chip_at_iter_”, t, “.csv”, sep =“ ”), sep = “ ,”, row.names = F, quote = F)
#加重後マーカーの効果値を推定する
p1 <- do.call(‘rbind’, mclapply(1: nrow(Zt1), FUN = function(x) {D[x] * Zt1[x, ]}, mc.cores = 5))
p2 <- do.call(‘rbind’, mclapply(1: nrow(p1), FUN = function(x) {p1[x, ] %*% invG}, mc.cores = 5))
p3 <- do.call(‘rbind’, mclapply(1: nrow(p2), FUN = function(x) {p2[x, ] %*% gebv01}, mc.cores = 5))
u <- p3 / pp_1
Varu <- u * u * 2 * pi_1 * (1 - pi_1)
write.table(u, file = paste(“markerEff_chip_at_iter_”, t, “.csv”, sep =“ ”), sep = “ ,”, row.names = F, quote = F)
write.table(Varu, file = paste(“ markerVar_chip_at_iter_”, t, “.csv”, sep =“ ”), sep = “ ,”, row.names = F, quote = F)

#GBLUPプロセス・再配列部分を加重し、forサイクルを用い、6回の加重反復過程を実行する
Zt2 <- t(Z_2)
p1 <- do.call(‘rbind’, mclapply(1: nrow(Zt2), FUN = function(x) {Zt2[x, ] %*% invG02}, mc.cores = 5))
p2 <- p1 %*% gebv02
u02 <- p2 / pp_2
Varu02 <- u02 * u02 * 2 * pi_2 * (1 - pi_2)
write.table(u02, “ markerEff_rseq_at_iter_0.csv”, sep = “ ,”, row.names = F, quote = F)
write.table(Varu02, “ markerVar_rseq_at_iter_0.csv”, sep = “ ,”, row.names = F, quote = F)
#各マーカーの重みを推定する
D <- rep(1, ncol(M_2))
D0 <- rep(1, ncol(M_2))
pre_D <- D0
u <- u02
cal <- 6
inter <- 1
for(t in 1: cal) {
d <- as.vector(u * u * 2 * pi_2 * (1 - pi_2))
for(i in 1: (length(d) / inter)) {
di <- mean(d[(inter * (i - 1) + 1): (inter * i)])
pre_D[(inter * (i - 1) + 1): (inter * i)] <- di

D <- sum(D0) / sum(pre_D) * pre_D
write.table(D, file = paste(“ weights_rseq_at_iter_”, t, “ .csv”, sep =“ ”), sep=“ ,”, row.names=F, quote = F)
#加重Gマトリックスを推定する
p1 <- do.call(‘cbind’, mclapply(1: ncol(Z_2), FUN = function(x) {Z_2[, x] * D[x]}, mc.cores = 5))
p2 <- do.call(‘rbind’, mclapply(1: nrow(p1), FUN = function(x) {p1[x, ] %*% Zt2}, mc.cores = 5))
G <- p2 / pp_2
write.table(G, file = paste(“G_rseq_at_iter_”, t, “.csv”, sep =“ ”), sep=“ ,”, row.names=F, quote = F)
diag(G) <- diag(G) + 0.01
invG <- solve(G)
write.table(invG, file = paste(“invG_rseq_at_iter_”, t, “ .csv”, sep =“ ”), sep=“ ,”, row.names=F, quote = F)
Ginv <- ginv(invG)
write.table(Ginv, file = paste(“Ginv_rseq_at_iter_”, t, “ .csv”, sep =“ ”), sep=“ ,”, row.names=F, quote = F)
attr(Ginv,“ rowNames”) <- paste(rnames)
WGBLUP <- asreml(status ~ 1,
random = ~giv(IID),
ginverse = list(IID = Ginv),
rcov = ~units,
family = asreml.binomial(link = ”logit”),
data = pheno,
maxiter = 100,
na.method.X = ‘omit’)
gebv <- coef(WGBLUP)$random
write.table(gebv, file = paste(“ GEBVs_rseq_at_iter_”, t, “ .csv”, sep =“ ”), sep = “ ,”, row.names = F, quote = F)
#加重後マーカーの効果値を推定する
p1 <- do.call(’rbind’, mclapply(1: nrow(Zt2), FUN = function(x) {D[x] * Zt2[x, ]}, mc.cores = 5))
p2 <- do.call(‘rbind’, mclapply(1: nrow(p1), FUN = function(x) {p1[x, ] %*% invG}, mc.cores = 5))
p3 <- do.call(‘rbind’, mclapply(1: nrow(p2), FUN = function(x) {p2[x, ] %*% gebv02}, mc.cores = 5))
u <- p3 / pp_2
Varu <- u * u * 2 * pi_2 * (1 - pi_2)
write.table(u, file = paste(“markerEff_rseq_at_iter_”, t, “.csv”, sep =“ ”), sep = “ ,”, row.names = F, quote = F)
write.table(Varu, file = paste(“ markerVar_rseq_at_iter_”, t, “ .csv”, sep =“ ”), sep = “ ,”, row.names = F, quote = F)

計算を経て、第4回反復した時に加重GBLUP方法は安定に近づき、そのためこの時の反復結果について後続の研究を実施する。検証に用いられる95の個体GEBVの間における関連係数は0.958で、これらの個体はGEBVは表6による:
g) Two xxx obtained in g). The csv file estimates GEBV in R using the weighted GBLUP method. The specific operation method is as follows (Linux environment):
# Unload the required R packages and functions library (parallell)
library (data.table)
library (asreml)
library (prOC)
source (“ginv.R”)
The definition of the function ginv is as follows:
ginv <-function (invG) {
Ginv <-data. frame (low = rep (1: low (invG), low (invG)), logic = rep (1: low (invG), reach = narrow (invG)), value = as.numeric (invG), low. = As.logical (lower.tri (invG, diag = T)))
Ginv <-Ginv [Ginv $ lower. mat == T, c (“low”, “column”, “value”)]
Ginv <-Ginv [order (Ginv $ row, Ginv $ volume),]
return (Ginv)
}
# Read genotype and phenotype information geno_chip <-as. matrix (fread (“geno_Val_Chip.csv”, nThread = 10))
geno_rseq <-as. matrix (fread (“geno_Val_Rseq.csv”, nThread = 10))
pheno <-asreml. read. table ("pheno_op_Val.csv", header = T, sep = ",")
rnames <-as. matrix (fread (“pheno_op_Val.csv”)) [, 1]
M_1 <-geno_chip
M_2 <-geno_rseq
# Calculate the frequency of secondary coordinating genes at each site pi_1 <-round (colSums (M_1) / (2 * now (M_1)), 3)
pi_2 <-round (colSums (M_2) / (2 * now (M_2)), 3)
# Build a P matrix P_1 <-matrix (2 * pi_1, byrow = T, now = now (M_1), ncol = ncol (M_1))
P_2 <-matrix (2 * pi_2, byrow = T, now = now (M_2), ncol = ncol (M_2))
# Build a Z matrix Z_1 <-as. matrix (M_1-P_1)
Z_2 <-as. matrix (M_2-P_2)
# ZZt_1 to construct the term of equation numerator <-do. call ('rbind', mclapply (1: now (Z_1), FUN = faction (x) {tcrossprod (Z_1 [x,], Zt_1)}, mc.cores = 20))
ZZt_2 <-do. call ('rbind', mclapply (1: now (Z_2), FUN = faction (x) {tcrossprod (Z_2 [x,], Zt_2)}, mc.cores = 20))
# Construct the term of equation denominator denom_1 <-2 * (sum (pi_1 * (1-pi_1)))
denom_2 <-2 * (sum (pi_2 * (1-pi_2)))
# Build the G matrix G_chip <-ZZt_1 / denom_1
G_rseq <-ZZt_2 / denom_2
diag (G_chip) <-diag (G_chip) + 0.01
diag (G_rseq) <-diag (G_rseq) + 0.01
# G Matrix Inversion invG_chip0 <-solve (G_chip)
invG_rseq0 <-solve (G_rseq)
# G Matrix inversion is converted to a three-column format that ASReml can use and output Ginv_chip <-ginv (invG_chip)
Ginv_rseq <-ginv (invG_rseq)
write. table (Ginv_chip, "Ginv_op_Val_Chip_at_itter_0.csv", sep = ",", low.names = F, quote = F)
write. table (Ginv_rseq, "Ginv_op_Val_Rseq_at_ita_0.csv", sep = ",", low.names = F, quote = F)
#Calculate GEBV attr (Ginv_chip, “lowNames”) <-paste (rnames)
attr (Ginv_rseq, “lowNames”) <-paste (runames)
WGBLUP1 <-asreml (status ~ 1,
random = ~ giv (IID),
giverse = list (IID = Ginv_chip),
rkov = ~ units,
family = asreml. Binomial (link = “logit”)
data = pheno,
maximizer = 100,
na. method. X ='omit')
gebv01 <-coef (WGBLUP1) $ random
write. table (gebv01, "GEBVs_at_ita_0_chip.csv", sep = ",", low.names = F, quote = F)

WGBLUP2 <-asreml (status ~ 1,
random = ~ giv (IID),
giverse = list (IID = Ginv_rseq),
rkov = ~ units,
family = asreml. Binomial (link = “logit”),
data = pheno,
maximizer = 100,
na. method. X ='omit')
gebv02 <-coef (WGBLUP2) $ random
write. table (gebv02, "GEBVs_at_ita_0_rseq.csv", sep = ",", low.names = F, quote = F)

#GBLUP process ・ Zt1 <-t (Z_1) that estimates the effect value of the marker that weights the chip portion and executes the weighted iteration process 6 times using the for cycle.
p1 <-do. call ('rbind', mclappley (1: now (Zt1), FUN = faction (x) {Zt1 [x,]% *% invG01}, mc.cores = 5))
p2 <-p1% *% gebv01
u01 <-p2 / pp_1
Varu01 <-u01 * u01 * 2 * pi_1 * (1-pi_1)
write. table (u01, "markerEff_chip_at_itter_0.csv", sep = ",", low.names = F, quaote = F)
write. table (Varu01, "markerVar_chip_at_ita_0.csv", sep = ",", low.names = F, quote = F)
# Estimate the weight of each marker D <-rep (1, ncol (M_1))
D0 <-rep (1, ncol (M_1))
pre_D <-D0
u <-u01
cal <-6
inter <-1
for (t in 1: cal) {
d <-as. vector (u * u * 2 * pi_1 * (1-pi_1))
for (i in 1: (length (d) / inter)) {
di <-mean (d [(inter * (i-1) + 1): (inter * i)])
pre_D [(inter * (i-1) + 1): (inter * i)] <-di
}
D <-sum (D0) / sum (pre_D) * pre_D
write. table (D, file = paste ("weights_chip_at_ita_", t, ".csv", sep = ""), sep = ",", low.names=F, quote = F)
# Estimate the weighted G matrix p1 <-do. call ('cbind', mclapply (1: ncol (Z_1), FUN = faction (x) {Z_1 [, x] * D [x]}, mc.cores = 5))
p2 <-do. call ('rbind', mclapply (1: now (p1), FUN = faction (x) {p1 [x,]% *% Zt1}, mc.cores = 5))
G <-p2 / pp_1
write. table (G, file = paste ("G_chip_at_itter_", t, ".csv", sep = ""), sep = ",", row.names=F, quote = F)
diag (G) <-diag (G) + 0.01
invG <-solve (G)
write. table (invG, file = paste ("invG_chip_at_itter_", t, ".csv", sep = ""), sep = ",", row.names=F, quote = F)
Ginv <-ginv (invG)
write. table (Ginv, file = paste ("Ginv_chip_at_itter_", t, ".csv", sep = ""), sep = ",", row.names=F, quote = F)
attr (Ginv, "lowNames") <-paste (runames)
WGBLUP <-asreml (status ~ 1,
random = ~ giv (IID),
giverse = list (IID = Ginv),
rkov = ~ units,
family = asreml. Binomial (link = “logit”),
data = pheno,
maximizer = 100,
na. method. X ='omit')
gevv <-coef (WGBLUP) $ random
write. table (gebv, file = paste ("GEBVs_chip_at_ita_", t, ".csv", sep = ""), sep = ",", low.names = F, quote = F)
# Estimate the effect value of the post-weighted marker p1 <-do. call ('rbind', mclapply (1: now (Zt1), FUN = faction (x) {D [x] * Zt1 [x,]}, mc.cores = 5))
p2 <-do. call ('rbind', mclappley (1: now (p1), FUN = faction (x) {p1 [x,]% *% invG}, mc.cores = 5))
p3 <-do. call ('rbind', mclapply (1: now (p2), FUN = faction (x) {p2 [x,]% *% gebv01}, mc.cores = 5))
u <-p3 / pp_1
Varu <-u * u * 2 * pi_1 * (1-pi_1)
write. table (u, file = paste ("markerEff_chip_at_ita_", t, ".csv", sep = ""), sep = ",", low.names = F, quote = F)
write. table (Varu, file = paste ("markerVar_chip_at_itter_", t, ".csv", sep = ""), sep = ",", low.names = F, quote = F)
}
#GBLUP process-Zt2 <-t (Z_2) that weights the rearranged part and executes a weighted iteration process 6 times using a for cycle.
p1 <-do. call ('rbind', mclappley (1: now (Zt2), FUN = faction (x) {Zt2 [x,]% *% invG02}, mc.cores = 5))
p2 <-p1% *% gebv02
u02 <-p2 / pp_2
Varu02 <-u02 * u02 * 2 * pi_2 * (1-pi_2)
write. table (u02, "markerEff_rsq_at_itter_0.csv", sep = ",", low.names = F, quaote = F)
write. table (Varu02, "markerVar_rseq_at_ita_0.csv", sep = ",", low.names = F, quote = F)
# Estimate the weight of each marker D <-rep (1, ncol (M_2))
D0 <-rep (1, ncol (M_2))
pre_D <-D0
u <-u02
cal <-6
inter <-1
for (t in 1: cal) {
d <-as. vector (u * u * 2 * pi_2 * (1-pi_2))
for (i in 1: (length (d) / inter)) {
di <-mean (d [(inter * (i-1) + 1): (inter * i)])
pre_D [(inter * (i-1) + 1): (inter * i)] <-di
}
D <-sum (D0) / sum (pre_D) * pre_D
write. table (D, file = paste ("weights_rsq_at_itter_", t, ".csv", sep = ""), sep = ",", low.names=F, quote = F)
# Estimate the weighted G matrix p1 <-do. call ('cbind', mclapply (1: ncol (Z_2), FUN = faction (x) {Z_2 [, x] * D [x]}, mc.cores = 5))
p2 <-do. call ('rbind', mclapply (1: now (p1), FUN = faction (x) {p1 [x,]% *% Zt2}, mc.cores = 5))
G <-p2 / pp_2
write. table (G, file = paste ("G_rseq_at_itter_", t, ".csv", sep = ""), sep = ",", row.names=F, quote = F)
diag (G) <-diag (G) + 0.01
invG <-solve (G)
write. table (invG, file = paste (“invG_rseq_at_itter_”, t, “.csv”, sep = “”), sep = “,”, low.names = F, quote = F)
Ginv <-ginv (invG)
write. table (Ginv, file = paste (“Ginv_rsq_at_itter_”, t, “.csv”, sep = “”), sep = “,”, low.names = F, quote = F)
attr (Ginv, "lowNames") <-paste (runames)
WGBLUP <-asreml (status ~ 1,
random = ~ giv (IID),
giverse = list (IID = Ginv),
rkov = ~ units,
family = asreml. Binomial (link = "logit"),
data = pheno,
maximizer = 100,
na. method. X ='omit')
gevv <-coef (WGBLUP) $ random
write. table (gebv, file = paste ("GEBVs_rseq_at_ita_", t, ".csv", sep = ""), sep = ",", low.names = F, quote = F)
# Estimate the effect value of the post-weighted marker p1 <-do. call ('rbind', mclapply (1: now (Zt2), FUN = faction (x) {D [x] * Zt2 [x,]}, mc.cores = 5))
p2 <-do. call ('rbind', mclappley (1: now (p1), FUN = faction (x) {p1 [x,]% *% invG}, mc.cores = 5))
p3 <-do. call ('rbind', mclapply (1: now (p2), FUN = faction (x) {p2 [x,]% *% gebv02}, mc.cores = 5))
u <-p3 / pp_2
Varu <-u * u * 2 * pi_2 * (1-pi_2)
write. table (u, file = paste ("markerEff_rsq_at_ita_", t, ".csv", sep = ""), sep = ",", low.names = F, quote = F)
write. table (Varu, file = paste ("markerVar_rseq_at_ita_", t, ".csv", sep = ""), sep = ",", low.names = F, quote = F)
}
After calculation, the weighted GBLUP method approaches stability at the 4th iteration, so subsequent studies will be conducted on the results of the iterations at this time. The correlation coefficient among the 95 individual GEBVs used for validation was 0.958, and these individuals had GEBVs according to Table 6.

Figure 2022518304000009
Figure 2022518304000010
Figure 2022518304000011
Figure 2022518304000009
Figure 2022518304000010
Figure 2022518304000011

2、「魚チップ1号」遺伝子チップサイトが参考グループにおける検証
発明者の既存の参考グループから「魚チップ1号」遺伝子チップの設計サイトを抽出し、これらのサイト情報を利用して加重GBLUPを実施し、かつ5倍の交差検証方法のランダムな組分けを用いて加重GBLUP予測正確性の評価方法として、被検測者の操作特徴曲線下面積(AUC)を加重GBLUPの評価方法で正確性の指標とする。解析モデルは一般化線形混合モデルを用いる。組分けのランダム誤差を減らすために、データセットに10回の組分けを行い、各組は5回計算する。そのため、延べ50回計算し、50回のAUCの平均値を最終的な評価結果とする。
2. Verification of "Fish Chip No. 1" gene chip site in the reference group Extract the design site of "Fish Chip No. 1" gene chip from the existing reference group of the inventor, and use these site information to perform weighted GBLUP. As a method for evaluating the weighted GBLUP prediction accuracy by using random grouping of the cross-validation method of 5 times, the area under the operating characteristic curve (AUC) of the subject is evaluated by the weighted GBLUP evaluation method. It is used as an index of. A generalized linear mixed model is used as the analytical model. In order to reduce the random error of grouping, the data set is grouped 10 times, and each group is calculated 5 times. Therefore, a total of 50 times are calculated, and the average value of 50 times of AUC is used as the final evaluation result.

解析結果によると、ヒラメ参考グループにおいてSNPチップと同様であるマーカーを用いてゲノム選択を実施し、AUC(正確性)値が0.885で、従来のBLUP方法によるAUC(0.579)値より高い。そのため、発明者が設計されるチップサイトを用いてはゲノム選択を順調かつ高効率に実施することができる。 According to the analysis results, genome selection was performed using markers similar to SNP chips in the flatfish reference group, and the AUC (accuracy) value was 0.885, which was higher than the AUC (0.579) value by the conventional BLUP method. high. Therefore, the genome selection can be carried out smoothly and efficiently by using the chip site designed by the inventor.

具体的な操作方法は以下のとおりである:
fcGENE、BEAGLEとPLINKソフトウェアを利用してヒラメ参考グループから抽出されるチップ設計サイト:欠損サイトを充填しかつ遺伝子型ファイルを出力し、コマンドは以下のとおりである:
fcgene --ped geno_op_Rseq_chip.ped --map geno_op_Rseq_chip.map --oformat beagle --out plink2beagle

java -Xmx5120m -jar beagle.jar unphased=plink2beagle.bgl missing=0 niterations=20 gprobs=true out=imputed_geno

gunzip imputed_geno.plink2beagle.bgl.phased.gz

fcgene --bgl imputed_geno.plink2beagle.bgl.phased --pedinfo plink2beagle_pedinfo.txt --snpinfo plink2beagle_snpinfo.txt --oformat plink --out beagle2plink

plink --file beagle2plink --recode A --out genotype_op_chip_Rseq
さらにRに以下のコマンドを入力する:
library(data.table)
geno <- fread(“genotype_op_chip_Rseq.raw”)
geno[, c(1: 6):= NULL]
fwrite(geno, “genotype_op_chip_Rseq.csv”, sep = “ ,”, row.names = F, quote = F)
The specific operation method is as follows:
Chip design site extracted from flounder reference group using fcGENE, BEAGLE and PLLINK software: Filling the defective site and outputting the genotype file, the command is:
fcgene --- ped geno_op_Rseq_chip. ped --map geno_op_Rseq_chip. map --- offormat beagle --- out pink2beagle

Java-Xmx5120m-jar beagle. jar unphased = pink2beagle. bgl misising = 0 nitterations = 20 gprobs = true out = imputed_geno

gzipip imputed_geno. pink2beagle. bgl. phased. gz

fcgene --bgl imputed_geno. pink2beagle. bgl. phased --- pedinfo pink2beagle_pedinfo. pxt --- snpinfo pink2beagle_snpinfo. pxt --- offormat pink --- out beagle2link

pink --file beagle2link --recode A --out genotipe_op_chip_Rseq
Then enter the following command in R:
library (data.table)
geno <-fread ("genotype_op_chip_Rseq.raw")
geno [, c (1: 6): = NULL]
fwrite (geno, “genotype_op_chip_Rseq.csv”, sep = “,”, low.names = F, quote = F)

a)Rにおいてa)で取得した遺伝子型ファイルを利用してRにおいて加重GBLUPを実施する。加重GBLUPの具体的な方法は1)におけるh)部分を参照して実施する。 a) In R, weighted GBLUP is performed in R using the genotype file obtained in a). The specific method of weighted GBLUP is carried out with reference to the h) part in 1).

b)構築済みの加重GマトリックスをASReml-Rに代入して交差検証方法を行う。交差検証方法前に組分けを実施する必要がある:Rにおいて関数sample(1:931、931)を用いてすべての個体にランダムソートを行い、さらにソート後のデジタルを5列に分け、各列に含まれる要素個数はそれぞれ186・186・186・186と187である。上記の過程を10回繰り返し、延べ10のファイルを取得する。この10のファイルを同一のフォルダに入れて使用に備える。解析は一般化線形混合モデルを用い、異なる検測ロットと個体齢を固定効果として、各個体はランダム効果として適合を実施する。5倍の交差検証方法の具体的な実施方法は以下のとおりである:
#必要なRパッケージと関数をアンロードする
library(parallel)
library(asreml)
library(pROC)
#表現型、Gマトリックスインバージョンの三列形式を読み取る
pheno <- asreml.read.table(“phenotype_931.csv”, header=T, sep=“ ,”, na.string = NA)
ped <- asreml.read.table(“pedigree_931.csv”, header=T, sep=“ ,”, na.string = NA)
ainv <- asreml.Ainverse(ped)$ginv
Ginv <- fread(“…/ Ginv_at_iter_4.csv ”)
attr(Ginv,“rowNames”) <- paste(pheno[, 1])
#外部サイクルの回数を設定する
N <- 10
#結果変数を設定する
res <- matrix(NA, nrow = 5 * N, ncol = 1)
colnames(res) <- c(”auc”)
#交差検証方法を行いかつ検証結果を出力する
for (i in 1: N) {
coor <- read.table(file = paste(“./coor/coor_”, i, “.csv”, sep =“ ”), sep = “ ,”, header = F)
cyc <- ncol(coor)
gebv <- matrix(NA, nrow = nrow(pheno), ncol = cyc)
for (j in 1: cyc) {
y <- pheno
y$status[coor[(1: sum(coor[, j] > 0, na.rm = T)), j]] <- NA
CV <- asreml(status ~ Batch + Age,
random = ~giv(AnimalID),
ginverse = list(AnimalID = Ginv),
rcov = ~units,
family = asreml.binomial(link = “logit”),
data = y,
maxiter = 50)

gebv[, j] <- coef(CV)$random
write.table(gebv, file = paste(“GEBVs_chip_ref_coor_”, i, “ .csv”, sep =“ ”), sep =“ ,”, row.names = F, quote = F)
res[(j + (i - 1) * cyc), ] <- roc(as.vector(pheno$status[coor[(1: sum(coor[, j] > 0, na.rm = T)), j]]), as.vector(gebv[coor[(1: sum(coor[, j] > 0, na.rm = T)), j], j]))$auc
write.table(res, “results_of_roc_op_chip_ref.csv”, sep = “ ,”, row.names = F, quote = F)


#従来のBLUPの推定方法は加重GBLUPと同様で、モデルにおいてainvを用いてGinvを代替すればよい
上記のコードを実行した後、加重GBLUP方法によるAUC平均値は0.885で、従来のBLUP方法によるAUC平均値は0.579である。50回の交差検証方法の結果は表7による:
b) Substitute the constructed weighted G matrix into ASReml-R to perform cross-validation. It is necessary to carry out grouping before the cross-validation method: Random sort is performed on all individuals using the function sample (1: 931, 931) in R, and the sorted digital is further divided into 5 columns, and each column is divided. The number of elements included in is 186, 186, 186, 186 and 187, respectively. The above process is repeated 10 times to obtain a total of 10 files. Put these 10 files in the same folder to prepare for use. The analysis uses a generalized linear mixed model, with different measurement lots and individual ages as fixed effects, and each individual performing adaptation as a random effect. The specific method of implementing the 5-fold cross-validation method is as follows:
# Unload the required R packages and functions library (parallell)
library (asreml)
library (prOC)
# Phenotype, read the three-column format of G-matrix inversion pheno <-asreml. read. table ("phenotype_931.csv", header = T, sep = ",", na.string = NA)
ped <-asreml. read. table (“pedigree_931.csv”, header = T, sep = “,”, na.string = NA)
ainv <-asreml. Ainverse (ped) $ ginv
Ginv <-fread ("... / Ginv_at_ita_4.csv")
attr (Ginv, "lowNames") <-paste (pheno [, 1])
# Set the number of external cycles N <-10
# Set the result variable res <-matrix (NA, now = 5 * N, ncol = 1)
colnames (res) <-c ("auc")
# Perform cross-validation method and output verification result for (i in 1: N) {
coor <-read. file (file = paste ("./coor/coor_", i, ".csv", sep = ""), sep = ",", header = F)
cyc <-ncol (coor)
gevv <-matrix (NA, now = now (pheno), ncol = cyc)
for (j in 1: cyc) {
y <-pheno
y $ status [coor [(1: sum (coor [, j]> 0, na.rm = T)), j]] <-NA
CV <-asreml (status-Batch + Age,
random = ~ giv (AnimalID),
ginverse = list (AnimalID = Ginv),
rkov = ~ units,
family = asreml. Binomial (link = “logit”),
data = y,
maximizer = 50)

gebv [, j] <-coef (CV) $ random
write. table (gebv, file = paste ("GEBVs_chip_ref_coor_", i, ".csv", sep = ""), sep = ",", low.names = F, quote = F)
res [(j + (i-1) * cyc),] <-loc (as.vector (pheno $ status [coor [(1: sum (coor [, j]> 0, na.rm = T))), j]]), as.vector (gebv [coor [(1: sum (coor [, j]> 0, na.rm = T)), j], j])) $ auc
write. table (res, "results_of_loc_op_chip_ref.csv", sep = ",", low.names = F, quote = F)
}
}
# Conventional BLUP estimation methods are similar to weighted GBLUPs, and ainvs can be used to replace Ginvs in the model. After executing the above code, the AUC mean by weighted GBLUP methods is 0.885, which is a conventional BLUP. The AUC averages by method are 0.579. The results of the 50 cross-validation methods are shown in Table 7.

Figure 2022518304000012
Figure 2022518304000013
Figure 2022518304000012
Figure 2022518304000013

3、「魚チップ1号」遺伝子チップがヒラメ耐病性育種における応用
候補個体のゲノム推定育種価(GEBV)を推定するために、まず候補個体遺伝子型(「魚チップ1号」遺伝子チップ分類から取得される)を発明者の既存の参考グループ遺伝子型と組み合わせ、さらにRを用いて加重Gマトリックスを構築し、最後に用意済みの加重Gマトリックスと表現型データをASReml-Rに代入し、加重GBLUP方法を用いて各家系成体のGEBVを推定し、さらに成体GEBVの平均値を相応の家系のGEBVとする。各家系を感染生存率に従って高生存率家系(生存率が55%より高い)と低生存率家系(生存率が55%より低い)、かつ各家系GEBVと感染生存率の間のAUC値を計算し、さらに該AUC値を2で取得したAUC値と比較し、近いひいては2で取得したAUCより高い場合、発明者が設計される遺伝子チップがゲノム選択技術の要件を満たしかつヒラメ耐病性選別において優れた応用効果を有することを示す。各家系GEBVと感染生存率の間のAUC値を計算しかつ該AUC値を2で取得したAUC値と比較し、それにより発明者が設計される遺伝子チップとゲノム選択技術がヒラメ耐病性選別における実際の応用効果を検証するために用いられるもの。AUC値を推定する前、子の代の感染生存率を

Figure 2022518304000014
で変換を実施する。変換後、平均値より高い家系の生存率を1と記載し、平均値より低いものを0と記載する。遺伝子チップ方法がマーカーに対する要件を満たし、すべての候補個体の分類結果に組み合わせを行い、各チップの分類結果を単独で参考グループ遺伝子型と合併する必要がある。 3. Application of "Fish Chip No. 1" gene chip in scallop disease-resistant breeding In order to estimate the genome estimated breeding value (GEBV) of candidate individuals, first obtain the candidate individual genotype ("Fish Chip No. 1" gene chip classification). Is combined with the inventor's existing reference group genotype, and a weighted G matrix is constructed using R. Finally, the prepared weighted G matrix and phenotype data are substituted into ASReml-R, and the weighted GBLUP. The GEBV of each adult family is estimated using the method, and the average value of the adult GEBV is taken as the GEBV of the corresponding family. High survival rate (survival rate is higher than 55%) and low survival rate family (survival rate is lower than 55%) according to the infection survival rate of each family, and AUC value between GEBV and infection survival rate of each family. And further compare the AUC value with the AUC value obtained in 2, and if it is close and thus higher than the AUC obtained in 2, the gene chip designed by the inventor meets the requirements of genome selection technology and is resistant to fluff. It is shown that it has an excellent application effect in sorting. The AUC value between each family GEBV and infection survival rate is calculated and the AUC value is compared with the AUC value obtained in 2, and the gene chip and genome selection technology designed by the inventor are used for flounder disease resistance selection. Used to verify the actual application effect in. Before estimating the AUC value, determine the infection survival rate of the offspring.
Figure 2022518304000014
Perform the conversion with. After the conversion, the survival rate of the family whose family value is higher than the average value is described as 1, and the survival rate of the family whose value is lower than the average value is described as 0. It is necessary that the gene chip method meets the requirements for the marker, the classification results of all candidate individuals are combined, and the classification results of each chip are independently combined with the reference group genotype.

解析結果によると16の候補群の子の代家系における6の高生存率家系の平均生存率が62.4%であるおよび10の低生存率家系の平均的な生存率が33.47%(表8)であることがわかる。そのうち、高生存率家系成体の平均的なGEBVが2.10で、低生存率家系成体の平均的なGEBVが1.34である。計算はよると、これらのヒラメ家系のGEBV値を利用してはその感染生存率が予測される正確性が0.794に達することができ、理論値に近い。そのため、発明者が設計される遺伝子チップはヒラメ耐病性性状の選別に効果的に応用することができる。 According to the analysis results, the average survival rate of 6 high survival rate families in the offspring of 16 candidate groups is 62.4% and the average survival rate of 10 low survival rate families is 33.47% ( It can be seen that Table 8). Among them, the average GEBV of adults with high survival rate is 2.10, and the average GEBV of adults with low survival rate is 1.34. According to the calculation, the accuracy of predicting the infection survival rate can reach 0.794 by using the GEBV value of these flounder families, which is close to the theoretical value. Therefore, the gene chip designed by the inventor can be effectively applied to the selection of flounder disease resistance properties.

具体的な操作方法は以下のとおりである:
PLINKとRを利用して候補個体分類ファイルを利用し、さらに候補個体から後続検証に用いられる個体を選定し、これらの個体情報を一つのテキストファイルに貯蔵し、家系番号・個体番号・父本番号・母本番号・性別と表現型値の配列に従ってテキストファイルを用意し、各行に一つの個体で、各個体の各項目情報はtable区切り記号を用いて区切りを実施する。発明を実施するための形態:
#各SNPチップの分類結果を読み取り、結果はxxx.bed、xxx.bimとxxx.famに貯蔵される。
plink --vcf op1.vcf --make-bed --out op1
plink --vcf op2.vcf --make-bed --out op2
plink --vcf op3.vcf --make-bed --out op3

Rにおいて各ファイルにおけるSNPの命名方法を変換し、具体的な操作方法は以下のとおりである:
#Rパッケージをアンロードする
library(data.table)
#データを読み取る
op1 <- fread(“op1.bim”, header = F)
op2 <- fread(“op2.bim”, header = F)
op3 <- fread(“op3.bim”, header = F)
#SNP名称を変換する
op1$V2 <- paste(paste(rep(“rs”, nrow(op1)), op1$V1, sep =“ ”), op1$V4, sep = “:”)
op2$V2 <- paste(paste(rep(“rs”, nrow(op2)), op2$V1, sep =“ ”), op2$V4, sep = “:”)
op3$V2 <- paste(paste(rep(“rs”, nrow(op3)), op3$V1, sep =“ ”), op3$V4, sep = “:”)
#各チップが取得した分類情報を出力する
write.table(op1$V2, “snps_op1.txt”, sep = “¥t”, col.names = F, row.names = F, quote = F)
write.table(op2$V2, “snps_op2.txt”, sep = “¥t”, col.names = F, row.names = F, quote = F)
write.table(op3$V2, “snps_op3.txt”, sep = “¥t”, col.names = F, row.names = F, quote = F)
#整理後のbimファイルを出力する
fwrite(op1, “op1.bim”, sep = “¥t”, col.names = F, row.names = F, quote = F)
fwrite(op2, “op2.bim”, sep = “¥t”, col.names = F, row.names = F, quote = F)
fwrite(op3, “op3.bim”, sep = “¥t”, col.names = F, row.names = F, quote = F)
#Rを終了する

検証個体情報を用意し、かつタイトルが「selected_indi.txt」のファイルに入れ、ファイルは以下の方式に従って整理する:
1701.CEL 1701.CEL 0 0 0 -9
1702.CEL 1702.CEL 0 0 0 -9
#PLINKソフトウェアを利用して各SNPチップに含まれる個体を抽出し、その方式は以下のとおりである:
plink --bfile op1 --keep selected_indi.txt --make-bed --out op_can_1
plink --bfile op2 --keep selected_indi.txt --make-bed --out op_can_2
plink --bfile op3 --keep selected_indi.txt --make-bed --out op_can_3

#検証に用いられる個体の遺伝子型ファイルを出力する
plink --bfile op_can_1 --recode A --out op1
plink --bfile op_can_2 --recode A --out op2
plink --bfile op_can_3 --recode A --out op3
さらにRにおいて以下のコマンドを用いて遺伝子型ファイルを出力する:
library(data.table)
op1 <- fread(“op1.raw”)
op2 <- fread(“op2.raw”)
op3 <- fread(“op3.raw”)
op1[, c(1:6):= NULL]
op2[, c(1:6):= NULL]
op3[, c(1:6):= NULL]
fwrite(op1, “geno_op1_can.csv”, sep = “ ,”, row.names = F, quote = F)
fwrite(op2, “geno_op2_can.csv”, sep = “ ,”, row.names = F, quote = F)
fwrite(op3, “geno_op3_can.csv”, sep = “ ,”, row.names = F, quote = F)
a)PLINKとRを利用して参考グループ遺伝子型をそれぞれ各チップ候補個体遺伝子型と組み合わせを実施し、具体的な操作方法は以下のとおりである:
plink --bfile …/op_Ref --extract snps_op1.txt --recode A --out op_Ref_op1
plink --bfile …/op_Ref --extract snps_op2.txt --recode A --out op_Ref_op2
plink --bfile …/op_Ref --extract snps_op3.txt --recode A --out op_Ref_op3
さらにRにおいて以下のコマンドを用い、遺伝子型ファイルを処理し、出力する:
library(data.table)
#参考グループ遺伝子型を読み取りかつそれに処理を実施する
op1_ref <- fread(“op_Ref_op1.raw”)
op2_ref <- fread(“op_Ref_op2.raw”)
op3_ref <- fread(“op_Ref_op3.raw”)
op1_ref[, c(1: 6):= NULL]
op2_ref[, c(1: 6):= NULL]
op3_ref[, c(1: 6):= NULL]
op1_ref <- as.matrix(op1_ref)
op2_ref <- as.matrix(op2_ref)
op3_ref <- as.matrix(op3_ref)
#候補個体遺伝子型を処理する:読み取り、組み合わせ
op1_can <- as.matrix(fread(“geno_op1_can.csv”))
op2_can <- as.matrix(fread(“geno_op2_can.csv”))
op3_can <- as.matrix(fread(“geno_op3_can.csv”))
geno_op1 <- rbind(op1_can, op1_ref)
geno_op2 <- rbind(op2_can, op2_ref)
geno_op3 <- rbind(op3_can, op3_ref)
write.table(geno_op1, “geno_op1.csv”, sep = “ ,”, row.names = F, quote = F)
write.table(geno_op2, “geno_op2.csv”, sep =“ ,”, row.names = F, quote = F)
write.table(geno_op3, “geno_op3.csv”, sep = “ ,”, row.names = F, quote = F)
The specific operation method is as follows:
A candidate individual classification file is used using PLLNK and R, and individuals to be used for subsequent verification are selected from the candidate individuals, and these individual information is stored in one text file, and the family number, individual number, and father's book are stored. A text file is prepared according to an array of numbers, mother numbers, genders, and phenotypic values, and each item is separated by one individual on each line, and each item information of each individual is separated by using a table delimiter. A form for carrying out the invention:
# Read the classification result of each SNP chip, and the result is xxx. bed, xxx. Bim and xxx. Stored in fam.
pink --- vcf op1. vcf --make-bed --out op1
pink --vcf op2. vcf --make-bed --out op2
pink --vcf op3. vcf --make-bed --out op3

The SNP nomenclature in each file is converted in R, and the specific operation method is as follows:
#R Unloading the package library (data.table)
# Read data op1 <-fread (“op1.bim”, header = F)
op2 <-fread (“op2.bim”, header = F)
op3 <-fread (“op3.bim”, header = F)
# Convert SNP name op1 $ V2 <-paste (paste (rep ("rs", now (op1)), op1 $ V1, sep = ""), op1 $ V4, sep = ":")
op2 $ V2 <-paste (paste (rep ("rs", now (op2)), op2 $ V1, sep = ""), op2 $ V4, sep = ":")
op3 $ V2 <-paste (paste (rep ("rs", now (op3)), op3 $ V1, sep = ""), op3 $ V4, sep = ":")
# Write to output the classification information acquired by each chip. table (op1 $ V2, "snaps_op1.txt", sep = "\ t", col.names = F, row.names = F, quote = F)
write. table (op2 $ V2, “snaps_op2.txt”, sep = “¥ t”, col.names = F, row.names = F, quote = F)
write. table (op3 $ V2, “snaps_op3.txt”, sep = “¥ t”, col.names = F, row.names = F, quote = F)
# Output the organized Bim file fwrite (op1, "op1.bim", sep = "\ t", col.names = F, row.names = F, quote = F)
file (op2, “op2.bim”, sep = “¥ t”, col.names = F, low.names = F, quote = F)
file (op3, “op3.bim”, sep = “¥ t”, col.names = F, low.names = F, quote = F)
End # R

Prepare the verification individual information and put it in a file with the title "selected_indi.txt", and organize the files according to the following method:
1701. CEL 1701. CEL 0 0 0-9
1702. CEL 1702. CEL 0 0 0-9
The individual contained in each SNP chip is extracted using #LINK software, and the method is as follows:
pink --bfile op1 -- keep selected_indi. pxt --make-bed --out op_can_1
pink --bfile op2 -- keep selected_indi. pxt --make-bed --out op_can_2
pink --bfile op3 -- keep selected_indi. pxt --make-bed --out op_can_3

# Output the genotype file of the individual used for verification pink --bfile op_can_1 --recode A --out op1
pink --bfile op_can_2 --recode A --out op2
pink --bfile op_can_3 --recode A --out op3
Furthermore, in R, the genotype file is output using the following command:
library (data.table)
op1 <-fread (“op1.raw”)
op2 <-fread (“op2.raw”)
op3 <-fread (“op3.raw”)
op1 [, c (1: 6): = NULL]
op2 [, c (1: 6): = NULL]
op3 [, c (1: 6): = NULL]
fwrite (op1, “geno_op1_can.csv”, sep = “,”, low.names = F, quote = F)
fwrite (op2, “geno_op2_can.csv”, sep = “,”, low.names = F, quote = F)
fwrite (op3, “geno_op3_can.csv”, sep = “,”, low.names = F, quote = F)
a) Using PLLNK and R, the reference group genotypes are combined with each chip candidate individual genotype, and the specific operation method is as follows:
pink --bfile… / op_Ref --extract snaps_op1. pxt --recode A --out op_Ref_op1
pink --bfile… / op_Ref --extract snaps_op2. pxt --recode A --out op_Ref_op2
pink --bfile… / op_Ref --extract snaps_op3. pxt --recode A --out op_Ref_op3
In addition, use the following command in R to process and output the genotype file:
library (data.table)
# Reference group Read the genotype and perform processing on it op1_ref <-fread (“op_Ref_op1.raw”)
op2_ref <-fread (“op_Ref_op2.raw”)
op3_ref <-fread (“op_Ref_op3.raw”)
op1_ref [, c (1: 6): = NULL]
op2_ref [, c (1: 6): = NULL]
op3_ref [, c (1: 6): = NULL]
op1_ref <-as. matrix (op1_ref)
op2_ref <-as. matrix (op2_ref)
op3_ref <-as. matrix (op3_ref)
# Process candidate individual genotypes: read, combine op1_can <-as. matrix (fread (“geno_op1_can.csv”))
op2_can <-as. matrix (fread (“geno_op2_can.csv”))
op3_can <-as. matrix (fread (“geno_op3_can.csv”))
geno_op1 <-rbind (op1_can, op1_ref)
geno_op2 <-rbind (op2_can, op2_ref)
geno_op3 <-rbind (op3_can, op3_ref)
write. table (geno_op1, “geno_op1.csv”, sep = “,”, low.names = F, quote = F)
write. table (geno_op2, "geno_op2.csv", sep = ",", low.names = F, quote = F)
write. table (geno_op3, "geno_op3.csv", sep = ",", low.names = F, quote = F)

b)b)における処理済みの4つの遺伝子型ファイルを利用し、Rにおいてそれぞれ加重Gマトリックスを構築し、加重Gマトリックスの構築方法は1)における記述と同様である b) Using the four processed genotype files in b), a weighted G matrix is constructed in R, respectively, and the method for constructing the weighted G matrix is the same as the description in 1).

c)ASReml-Rを用いて候補個体のGEBVを推定し、コードは以下のとおりである:
#必要なRパッケージと関数をアンロードする
library(data.table)
library(asreml)
###op1###
pheno <- asreml.read.table(“phenotype_op1.csv”, sep = “ ,”, header = T)
Ginv <- fread(“…/Ginv_op1.csv”)
attr(Ginv, “rowNames”) <- paste(pheno[, 1])
op1 <- asreml(status ~ Batch + Age, random = ~giv(AnimalID), ginverse = list(AnimalID = Ginv), rcov = ~units, family = asreml.binomial(link = “logit”), na.method.X = “omit”, data = pheno, maxiter = 50)
write.table(coef(op1)$random, “gebv_op1.csv”, sep = “ ,”, col.names = F, quote = F)

###op2###
pheno <- asreml.read.table(“ phenotype_op2.csv”, sep = “ ,”, header = T)
Ginv <- fread(“ …/Ginv_op2.csv”)
attr(Ginv, “ rowNames”) <- paste(pheno[, 1])
op2 <- asreml(status ~ Batch + Age, random = ~giv(AnimalID), ginverse = list(AnimalID = Ginv), rcov = ~units, family = asreml.binomial(link = “logit”), na.method.X = “ omit”, data = pheno, maxiter = 50)
write.table(coef(op2)$random, “ gebv_op2.csv”, sep = “ ,”, col.names = F, quote = F)
###op3###
pheno <- asreml.read.table(“ phenotype_op3.csv”, sep = “ ,”, header = T)
Ginv <- fread(“ …/Ginv_op3.csv”)
attr(Ginv, “rowNames”) <- paste(pheno[, 1])
op3 <- asreml(status ~ Batch + Age, random = ~giv(AnimalID), ginverse = list(AnimalID = Ginv), rcov = ~units, family = asreml.binomial(link = “logit”), na.method.X = “omit”, data = pheno, maxiter = 50)
write.table(coef(op3)$random, “gebv_op3.csv”, sep = “ ,”, col.names = F, quote = F)
c) Estimate the GEBV of a candidate individual using ASReml-R and the code is:
# Unload the required R packages and functions library (data.table)
library (asreml)
#### op1 ####
pheno <-asreml. read. table ("phenotype_op1.csv", sep = ",", header = T)
Ginv <-fread ("... / Ginv_op1.csv")
attr (Ginv, “rowNames”) <-paste (pheno [, 1])
op1 <-asreml (status ~ Batch + Age, random = ~ give (AnimalID), giverse = list (AnimalID = Ginv), logov = ~ units, random = inl. X = “omit”, data = pheno, maximizer = 50)
write. table (coef (op1) $ random, "gebv_op1.csv", sep = ",", col.names = F, quote = F)

#### op2 ####
pheno <-asreml. read. table ("phenotype_op2.csv", sep = ",", header = T)
Ginv <-fread (“… / Ginv_op2.csv”)
attr (Ginv, "lowNames") <-paste (pheno [, 1])
op2 <-asreml (status ~ Batch + Age, random = ~ give (AnimalID), giverse = list (AnimalID = Ginv), logov = ~ units, random = inl. X = “omit”, data = pheno, maximizer = 50)
write. table (coef (op2) $ random, "gebv_op2.csv", sep = ",", col.names = F, quote = F)
#### op3 ####
pheno <-asreml. read. table ("phenotype_op3.csv", sep = ",", header = T)
Ginv <-fread (“… / Ginv_op3.csv”)
attr (Ginv, “rowNames”) <-paste (pheno [, 1])
op3 <-asreml (status ~ Batch + Age, random = ~ give (AnimalID), giverse = list (AnimalID = Ginv), logov = ~ units, random = inl. X = “omit”, data = pheno, maximizer = 50)
write. table (coef (op3) $ random, "gebv_op3.csv", sep = ",", col.names = F, quote = F)

d)推定されるすべての候補個体のGEBVに従って相応の家系GEBVを計算し、かつ公式

Figure 2022518304000015
を用いて各家系の感染生存率に変換を行い、変換後の平均値より高い家系の生存率を1に設定し、平均値より低いものを0に設定する。最後に、各家系GEBVと変換後の生存率の間のAUC値を計算し、AUC値の計算方法は以下のとおりである:
#必要なRパッケージをアンロードする
library(data.table)
library(pROC)
#各家系GEBVおよび相応の感染生存率を一つのファイルに整理し、家系番号・GEBVと変換後の感染生存率の配列に従って配列し、各行は一つの家系情報のみを含み、整理後のファイルを読み取る
res <- fread(“…/gebv_and_sr_op_can.csv”)
#子の代の家系生存率を変換する
res$SR_trans <- exp(res$SR) / (1 + exp(res$SR))
#変換後の生存率を平均値に従ってさらに0と1に変換する
SR_binary <- matrix(NA, nrow = nrow(res), ncol = 1)
SR_binary[which(res$SR_trans > mean(res$SR_trans)), ] <- 1
SR_binary[which(res$SR_trans < mean(res$SR_trans)), ] <- 0
#AUC値を計算する
roc(SR_binary[, 1], res$GEBV)
計算を経て、16のヒラメ家系のGEBVを取得し、計算は各家系GEBVと相応の感染生存率の間のAUC(正確性)が0.794であることを示す。各家系GEBVおよび感染生存率は表8による。 d) Calculate and formulate the appropriate pedigree GEBV according to the estimated GEBV of all candidate individuals
Figure 2022518304000015
Is used to convert the infection survival rate of each family, the survival rate of the family higher than the average value after conversion is set to 1, and the survival rate of the family lower than the average value is set to 0. Finally, the AUC value between each family GEBV and the post-conversion survival rate is calculated, and the calculation method of the AUC value is as follows:
# Unload the required R package library (data.table)
library (prOC)
# Organize each family GEBV and corresponding infection survival rate into one file, arrange according to the sequence of family number / GEBV and converted infection survival rate, each line contains only one family information, and the file after organization Read res <-fread ("... / gebv_and_sr_op_can.csv")
# Convert the family survival rate of the child's generation res $ SR_trans <-exp (res $ SR) / (1 + exp (res $ SR))
# Convert the survival rate after conversion to 0 and 1 according to the average value SR_binary <-matrix (NA, now = now (res), ncol = 1)
SR_binary [which (res $ SR_trans> mean (res $ SR_trans)),] <-1
SR_binary [which (res $ SR_trans <mean (res $ SR_trans)),] <-0
# Loc to calculate AUC value (SR_binary [, 1], res $ GEBV)
Through the calculations, GEBV of 16 flounder families is obtained, and the calculation shows that the AUC (accuracy) between each family GEBV and the corresponding infection survival rate is 0.794. Table 8 shows the GEBV and infection survival rate of each family.

Figure 2022518304000016
候補群における16の子の代家系を感染生存率に従って6の高生存率家系(平均的な生存率が62.4%である)と10の低生存率家系(平均的な生存率が33.47%である)の二大類(表8)に分け、高生存率と低生存率家系の成体のGEBVを比較し、高生存率家系成体の平均的なGEBVが2.10で、低生存率家系成体の平均的なGEBVが1.34であることがわかる。計算はよると、これらのヒラメ家系のGEBV値を使用してはその感染生存率が予測される正確性が0.794に達することができ、理論値に近い。そのため、発明者が設計される遺伝子チップはヒラメ耐病性性状の選別に効果的に応用することができる。
Figure 2022518304000016
16 offspring in the candidate group were infected according to survival rate, 6 high survival rate families (average survival rate is 62.4%) and 10 low survival rate families (average survival rate 33. It is divided into two major categories (Table 8) (47%), and the GEBV of adults with high survival rate and low survival rate is compared. The average GEBV of adult with high survival rate is 2.10, and the low survival rate. It can be seen that the average GEBV of an adult family is 1.34. According to the calculation, the accuracy of predicting the survival rate of infection can reach 0.794 using the GEBV values of these flounder families, which is close to the theoretical value. Therefore, the gene chip designed by the inventor can be effectively applied to the selection of flounder disease resistance properties.

上記の結果によると、「魚チップ1号」遺伝子チップを用いてヒラメ候補グループの個体に遺伝子分類を行い、加重GBLUPを用いてゲノム育種値(GEBV)を計算し、GEBV数値の大きさに従ってヒラメ耐病性親魚の選別を行い、これらの親魚を用いて育成される後代種苗の耐感染生存率が顕著に高まり、それにより「魚チップ1号」遺伝子チップがヒラメ耐病性優良品種の育成において普及応用を実行することができることがわかる。 According to the above results, the individual of the flounder candidate group was genetically classified using the "Fish Chip No. 1" gene chip, the genome breeding value (GEBV) was calculated using the weighted GBLUP, and the flounder was calculated according to the magnitude of the GEBV value. Disease-resistant parent fish are selected, and the survival rate of progeny seedlings bred using these parent fish is significantly increased. As a result, the "Fish Chip No. 1" gene chip is widely applied in the breeding of excellent flounder disease-resistant varieties. It turns out that you can do.

産業中の実施可能性
本発明より提供されるヒラメ耐病性に関連するSNPローカスの遺伝子チップはヒラメ耐病性個体の選別に利用可能であり、かつ実際の選択正確性は理論値に近いため、ヒラメ耐病性優良品種の選択正確性を高め、育種期間を短縮させ、これにより、ヒラメ耐病性優良品種の選別のための遺伝子チップ技術を提供し、魚類耐病性優良品種の選別のための遺伝子チップ育種の新しい道を切り開くことができる。
Industrial feasibility The gene chip of SNP locus related to flounder disease resistance provided by the present invention can be used for selection of flounder disease resistant individuals, and the actual selection accuracy is close to the theoretical value. Increased selection accuracy of excellent disease-resistant varieties and shortened breeding period, thereby providing gene chip technology for selection of excellent flounder disease-resistant varieties, and gene chip breeding for selection of excellent fish disease-resistant varieties. Can pave the way for new roads.

Claims (7)

ヒラメ耐病性に関連するSNPローカスであって、前記SNPローカスは、配列がSEQ NO:1-SEQ ID NO:48697である中のいずれか一つの配列の第36位の塩基であることを特徴とするヒラメ耐病性に関連するSNPローカス。 An SNP locus related to flounder disease resistance, wherein the SNP locus is a base at position 36 of any one of the sequences in which the sequence is SEQ NO: 1-SEQ ID NO: 48697. SNP locus related to flounder disease resistance. 請求項1に記載のヒラメ耐病性に関連するSNPローカスの、ヒラメ耐病性品種の選別育成への応用。 Application of the SNP locus related to flounder disease resistance according to claim 1 to the selection and breeding of flounder disease resistant varieties. 請求項1に記載のヒラメ耐病性に関連するSNPローカスの、ヒラメ耐病性に優れた品種を選別育成するための検測製品の製造への応用。 Application of the SNP locus related to the flounder disease resistance according to claim 1 to the manufacture of a inspection product for selecting and cultivating a variety having excellent flounder disease resistance. 前記検測製品は遺伝子チップであることを特徴とする請求項3に記載の応用。 The application according to claim 3, wherein the inspection product is a gene chip. ヒラメ耐病性優良品種の選別に用いられる遺伝子チップであって、前記遺伝子チップは請求項1に記載のヒラメ耐病性に関連するSNPローカスを検測できることを特徴とするヒラメ耐病性優良品種の選別に用いられる遺伝子チップ。 A gene chip used for selecting an excellent flounder disease-resistant variety, wherein the gene chip can detect SNP locus related to the flounder disease resistance according to claim 1, and is used for selecting an excellent flounder disease-resistant variety. The gene chip used. ヒラメ耐病性個体の選別方法であって、前記方法は請求項3に記載の遺伝子チップを用いて検測することを特徴とするヒラメ耐病性個体の選別方法。 A method for selecting a flounder disease-resistant individual, wherein the method is a method for selecting a flounder disease-resistant individual, which comprises performing a measurement using the gene chip according to claim 3. 請求項6に記載のヒラメ耐病性個体の選別方法であって、
前記方法は、
1)候補グループにおける個体ゲノムDNAを抽出し、かつ上記の遺伝子チップを利用して検測して、SNPマーカーの遺伝子型判定の結果を取得するステップと、
2)参考グループのSNP集合より遺伝子チップと同様であるSNPローカスの遺伝子型判定の結果を抽出し、さらに参考グループのSNPの遺伝子型判定の結果とチップを利用して取得した候補グループの遺伝子型判定の結果とを合併するステップ、
3)合併された遺伝子型と参考グループが表現した型とを利用し、加重GBLUP方法を用いて候補グループの推定育種値GEBVを推定し、さらにGEBV値に基づいて被検測個体の耐病性潜在能力を決定するステップと、
を含み、
かつ、参考グループの遺伝子型を利用し、加重最良線形不偏推定方法を用いて予測正確性を推定し、
その中、予測正確性の推定において5倍交差検証方法を利用し、特徴曲線の下の面積AUCを予測正確性を判定する指標とし、AUCが1に近いほど予測正確性は高い
ことを特徴とするヒラメ耐病性個体の選別方法。
The method for selecting a flounder disease-resistant individual according to claim 6.
The method is
1) A step of extracting individual genomic DNA in a candidate group and performing a test using the above gene chip to obtain the result of genotyping of an SNP marker.
2) The result of genotyping of SNP locus, which is similar to the gene chip, is extracted from the SNP set of the reference group, and the result of genotyping of SNP of the reference group and the genotype of the candidate group obtained using the chip. Steps to merge with the judgment result,
3) Using the combined genotype and the type expressed by the reference group, the estimated breeding value GEBV of the candidate group is estimated using the weighted GBLUP method, and the disease resistance potential of the test-tested individual is further estimated based on the GEBV value. Steps to determine ability and
Including
In addition, the genotype of the reference group is used to estimate the prediction accuracy using the weighted best linear bias estimation method.
Among them, the 5-fold cross-validation method is used to estimate the prediction accuracy, and the area AUC under the feature curve is used as an index to judge the prediction accuracy. The closer the AUC is to 1, the higher the prediction accuracy is. How to select flounder disease resistant individuals.
JP2020556756A 2019-12-17 2019-12-17 Disease resistance breeding gene chip of flounder and its application Active JP7158496B2 (en)

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/CN2019/125857 WO2021119980A1 (en) 2019-12-17 2019-12-17 Gene chip for disease resistance breeding of olive flounder and application thereof

Publications (2)

Publication Number Publication Date
JP2022518304A true JP2022518304A (en) 2022-03-15
JP7158496B2 JP7158496B2 (en) 2022-10-21

Family

ID=71001219

Family Applications (1)

Application Number Title Priority Date Filing Date
JP2020556756A Active JP7158496B2 (en) 2019-12-17 2019-12-17 Disease resistance breeding gene chip of flounder and its application

Country Status (3)

Country Link
JP (1) JP7158496B2 (en)
CN (1) CN111278994B (en)
WO (1) WO2021119980A1 (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR102602152B1 (en) * 2023-08-04 2023-11-13 제주대학교 산학협력단 A method for producing superior seeds using egg quality evaluation of artificially fertilized eggs of flounder mothers.

Families Citing this family (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114015789A (en) * 2021-12-06 2022-02-08 中国水产科学研究院黄海水产研究所 Genome selection method for cultivating disease-resistant improved Dongxiang spots
CN114410746B (en) * 2022-03-29 2022-07-12 中国海洋大学三亚海洋研究院 Dongxiang spot molecule source-tracing selection breeding method and application thereof
CN114990243B (en) * 2022-06-21 2023-06-09 中国水产科学研究院黄海水产研究所 Method for screening vibriosis-resistant cynoglossus semilaevis marker combination and cynoglossus semilaevis disease-resistant individual
CN115287340A (en) * 2022-08-12 2022-11-04 厦门大学 Breeding method of pseudosciaena crocea visceral white-spot disease-resistant excellent strain based on whole genome selection
CN115992265B (en) * 2023-03-22 2023-07-14 中山大学 Grouper whole genome liquid phase chip and application thereof

Family Cites Families (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101911918B (en) * 2010-03-18 2011-06-15 中国水产科学研究院黄海水产研究所 Selective breeding method for disease-resistant superior strains of paralichthys olivaceus
CN104304096B (en) * 2014-08-26 2016-01-27 中国水产科学研究院黄海水产研究所 The selection of the sick excellent strain of the anti-Edwardsiella tarda of a kind of lefteye flounder
CN104328116A (en) * 2014-11-06 2015-02-04 中国海洋大学 SNP (single-nucleotide polymorphism) locus related to heat resistance of Paralichthys olivaceus and application thereof
CN105624318B (en) * 2016-03-24 2019-05-03 中国水产科学研究院北戴河中心实验站 One kind SNP site relevant to lefteye flounder growth traits, its screening technique and application
CN105713974B (en) * 2016-03-24 2019-04-19 中国水产科学研究院北戴河中心实验站 One kind SNP marker relevant to lefteye flounder quantitative character, its screening technique and application
CN106480189B (en) * 2016-10-18 2018-11-09 中国水产科学研究院黄海水产研究所 A kind of disease-resistant prevalent variety cultivation method of fish based on full-length genome selection

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
FISH AND SHELLFISH IMMUNOLOGY, vol. 66, JPN6022013745, 2017, pages 43 - 49, ISSN: 0004745849 *
水産育種, vol. 46, no. 1, JPN6022013744, 2016, pages 1 - 14, ISSN: 0004745850 *

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR102602152B1 (en) * 2023-08-04 2023-11-13 제주대학교 산학협력단 A method for producing superior seeds using egg quality evaluation of artificially fertilized eggs of flounder mothers.

Also Published As

Publication number Publication date
JP7158496B2 (en) 2022-10-21
WO2021119980A1 (en) 2021-06-24
CN111278994B (en) 2023-02-07
CN111278994A (en) 2020-06-12

Similar Documents

Publication Publication Date Title
JP7158496B2 (en) Disease resistance breeding gene chip of flounder and its application
CN106480189B (en) A kind of disease-resistant prevalent variety cultivation method of fish based on full-length genome selection
CN111225986B (en) Chicken whole genome SNP chip and application thereof
Seidel Brief introduction to whole-genome selection in cattle using single nucleotide polymorphisms
CN107967409B (en) Pig whole genome low-density SNP chip and manufacturing method and application thereof
Naqvi Application of molecular genetic technologies in livestock production: potentials for developing countries
CN109837348B (en) Molecular marker associated with boar sperm concentration character
CN109554487B (en) Molecular marker associated with boar semen volume character and application
JP2024521656A (en) Genomic selection method for breeding disease-resistant varieties of Tosei-boku
CN110484636A (en) One kind molecular labeling relevant to the total teat number trait of pig and application
CN111254205A (en) Molecular marker influencing average daily gain of Jiazhi big white pig, screening method and application
CN118006736A (en) Mandarin liquid phase breeding chip and application thereof
Stolpovsky et al. Genomic Selection. I: Latest Trends and Possible Ways of Development
CN110079609A (en) A kind of molecular labeling and its application for identifying white diarrhea resistance chicken
CN117778588A (en) Preparation method and application of sheep 1K liquid phase chip based on targeted capturing sequencing
Lee et al. Analysis of single nucleotide polymorphisms related to heifer fertility in Hanwoo (Korean cattle)
CN117431324A (en) High-density SNP chip in whole genome of dairy cow and application thereof
Buaban et al. Genomic prediction of milk-production traits and somatic cell score using single-step genomic best linear unbiased predictor with random regression test-day model in Thai dairy cattle
CN111944913A (en) Cynoglossus semilaevis disease-resistant breeding gene chip and application thereof
Garrick et al. Emerging technologies for identifying superior dairy cows in New Zealand
Festa et al. Transcriptomic Prediction of Breeding Values in Loblolly Pine
CN118272546B (en) SNP marker g.reelin-936 related to propagation traits of palaemon carinicauda and application thereof
Venkataramanan Genetic evaluation of growth using random regression models
CN116356041A (en) Low-density dairy cow whole genome 30K SNP chip and application thereof
Falaknaz et al. Evaluation and comparison of SSR, ISSR, and retrotransposon molecular markers in the study of genetic diversity of Aegilops tauschii as wild ancestor of wheat

Legal Events

Date Code Title Description
A621 Written request for application examination

Free format text: JAPANESE INTERMEDIATE CODE: A621

Effective date: 20201103

A131 Notification of reasons for refusal

Free format text: JAPANESE INTERMEDIATE CODE: A131

Effective date: 20220407

A521 Request for written amendment filed

Free format text: JAPANESE INTERMEDIATE CODE: A523

Effective date: 20220701

TRDD Decision of grant or rejection written
A01 Written decision to grant a patent or to grant a registration (utility model)

Free format text: JAPANESE INTERMEDIATE CODE: A01

Effective date: 20221004

A61 First payment of annual fees (during grant procedure)

Free format text: JAPANESE INTERMEDIATE CODE: A61

Effective date: 20221011

R150 Certificate of patent or registration of utility model

Ref document number: 7158496

Country of ref document: JP

Free format text: JAPANESE INTERMEDIATE CODE: R150