JP7158496B2 - Disease resistance breeding gene chip of flounder and its application - Google Patents

Disease resistance breeding gene chip of flounder and its application Download PDF

Info

Publication number
JP7158496B2
JP7158496B2 JP2020556756A JP2020556756A JP7158496B2 JP 7158496 B2 JP7158496 B2 JP 7158496B2 JP 2020556756 A JP2020556756 A JP 2020556756A JP 2020556756 A JP2020556756 A JP 2020556756A JP 7158496 B2 JP7158496 B2 JP 7158496B2
Authority
JP
Japan
Prior art keywords
chip
flounder
sep
snp
disease
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
JP2020556756A
Other languages
Japanese (ja)
Other versions
JP2022518304A (en
Inventor
松林 陳
茜 周
昇 盧
亜東 陳
洋 劉
文騰 徐
仰真 李
磊 王
英明 楊
娜 王
希紅 李
Original Assignee
中国水産科学研究院黄海水産研究所
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 中国水産科学研究院黄海水産研究所 filed Critical 中国水産科学研究院黄海水産研究所
Publication of JP2022518304A publication Critical patent/JP2022518304A/en
Application granted granted Critical
Publication of JP7158496B2 publication Critical patent/JP7158496B2/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/68Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
    • C12Q1/6876Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes
    • C12Q1/6888Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes for detection or identification of organisms
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/68Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
    • C12Q1/6813Hybridisation assays
    • C12Q1/6834Enzymatic or biochemical coupling of nucleic acids to a solid phase
    • C12Q1/6837Enzymatic or biochemical coupling of nucleic acids to a solid phase using probe arrays or probe chips
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q2600/00Oligonucleotides characterized by their use
    • C12Q2600/124Animal traits, i.e. production traits, including athletic performance or the like
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q2600/00Oligonucleotides characterized by their use
    • C12Q2600/156Polymorphic or mutational markers
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02ATECHNOLOGIES FOR ADAPTATION TO CLIMATE CHANGE
    • Y02A40/00Adaptation technologies in agriculture, forestry, livestock or agroalimentary production
    • Y02A40/80Adaptation technologies in agriculture, forestry, livestock or agroalimentary production in fisheries management
    • Y02A40/81Aquaculture, e.g. of fish

Landscapes

  • Chemical & Material Sciences (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Organic Chemistry (AREA)
  • Proteomics, Peptides & Aminoacids (AREA)
  • Analytical Chemistry (AREA)
  • Zoology (AREA)
  • Wood Science & Technology (AREA)
  • Health & Medical Sciences (AREA)
  • Engineering & Computer Science (AREA)
  • Microbiology (AREA)
  • Immunology (AREA)
  • Molecular Biology (AREA)
  • Biotechnology (AREA)
  • Biophysics (AREA)
  • Physics & Mathematics (AREA)
  • Biochemistry (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Health & Medical Sciences (AREA)
  • Genetics & Genomics (AREA)
  • Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)

Description

本発明は水産遺伝育種の技術分野に属し、具体的にヒラメ耐病性優良品種選別に用いられる遺伝子チップの製造方法および応用に関する。 TECHNICAL FIELD The present invention belongs to the technical field of aquatic genetic breeding, and specifically relates to a method and application of a gene chip used for selection of excellent disease-resistant flounder varieties.

水産養殖業は中国食品の重要なソースであり、魚類養殖業はまた水産養殖業の基幹産業でもあり、2015年に魚類養殖の生産量は28457万トンで、水産養殖生産量全体の57.6%である。養殖魚類はすでに中国タンパク質の重要な由来となった。 Aquaculture is an important source of Chinese food, and fish farming is also the key industry of aquaculture. %. Farmed fish has already become an important source of Chinese protein.

しかし、魚類養殖業の急速な発展につれて、優良な品種が欠乏し、養殖種類の品質が劣化する。養殖規模が拡大し、集約化水準の向上および養殖環境の悪化は水産養殖病害の頻繁な発生を引き起こし、養殖製品の薬物残留の深刻化等問題も魚類養殖業の持続可能な発展を深刻に制約する。魚類のみにとって、高密度な養殖で形成した免疫抑制のため、養殖魚類の耐病性の低下を引き起こす。魚類の免疫耐病性メカニズムおよび耐病性の分子遺伝に対する研究はいまだに進まないため、分子レベルで魚類病害の予防案を提出しにくい。さらに、耐病性機能遺伝子と耐病性分子マーカーが欠乏し、耐病性優良品種の育成を行いにくく、そのため現在養殖生産は耐病性が低下する野生または人工繁殖した多世代の種苗のみに依存し、流行病が魚類養殖における頻繁な発生を引き起こす。不完全な統計によると、中国における魚類養殖業では毎年病害による直接的経済損失は100億人民元にも達する。病害はすでに中国における魚類養殖業の持続可能な発展を制約するボトルネックとなった。 However, with the rapid development of fish farming industry, there is a shortage of good breeds and the quality of farmed breeds is deteriorating. As the scale of aquaculture expands, the level of intensification increases, and the aquaculture environment deteriorates, aquaculture diseases will occur more frequently, and problems such as increased drug residues in aquaculture products will seriously constrain the sustainable development of the fish farming industry. do. For fish only, the immunosuppression formed by high-density aquaculture causes a decrease in disease resistance in farmed fish. Research on immune disease resistance mechanisms and molecular inheritance of disease resistance in fish is still incomplete, so it is difficult to propose preventive measures against fish diseases at the molecular level. Furthermore, disease resistance functional genes and disease resistance molecular markers are deficient, making it difficult to breed excellent disease-resistant varieties. The disease causes frequent outbreaks in fish farming. According to incomplete statistics, the direct economic loss from disease in China's fish farming industry amounts to RMB 10 billion every year. Diseases have already become a bottleneck constraining the sustainable development of fish farming industry in China.

ヒラメは世界的な海水養殖魚類で、中国における海水養殖の主導的な魚類の一つでもある。但し、ヒラメ養殖業においても病害が頻発し、死亡率が高いという問題が存在する。ヒラメ等養殖魚類を脅かす主要な病害は細菌性疾患・ウイルス性疾患を含む。その中危害が大きい疾患はそれぞれエドワジエラ症・ビブリオ症とリンホシスチス病が挙げられる。抗生物質系薬物またはワクチン等疾患予防措置は一定の効果があるが、水産養殖における病害問題を根本的に解決できない。かつ抗生物質系薬物に魚体内に蓄積しやすく、養殖魚類の商品品質を低下させ、消費者の健康に潜在的な危害を有し、病原菌に薬剤耐性および養殖環境を深刻に汚染するという問題を引き起こしやすいため、水産養殖業における応用はますます制約される。同時に、抗生物質の使用はまた人々日増しに伸びる無残留の無公害水産品の需要を満たすことができない。そのため、魚類耐病性優良品種の選別は中国の水産領域において解決が急がれる重大な課題の一つである。 Flounder is a global saltwater farmed fish, and is also one of the leading fish in saltwater farming in China. However, even in the flounder farming industry, there is a problem that disease damage occurs frequently and the mortality rate is high. Major diseases that threaten cultured fish such as flounder include bacterial and viral diseases. Among them, edwardieriasis, vibriopathy and lymphocystis disease are the most dangerous diseases, respectively. Disease prevention measures such as antibiotics and vaccines have certain effects, but they cannot fundamentally solve the problem of diseases in aquaculture. In addition, antibiotics tend to accumulate in fish bodies, degrade the product quality of farmed fish, have potential hazards to consumer health, cause drug resistance to pathogenic bacteria, and seriously pollute the farming environment. Its susceptibility to triggering increasingly limits its application in the aquaculture industry. At the same time, the use of antibiotics also cannot meet people's ever-growing demand for residue-free and pollution-free aquatic products. Therefore, the selection of fish varieties with excellent disease resistance is one of the serious problems that need to be solved urgently in China's aquaculture industry.

今まで、魚類優良品種の選別は主に表現型性状の選別に基づき、グループ選別・家系選別・交雑選別とBLUP選別等を含み、主に体長・体重等測定しやすい表現型値に基づいて算出される育種値に基づいて選択を行う。分子マーカーができた後、重要な経済性状に関連する分子マーカーを特定することにより経済性状に選択を行い、従来の分子マーカー補助選別に用いられる分子マーカーの数量が非常に限られ、単一遺伝子性状または品質性状の選択効果に優れたが、多重遺伝子が決定される数量性状に対する選択効果がかんばしくない。耐病性性状は複数の遺伝子が制御される数量性状であり、直接測定しにくく、選択正確性が非常に低く、そのため耐病性優良品種に対する選別は長い間伸び悩み、魚類耐病性新品種の育成を制約し、新たしい育種技術でこの課題を解決する必要が急がれる。 Until now, the selection of superior fish varieties has been based mainly on the selection of phenotypic characteristics, including group selection, family selection, cross selection, BLUP selection, etc., and is mainly calculated based on phenotypic values that are easy to measure, such as body length and weight. The selection is made based on the breeding value given. After the molecular markers are available, selection can be made economically by identifying the molecular markers associated with important economic properties. Although the selection effect of traits or quality traits was excellent, the selection effect of quantity traits determined by multiple genes was not good. Disease resistance traits are quantitative traits controlled by multiple genes, are difficult to measure directly, and have very low selection accuracy. However, there is an urgent need to solve this problem with new breeding techniques.

遺伝子チップは、DNAチップ・DNAマイクロアレイともいい、フォトエッチング技術を用い、シリコンウェハを固相担体とし、選択最適化を経た大量のDNA配列をオリゴヌクレオチドに合成し、特殊な処理を経たスライドにつけ、変性・固定を経た後にDNAマイクロアレイを形成する。核酸分子交雑技術に基づき、遺伝子チップは数万ひいては数十万のDNAフラグメントに並行化交雑と解析を同時に実行でき、高スループット・並行性・高効率・サンプルの数量が少ないという利点を有する。現在、遺伝子チップはすでに人類の疾患・腫瘍の診断と、動植物の遺伝学的解析および遺伝育種に広範に応用される。動物育種において、遺伝子チップはすでに牧畜、特に乳牛・豚等種の優良品種の選別に応用されたことに成功した。例えば、乳牛において、すでにBovine3Kchip・Bovine25K・SNPchip・BovineHD700K・BovinLD7K等多くの遺伝子チップが相次いで開発される。今まで、北米・欧州・オーストラリア等はBovineSNP50Beadchipチップ(54KSNP)を用いてゲノムSNPマーカー分類検測の汎用プラットフォームとして、かつ大規模参考グループが取得したSNP分類結果に基づき、産乳量・繁殖力・耐病性等多種の経済性状のフルゲノム関連解析を行い、乳牛ゲノムの選択体系を構築する。ゲノム選択を通して新生雄牛の初期選択を実現させ、乳牛育種の世代間隔を短縮させ、遺伝的進歩を促進させ、種雄牛の選択効率を大いに向上させ、養殖と育種コストを顕著に節約する。 A gene chip, also known as a DNA chip or a DNA microarray, uses photoetching technology, uses a silicon wafer as a solid phase carrier, synthesizes a large amount of DNA sequences through selective optimization into oligonucleotides, attaches them to slides that have undergone special processing, A DNA microarray is formed after denaturation and fixation. Based on nucleic acid hybridization technology, the gene chip can perform parallel hybridization and analysis on tens of thousands and even hundreds of thousands of DNA fragments at the same time, and has the advantages of high throughput, parallelism, high efficiency and small sample quantity. At present, gene chips have been widely applied to the diagnosis of human diseases and tumors, and the genetic analysis and genetic breeding of animals and plants. In animal breeding, gene chips have already been successfully applied to the selection of excellent breeds, especially dairy cows and pigs. For example, for dairy cows, many gene chips such as Bovine3Kchip, Bovine25K, SNPchip, BovineHD700K, and BovinLD7K have been developed one after another. Until now, North America, Europe, Australia, etc. have been using the BovineSNP50Beadchip chip (54KSNP) as a general-purpose platform for genomic SNP marker classification and testing, and based on the SNP classification results obtained by a large-scale reference group, milk production, fertility, and Perform full genome association analysis of various economic properties such as disease resistance, and construct a selection system for dairy cow genomes. Through genome selection, the early selection of newborn bulls can be achieved, the generation interval of dairy cattle breeding can be shortened, genetic progress can be promoted, the selection efficiency of sires can be greatly improved, and breeding and breeding costs can be significantly reduced.

但し、水産養殖動物に、今まで育種用遺伝子チップ、特に耐病性育種遺伝子チップの報道がいまだに見られない。 However, there have been no reports of breeding gene chips, especially disease-resistant breeding gene chips, for aquaculture animals.

本発明はヒラメの耐病性優良品種の選別に用いられる遺伝子チップを提供し、魚類優良品種の育成において遺伝子チップが欠乏するという問題を解決し、従来育種技術の欠点を補完し、魚類の耐病性と高生産性を兼ね備える優れた優良品種育成のために、新たな分子育種方法を提供し、魚類育種技術の世代交代を実現させ、魚類育種産業の高速な発展を推進することを目的とする。 The present invention provides a gene chip that is used to select flounder with excellent disease resistance, solves the problem of lack of gene chips in the breeding of excellent fish breeds, makes up for the shortcomings of conventional breeding techniques, and improves fish disease resistance. The aim is to provide a new molecular breeding method, realize a generational change in fish breeding technology, and promote the rapid development of the fish breeding industry in order to develop excellent breeds that combine both high productivity and high productivity.

本発明はまずヒラメ耐病性に関連するSNPローカスを提供し、前記SNPローカスは配列がSEQNO:1SEQIDNO:48697である中のいずれか一つの配列の第36位の塩基である。 The present invention first provides a SNP locus associated with flounder disease resistance, wherein the SNP locus is the 36th base of any one of the sequences SEQ NO:1 SEQ ID NO:48697.

本発明のSNPローカスはヒラメ耐病性優良品種の選別に利用できる。 The SNP locus of the present invention can be used for selection of flounder with excellent disease resistance.

本発明が提供されるSNPローカスはヒラメ耐病性優良品種の選別用の検測製品の製造にも用いられることができる。 The SNP locus to which the present invention is provided can also be used to produce inspection products for selection of excellent disease-resistant flounder varieties.

前記検測製品は、好ましくは遺伝子チップである。 Said test product is preferably a gene chip.

本発明のさらなる方面は、ヒラメ耐病性優良品種の選別に用いられる遺伝子チップを提供し、それはヒラメ耐病性に関連するSNPローカスを検測することができる。 A further aspect of the present invention provides a gene chip used for selection of flounder disease-resistant superior cultivars, which can detect SNP locus associated with flounder disease resistance.

本発明のさらなる方面はヒラメ耐病性個体の選別方法を提供し、この方法は、上記の遺伝子チップを用いて実行することができる。 A further aspect of the invention provides a method for selecting disease-resistant flounder individuals, which method can be performed using the gene chip described above.

前記方法は、以下のステップを含む:
1)候補グループにおける個体ゲノムDNAを抽出し、かつ上記の遺伝子チップを利用して検測してSNPマーカーの遺伝子型判定の結果を取得する。
2)参考グループにおけるSNPグループより遺伝子チップと同様であるSNPローカスの遺伝子型判定の結果を抽出し、さらに参考グループのSNPの遺伝子型判定の結果と候補グループからチップを利用して取得した遺伝子型判定の結果とを合併する。
3)合併された遺伝子型と参考グループが表現する型とを利用し、加重GBLUP方法を用いて候補グループの推定育種値(GEBV)を推定し、さらにGEBV値に基づいて被検測個体の耐病性潜在能力を決定する。
The method includes the following steps:
1) Extract the genomic DNA of individuals in the candidate group, and use the above-mentioned gene chip to detect and obtain genotyping results for the SNP markers.
2) From the SNP group in the reference group, the results of genotyping of SNP loci similar to the gene chip are extracted, and the results of genotyping of the SNPs in the reference group and genotypes obtained from the candidate group using the chip Merge with the judgment result.
3) Utilizing the combined genotype and the type represented by the reference group to estimate the estimated breeding value (GEBV) of the candidate group using the weighted GBLUP method, and based on the GEBV value the disease resistance of the tested individuals. Determines sexual potential.

参考グループの遺伝子型を利用し、加重最良線形不偏推定量(加重GBLUP)を用いて予測正確性を推定する。そのうち、5倍交差検証方法を予測正確性の判定方法に利用し、特徴曲線下面積(AUC)を予測正確性の判定指標とある。AUCは1に近いほど、予測正確性は高い。 The genotype of the reference group is used to estimate prediction accuracy using the weighted best linear unbiased estimator (weighted GBLUP). Among them, the 5-fold cross-validation method is used as the judgment method of prediction accuracy, and the area under the feature curve (AUC) is used as the judgment index of prediction accuracy. The closer the AUC is to 1, the higher the prediction accuracy.

本発明が提供されるヒラメ耐病性に関連するSNPローカスの遺伝子チップはヒラメ耐病性個体の選別に利用することが可能であり、かつ実際の選択正確性が理論値に近いため、ヒラメ耐病性優良品種の選択正確性が向上し、育種期間を短縮することができる。これにより、ヒラメ耐病性優良品種の選別のために遺伝子チップ技術を提供し、魚類耐病性に優れた品種を選別するための、遺伝子チップによる育種という新しい道を切り開いた。 The gene chip of SNP locus associated with flounder disease resistance provided by the present invention can be used for selection of flounder disease-resistant individuals, and the actual selection accuracy is close to the theoretical value. The selection accuracy of varieties can be improved and the breeding period can be shortened. As a result, we have provided gene chip technology for the selection of flounder cultivars with excellent disease resistance, and have opened up a new way of breeding using gene chips to select cultivars with excellent disease resistance in fish.

本発明はヒラメ耐病性優良品種育成の遺伝子チップの製造と応用方法を確立し、ヒラメ等魚類の耐病性優良品種の育成のために新たな分子育種の技術手段を提供することを目的とする。 The purpose of the present invention is to establish a method for producing and applying a gene chip for breeding flounder with excellent disease resistance, and to provide new molecular breeding technical means for breeding flounder and other fish with excellent disease resistance.

次に本発明が関する専門用語に対する説明は以下のとおりである:
SNP:SingleNucleotidePolymorphismの略語で、すなわち一塩基多型で、ゲノムレベルにおいてヌクレオチド単体の変異により引き起こされるDNA配列の多型である。
Next, explanations for the terminology with which the present invention relates are as follows:
SNP: An abbreviation for Single Nucleotide Polymorphism, that is, a single nucleotide polymorphism, which is a DNA sequence polymorphism caused by a single nucleotide mutation at the genome level.

遺伝子チップ:微細加工技術を通し、数万乃至百万特定のDNA配列フラグメントを、シリコンウエハー・スライド等支持物に規則的に配置して固定し、構成される二次元DNAプローブアレーで、遺伝物質(DNA等)に遺伝子分類および分子検測を行うことができる。 Gene chip: A two-dimensional DNA probe array consisting of tens of thousands to millions of specific DNA sequence fragments that are regularly arranged and fixed on a support such as a silicon wafer or slide through microfabrication technology. (DNA, etc.) can be subjected to genetic typing and molecular testing.

縮退塩基:コドンの縮退性に基づき、一つの記号をよく用いて不特定の二つまたはそれ以上の塩基を代替する。 Degenerate bases: Based on the degeneracy of codons, one symbol is often used to replace two or more unspecified bases.

例えばRはA/Gを示し、YはC/Tを示し、MはA/Cを示し、KはG/Tを示し、SはG/Cを示し、WはA/Tを示すなど。 For example, R indicates A/G, Y indicates C/T, M indicates A/C, K indicates G/T, S indicates G/C, W indicates A/T, and so on.

参考グループ:ゲノムの選択において、人為感染等検測を通して取得した表現型データを有するグループは、通常表現型性状を有する大型グループから選別されるグループ全体を代表する表現型分布で、かつゲノム再配列を行い、遺伝子型データを取得し、実際のゲノム選択計算を行う個体の集合である。 Reference group: In genome selection, the group with phenotypic data acquired through testing such as artificial infection is selected from a large group with normal phenotypic characteristics. It is a set of individuals who perform the genotype data, and perform the actual genome selection calculation.

候補グループ:ゲノムの選択において、候補グループとはゲノム再配列を通し、遺伝子型データを取得したが、表現型データがないグループで、該グループは育種の潜在力を有し、後続の実際優良品種育成作業に用いる予定がある個体の集合である。 Candidate group: in genome selection, the candidate group is the group that has obtained genotype data through genome rearrangement but has no phenotypic data, and has the potential for breeding, and the subsequent actual superior varieties. This is a set of individuals scheduled to be used for breeding work.

GBLUP:GenomicBestLinearUnbiasedPredictionの略語で、すなわちゲノム最良線形不偏推定量で、ゲノムにおける高密度な分子マーカーを用いて個体間の成因関係(Gマトリックス)を利用し、ゲノム育種値の推定する方法である。 GBLUP: An abbreviation for GenomicBestLinearUnbiasedPrediction, that is, a genome best linear unbiased estimator, which is a method for estimating genome breeding value by using high-density molecular markers in the genome and utilizing genetic relationships (G matrix) between individuals.

GEBV:GenomicEstimatedBreedingValuesの略語で、すなわちゲノム推定育種価で、フルゲノムにおけるすべてのマーカーまたはハプロタイプの効果推定を加算して取得する。 GEBV: Abbreviation for Genomic Estimated Breeding Values, ie Genomic Estimated Breeding Values, obtained by adding up the effect estimates of all markers or haplotypes in the full genome.

次に実施例を組み合わせて本発明に詳細な記述を行う。 Next, the present invention will be described in detail by combining examples.

実施例1:「魚チップ1号」遺伝子チップSNPローカスの選別およびチップ製造
1、ヒラメ耐エドワージエラタルダ参考グループの作成および表現型性状の測定
ヒラメゲノムが選定される参考グループと候補グループの個体はいずれも本課題チームが2003年以来作成したヒラメ家系より由来され、多年の育成過程において、次第に韓国・日本および中国のヒラメグループの急速な成長、耐病性や耐逆性等優良な性状より由来される。
Example 1: Selection of "Fish Chip No. 1" Gene Chip SNP Locus and Chip Production 1. Creation of flounder-resistant Edwardsella tarda reference group and measurement of phenotypic characteristics Individuals of reference group and candidate group for selection of flounder genome All of them are derived from the flounder family created by this project team since 2003, and in the process of breeding for many years, the rapid growth of the flounder group in Korea, Japan and China, and excellent properties such as disease resistance and reversal resistance. be done.

特に2013年から、ヒラメ養殖業におけるエドワージエラタルダが日増しに深刻化する情勢に対し、ヒラメ耐エドワージエラタルダ家系選別の研究を実施する。 In particular, from 2013, in response to the situation where Edwardsiella tarda in the flounder farming industry is becoming more serious day by day, we will conduct research on flounder-resistant Edwardsiella tarda family selection.

2013-2015年、当年に作成したヒラメ家系に腹腔接種人為感染エドワージエラタルダ検測を連続的に実施し、感染した検測魚苗に鰭棘を収集し、生長と耐病性表現型を測定し、2013年・2014年と2015年にサンプル4577匹・5942匹と6919匹を採取し、ヒラメエドワージエラタルダゲノムが選択される参考グループを選択して作成するために用いられるもの。 From 2013 to 2015, peritoneal inoculation artificial infection Edwardsiella tarda inspection was continuously carried out in flounder families created in the current year, fin spines were collected from the infected inspection fish seedlings, and growth and disease resistance phenotypes were measured. , 4577, 5942 and 6919 samples taken in 2013, 2014 and 2015 and used to select and generate a reference group from which the flounder Edwardian la tarda genome was selected.

感染検測のサンプルより、96家系(2013年に32、2014年に10、2015年に48)を選定し、各家系は死亡率に従って等比率の死亡と生存した個体10-15を選定し、ゲノムが選択される参考グループを組成し、選定した個体の感染検測の結果(死亡または生存)を参考グループの表現型性状(表2)とする。

Figure 0007158496000001
From the infection test samples, 96 families (32 in 2013, 10 in 2014, 48 in 2015) were selected, and each family selected 10-15 individuals who died and survived in an equal proportion according to mortality. , a reference group from which the genome is selected is formed, and the results of infection detection (death or survival) of the selected individuals are used as the phenotypic characteristics of the reference group (Table 2).
Figure 0007158496000001

2、ヒラメフルゲノム再配列およびSNPローカス評価
ヒラメ参考グループはDNA抽出・検測を経た後、利用可能な個体を計931有する(表3)。

Figure 0007158496000002
2. Flounder full genome rearrangement and SNP locus evaluation The flounder reference group has a total of 931 individuals available after undergoing DNA extraction and testing (Table 3).
Figure 0007158496000002

931の参考グループの個体のゲノムDNAを抽出し、DNA検測が合格した後、次世代配列ライブラリを作成し、ライブラリ作成類型は両端DNAライブラリ(挿入フラグメント350bp)で、IlluminaHiseqX10配列プラットフォームを用いて配列とデータエクスポートを完成し、品質管理で取得された平均的なデータ量は2G/個体である。本課題チームが提供されるヒラメゲノム配列(GenBankID:PRJNA73673)を参考ゲノムとして、BWA(http://bio-bwa.sourceforge.net/)ソフトウェアを用いて配列比較を行い、その後Samtools(http://www.htslib.org/)ソフトウェアを用いてSNP予測と評価を行い、42.2MのSNP集合を取得する。 The genomic DNA of 931 reference group individuals was extracted, and after passing the DNA test, the next generation sequence library was constructed. and the average amount of data acquired in quality control is 2G/individual. Using the flounder genome sequence (GenBankID: PRJNA73673) provided by this project team as a reference genome, sequence comparison was performed using BWA (http://bio-bwa.sourceforge.net/) software, and then Samtools (http:/ /www.htslib.org/) software is used for SNP prediction and evaluation to obtain a SNP set of 42.2M.

3、SNPローカス評価と選別
(1)ステップ2で取得した42.2MのヒラメSNPマーカーに対し、以下の規格に選別を行う:
見逃し率>0.1、最小対立遺伝子頻度(MAF)<0.05のサイトを削除し、繰り返し配列または反復配列におけるサイトを削除し、ハドウィンバランスに適合しないサイトを削除し、3.4MのヒラメSNPマーカーを取得する。
3. SNP locus evaluation and selection (1) Select the 42.2M flounder SNP markers obtained in step 2 according to the following standards:
Deletion of sites with a miss rate >0.1, minimum allele frequency (MAF) <0.05, deletion of sites at or in repeats, deletion of sites that do not fit the Hadwin balance, deletion of 3.4M Acquire flounder SNP markers.

(2)ステップ(1)で選別した3.4M分子マーカーに対し、SNPローカス効果値解析と推定育種値計算を実施し、
ヒラメゲノム選択計算はBayesCπアルゴリズムを用い、解析モデル等式:


Figure 0007158496000003
モデルにおいて、yは表現型値で、uはグループ平均値で、グループqiはマーカー効果が正規分布qi~N(0、
Figure 0007158496000004
)で、mはマーカーの総数で、Xはqiに対応する関連マトリックスで、eは残差である。 (2) Perform SNP locus effect value analysis and estimated breeding value calculation for the 3.4M molecular markers selected in step (1),
Flounder genome selection calculation uses the Bayes C π algorithm, analysis model equation:


Figure 0007158496000003
In the model, y is the phenotypic value, u is the group mean value, and group q is the marker effect normally distributed q ~ N(0,
Figure 0007158496000004
), m is the total number of markers, X is the association matrix corresponding to qi, and e is the residual.

R言語パックが提供されるBayesCπアルゴリズムを用い、組み合わせ済みの遺伝子型データgenotype.csvと表現型データphonetype.csvを組み合わせ、フルゲノム再配列の参考グループの計931のヒラメ個体にゲノム選択計算を行う。その後取得したSNPローカス効果値を最大から最小への並べ替え、推定育種値<10-5のサイトを削除し、計864229のSNPローカスを取得して遺伝子チップSNPローカスの選定に用いられる。 Using the BayesCπ algorithm provided with the R language pack, the combined genotype data genotype. csv and phenotype data phonetype. csv and perform genome selection calculations on a total of 931 flounder individuals in the reference group for full genome rearrangement. After that, the obtained SNP locus effect values are rearranged from the largest to the smallest, sites with estimated breeding values <10-5 are deleted, and a total of 864,229 SNP loci are obtained and used for selection of gene chip SNP locus.

(3)さらにAffymetrixAxiom遺伝子分類プローブ設計生物解析プロセスを用いてステップ(2)で選別したSNPにそれぞれプローブ設計と評価を行い、プローブ変換可能性評価点数<0.6のサイトを削除する。また、SNPがゲノム全体をカバーしかつ均等に分布され、SNPのフランキング配列35bp内に他のSNPが存在せず、SNPのフランキング配列35bpのGC含有量が30-70%で、最終的に48697のヒラメSNPマーカーを選別してチップ製造に用いられることを保証し、前記48697のSNP分子マーカーの配列記録は配列リストにある。
米国ThermoFisher社製AffymetrixAxiomチップの生産技術を用いてヒラメSNPチップ(遺伝子チップ)を製造し、延べ48697のヒラメSNPローカスを含み、各チップは24のサンプルを同時に検測することができる。
(3) Further probe design and evaluation are performed for each of the SNPs selected in step (2) using the Affymetrix Axiom genetic classification probe design bioanalysis process, and sites with probe conversion feasibility evaluation scores <0.6 are deleted. In addition, the SNPs covered and evenly distributed throughout the genome, there were no other SNPs within the 35 bp flanking sequence of the SNP, the GC content of the 35 bp flanking sequence of the SNP was 30-70%, and the final 48,697 flounder SNP markers were screened to ensure their use in chip manufacturing, and the sequence records of said 48,697 SNP molecular markers are in the sequence listing.
A flounder SNP chip (gene chip) was manufactured using Affymetrix Axiom chip production technology manufactured by ThermoFisher, USA, containing a total of 48,697 flounder SNP loci, and each chip can detect 24 samples simultaneously.

実施例2、「魚チップ1号」遺伝子チップの使用方法
1、チップ検測用サンプルの製造と交雑
少量のヒラメ鰭棘(米の粒ほどの大きさ)を採取し、DNA抽出キット(中国・天根)を利用して鰭棘ゲノムDNAを抽出し、1%のアガロースゲル電気泳動と核酸分光光度計を利用してDNA品質と濃度を検測し、最終的な合格サンプルの規格は以下のとおりである:電気泳動はDNAに単一のストリップが生じることを求め、フラグメント長さが10kbより大きく、完全性に優れ、生分解は生じず、サンプル品質のDNA検測結果。紫外分光光度計を用いてA260/280を検測する:1.8-2.0、A260/230>1.5、濃度が20ng/μlより低くなく、総量が4μgより小さくない。その後米国ThermoFisher社製SNPチップ検測サンプルの製造標準操作プロセスに従って(https://www.thermofisher.com/)チップ検測用サンプルの製造を行う。1.4ug以上の高品質なDNAテンプレートを2ml*96の深穴板に加え、変性剤を加えて常温変性を行い、変性10min後に変性を停止し、単鎖DNAを取得する。チップサイト増幅用48697ペアプライマーと等温増幅酵素・dNTP等を深穴板に加え、深穴板を封止し、37℃で等温増幅を行う。24h増幅した後、増幅産物を断片化し、等体積のイソプロパノールを加え、-20℃の冷蔵庫において沈殿する。24h沈殿した後、4℃、3000gを用いて遠心してDNA産物の沈殿物を取得し、37℃で残りのイソプロパノールを除去し、沈殿物を溶解させ、交雑液を取得し、交雑液は5%のゲル電気泳動の検測結果の質量を用い、増幅産物品質の検測結果は、ストリップが明晰で、輝度が高い。交雑液は温度制御増幅装置を用いて交雑を行い、条件が95℃10min、48℃3minで、その後交雑液を持続的に48℃に維持する。チップブロックを交雑液に浸漬し、48℃の交雑炉において24h交雑させ、その後溶出、蛍光タンパク質を接続し、蛍光タンパク質を固定させ、交雑プローブ、蛍光信号をスキャンすることにより、各信号点は一つのプローブ交雑の結果で、各サイトの交雑結果を取得した後、チップスキャンの結果はAxiomAnalysisSuite(AxAS)ソフトウェア(米国ThermoFisher社製)を用いて解析を行う。
Example 2, How to use "Fish Chip No. 1" gene chip 1. Manufacture and crossbreeding of samples for chip detection The fin spine genomic DNA was extracted using 1% agarose gel electrophoresis and nucleic acid spectrophotometer, and the final acceptable sample specifications were as follows: As follows: electrophoresis sought to produce a single strip of DNA, fragment length greater than 10 kb, excellent integrity, no biodegradation, sample quality DNA detection results. A260/280 is measured using UV spectrophotometer: 1.8-2.0, A260/230>1.5, concentration not lower than 20 ng/μl, total amount not lower than 4 μg. After that, according to the manufacturing standard operating process for SNP chip test samples manufactured by ThermoFisher, USA (https://www.thermofisher.com/), samples for chip test are manufactured. 1.4 ug or more of high-quality DNA template is added to a 2 ml*96 deep-well plate, a denaturant is added, denaturation is performed at room temperature, denaturation is stopped after denaturation for 10 minutes, and single-stranded DNA is obtained. 48697 pair primers for chip site amplification, isothermal amplification enzymes, dNTPs, etc. are added to the deep hole plate, the deep hole plate is sealed, and isothermal amplification is performed at 37°C. After 24 h of amplification, the amplified products are fragmented, added with an equal volume of isopropanol and precipitated in a refrigerator at -20°C. After precipitation for 24 h, centrifuge at 3000 g at 4° C. to obtain a precipitate of DNA product, remove the remaining isopropanol at 37° C., dissolve the precipitate, obtain a hybrid fluid, and the hybrid fluid is 5%. Using the mass of the gel electrophoresis detection results, the amplification product quality detection results show that the strips are clear and bright. The hybrid was hybridized using a temperature-controlled amplifier under the conditions of 95°C for 10 min and 48°C for 3 min, after which the hybrid was maintained at 48°C continuously. The chip block was immersed in a hybridization solution, hybridized for 24 hours in a hybridization oven at 48°C, then eluted, fluorescent proteins were connected, the fluorescent proteins were immobilized, hybridization probes were scanned, and each signal point was scanned. After obtaining the hybridization results for each site from the results of one probe hybridization, the chip scan results are analyzed using Axiom Analysis Suite (AxAS) software (manufactured by ThermoFisher, USA).

2、チップ検測と遺伝子分類データ解析
(1)被検測グループサンプルの採取とDNA抽出
再配列済みの一部ヒラメ個体DNAを選択し、前記チップを用いて検測を実施し、チップ遺伝子分類の正確性および再配列と遺伝子チップ分類を用いて取得した遺伝子型データにゲノム選択計算を実施する再現性に検測を行う。その後ヒラメ育種の選択過程に、家系が用いられる候補成体の個体を確立し、ゲノムDNAの抽出を行いかつチップで検測を実施し、遺伝子チップがヒラメゲノム選択育種における応用効果を検証する。採用個体の情報は表4による。

Figure 0007158496000005
Figure 0007158496000006
Figure 0007158496000007
2. Chip inspection and gene classification data analysis (1) Collection of samples of the group to be tested and DNA extraction A part of rearranged flounder individual DNA is selected, the chip is used for inspection, and chip gene classification is performed. The accuracy and reproducibility of performing genome selection calculations on genotypic data obtained using rearrangement and gene chip classification are tested. Then, in the selection process of flounder breeding, establish candidate adult individuals whose pedigrees are used, perform genomic DNA extraction and chip detection, and verify the application effect of gene chips in flounder genome selection breeding. Information on the adopted individuals is shown in Table 4.
Figure 0007158496000005
Figure 0007158496000006
Figure 0007158496000007

(2)チップ検測
遺伝子チップ検測の標準プロセスに従い、AffymetrixGeneTitan遺伝子チップ処理システムを用いてプローブ交雑・染色とチップスキャンを完成する。具体的な操作方法は以下のとおりである:4μgの高品質なDNAテンプレートを2ml*96の深穴板に加え、変性剤を加えて変性(28℃)を行い、変性10min後に変性停止液(反応時間が10minより長くない)変性を停止し、単鎖DNAを取得する。チップサイト増幅用48697ペアプライマーと等温増幅酵素・dNTPと反応液等を深穴板に加え、深穴板を封止し、37℃で等温増幅を22-36h行う。好ましくは24h増幅した後、高温65℃で20-30minで反応液を不活性化させ、その後37℃の培養箱に移転して40min培養し、断片化酵素と反応液[41}を加え、断片化増幅産物を断片化し、既存の反応液と等体積のイソプロパノールを加え、反応液が澄むまで反応液を均一に混合させ、その後-20℃の冷蔵庫において産物を沈殿させる。24h沈殿した後、4℃、3、000gで40-60min遠心してDNA産物の沈殿物を取得し、上澄液を除去し、沈殿物を保留し、37℃で残りのイソプロパノールを完全に除去し、沈殿物を溶解させ、交雑液を取得する。交雑液は温度制御増幅装置を用いて交雑を行い、条件が95℃10min、48℃3minで、その後交雑液を持続的に48℃に維持する。チップブロックを交雑液に浸漬し、48℃の交雑炉において24h交雑させる。その後溶出、蛍光タンパク質を接続し、蛍光タンパク質を固定させ、交雑プローブ、蛍光信号をスキャンすることにより、各サイトの交雑結果を取得し、チップスキャン結果はAxiomAnalysisSuite(AxAS)ソフトウェア(米国ThermoFisher社製)を用いて解析を行う。
(2) Chip detection Following the standard process of gene chip detection, Affymetrix GeneTitan gene chip processing system is used to complete probe hybridization/staining and chip scanning. The specific operating method is as follows: add 4 μg of high-quality DNA template to a 2 ml*96 deep-well plate, add a denaturant to denature (28° C.), denature for 10 min, then denature stop solution ( (Reaction time not longer than 10 min) Stop denaturation and obtain single-stranded DNA. 48697 pair primers for chip site amplification, isothermal amplification enzyme/dNTP, reaction solution, etc. are added to the deep hole plate, the deep hole plate is sealed, and isothermal amplification is performed at 37° C. for 22 to 36 hours. Preferably, after amplification for 24 hours, the reaction mixture is inactivated at a high temperature of 65°C for 20-30 minutes, then transferred to a culture box at 37°C and cultured for 40 minutes. Fragment the amplified product, add an equal volume of isopropanol to the existing reaction mixture, mix the reaction mixture evenly until the reaction mixture is clear, and then place the product in a refrigerator at -20°C to precipitate. After 24 h of precipitation, the DNA product was precipitated by centrifugation at 3,000 g for 40-60 min at 4°C, the supernatant was removed, the precipitate was retained, and the remaining isopropanol was completely removed at 37°C. , to dissolve the precipitate and obtain the hybridization fluid. The hybrid was hybridized using a temperature-controlled amplifier under the conditions of 95°C for 10 min and 48°C for 3 min, after which the hybrid was maintained at 48°C continuously. The chip blocks are immersed in the hybridization solution and hybridized for 24 h in a hybridization oven at 48°C. After that, elution, fluorescent protein is attached, fluorescent protein is immobilized, hybridization probe and fluorescence signal are scanned to obtain the hybridization result of each site, and the chip scan result is obtained with Axiom Analysis Suite (AxAS) software (manufactured by ThermoFisher, USA). Analysis is performed using

(3)データ解析
AxASソフトウェア(米国ThermoFisher社製)を利用してチップスキャンの結果を解析し、各サンプルの遺伝子分類結果を取得する。解析結果によると、チップの平均的な分類率が98.77%で、分類効果が抜群である。そのうち、高品質なSNP比率が74.61%で、各サンプルはいずれも高品質な分類情報を生成することができる。
(3) Data Analysis The chip scan results are analyzed using AxAS software (manufactured by ThermoFisher, USA) to obtain gene classification results for each sample. According to the analysis results, the average classification rate of the chips is 98.77%, which has excellent classification effect. Among them, the high-quality SNP ratio is 74.61%, and each sample can generate high-quality classification information.

実施例3「魚チップ1号」遺伝子チップがヒラメ耐病性育種における応用
1、「魚チップ1号」遺伝子チップ分類の効果検証
参考グループから一部の個体を選定して遺伝子チップ分類の信頼性を検証するために用いられるもの、これらの選定される個体は再配列の遺伝子型でもあり、「魚チップ1号」遺伝子チップ分類を用いて取得される遺伝子型でもある。発明者の既存のヒラメ参考グループから一部の個体遺伝子チップを選定して分類・統計を行う。チップを用いて取得した遺伝子型(0/1/2はAA/Aa/aaを示す)と再配列して取得した遺伝子型の一致性および再配列とチップデータを利用して推定したGEBVの関連係数を統計することによりチップ分類の効果を評価する。分類結果の一致性が88%以上に達しかつGEBVの間の関連係数が0.9以上に達する場合、チップが優れた分類結果を有すると見なす。
Example 3 Application of “Fish Chip No. 1” Gene Chip to Disease Resistance Breeding of Flounder Used for validation, these selected individuals were both genotyped for rearrangement and genotyped using the "Fish Chip No. 1" gene chip classification. Classification and statistics are performed by selecting some individual gene chips from the inventor's existing flounder reference group. Consistency between genotypes obtained using a chip (0/1/2 indicates AA/Aa/aa) and genotypes obtained by rearrangement, and association between rearrangement and GEBV estimated using chip data Evaluate the effect of chip classification by statistical coefficients. A chip is considered to have a good classification result if the consistency of the classification result reaches 88% or more and the correlation coefficient between GEBV reaches 0.9 or more.

解析結果によると、「魚チップ1号」遺伝子チップ分類を利用して取得したサイト情報の90.08%は再配列と同様で、2組のGEBVの間の関連係数は0.958である。そのため、本発明が開発されるヒラメ遺伝子チップの分類結果は再配列と基本的に一致し、ヒラメに正確な遺伝子分類を行うことができる。 The analysis results show that 90.08% of the site information obtained using the “fish chip No. 1” gene chip classification is similar to the rearrangement, and the association coefficient between the two sets of GEBV is 0.958. Therefore, the classification result of the flounder gene chip developed by the present invention is basically consistent with the rearrangement, and accurate gene classification can be performed for flounder.

具体的な操作方法は以下のとおりである:
PLINKソフトウェアを利用してチップデータを読み取り、サーバーにおいて以下のコマンドを入力して上記データに処理を行う:
plink --vcf op2-1.vcf --make-bed --out op_Val_1
plink --vcf cs2-2.vcf --make-bed --out op_Val_2
plink --vcf op2-3.vcf --make-bed --out op_Val_3
plink --vcf op2-4.vcf --make-bed --out op_Val_4
The specific operation method is as follows:
Use PLINK software to read the chip data, and enter the following command on the server to process the above data:
plink --vcf op2-1. vcf --make-bed --out op_Val_1
plink --vcf cs2-2. vcf --make-bed --out op_Val_2
plink --vcf op2-3. vcf --make-bed --out op_Val_3
plink --vcf op2-4. vcf --make-bed --out op_Val_4

読み取りを経て、4つのvcfにおける情報は表5による:

Figure 0007158496000008
After reading, the information in the 4 vcf is according to Table 5:
Figure 0007158496000008

a)RにおいてSNPを再命名しかつ4つのファイルにおける共通となるマーカー情報を抽出し、コマンドは以下のとおりである:
#必要なRパッケージをアンロードする
library(data.table)
#cs_Val_1とcs_Val_2のサイト情報を読み取る
val_1 <- fread(“op_Val_1.bim”, header = F)
val_2 <- fread(“op_Val_2.bim”, header = F)
val_3 <- fread(“op_Val_3.bim”, header = F)
val_4 <- fread(“op_Val_4.bim”, header = F)
#SNP命名方式を統一しかつ再命名したファイルを出力する
val_1$V2 <- paste(paste(rep(“rs”, nrow(val_1)), val_1$V1, sep =“ ”), val_1$V4, sep = “:”)
val_2$V2 <- paste(paste(rep(“rs”, nrow(val_2)), val_2$V1, sep = “ ”), val_2$V4, sep = “:”)
val_3$V2 <- paste(paste(rep(“rs”, nrow(val_3)), val_3$V1, sep =“ ”), val_3$V4, sep = “:”)
val_4$V2 <- paste(paste(rep(“rs”, nrow(val_4)), val_4$V1, sep =“ ”), val_4$V4, sep = “:”)
write.table(val_1, “op_Val_1.bim”, sep = “¥t”, col.names = F, row.names = F, quote = F)
write.table(val_2, “op_Val_2.bim”, sep = “¥t”, col.names = F, row.names = F, quote = F)
write.table(val_3, “op_Val_3.bim”, sep = “¥t”, col.names = F, row.names = F, quote = F)
write.table(val_4, “op_Val_4.bim”, sep = “¥t”, col.names = F, row.names = F, quote = F)
#共通となるサイト情報を抽出しかつ出力する
comm <- Reduce(intersect, list (a = val_1$V2, b = val_2$V2, c = val_3$V2, d = val_4$V2))
write.table(comm, “common_snps.txt”, sep = “¥t”, col.names = F, row.names = F, quote = F)
a) Rename the SNPs in R and extract the common marker information in the four files, the commands are:
# library (data.table) to unload the required R packages
# Read site info for cs_Val_1 and cs_Val_2 val_1 <- fread("op_Val_1.bim", header = F)
val_2 <- fread("op_Val_2.bim", header = F)
val_3 <- fread("op_Val_3.bim", header = F)
val_4 <-fread("op_Val_4.bim", header = F)
# Unify the SNP naming scheme and output a renamed file val_1$V2 <- paste (paste(rep(“rs”, nrow(val_1)), val_1$V1, sep = “ ”), val_1$V4, sep = “:”)
val_2$V2 <- paste(paste(rep("rs", nrow(val_2)), val_2$V1, sep = ""), val_2$V4, sep = ":")
val_3$V2 <- paste(paste(rep("rs", nrow(val_3)), val_3$V1, sep=""), val_3$V4, sep=":")
val_4$V2 <- paste(paste(rep("rs", nrow(val_4)), val_4$V1, sep=""), val_4$V4, sep=":")
write. table(val_1, "op_Val_1.bim", sep = "\t", col.names = F, row.names = F, quote = F)
write. table(val_2, "op_Val_2.bim", sep = "\t", col.names = F, row.names = F, quote = F)
write. table(val_3, "op_Val_3.bim", sep = "\t", col.names = F, row.names = F, quote = F)
write. table(val_4, "op_Val_4.bim", sep = "\t", col.names = F, row.names = F, quote = F)
# Extract and output common site information comm <- Reduce(intersect, list (a = val_1$V2, b = val_2$V2, c = val_3$V2, d = val_4$V2))
write. table(comm, "common_snps.txt", sep = "\t", col.names = F, row.names = F, quote = F)

b)PLINKソフトウェアを利用して4つのファイルを組み合わせ、かつ共通となるマーカーを保留し、コマンドは以下のとおりである:
plink --bfile op_Val_1 --merge-list merge_op.txt --extract common_snps.txt --recode A --out op_chip
ファイル「merge_op.txt」に以下の情報が貯蔵されている:
op_Val_2.bed op_Val_2.bim op_Val_2.fam
op_Val_3.bed op_Val_3.bim op_Val_3.fam
op_Val_4.bed op_Val_4.bim op_Val_4.fam
b) Combine the 4 files using PLINK software and retain the common markers, the command is:
plink --bfile op_Val_1 --merge-list merge_op. txt--extract common_snps. txt --recode A --out op_chip
The following information is stored in the file "merge_op.txt":
op_Val_2. bed op_Val_2. bim op_Val_2. fam
op_Val_3. bed op_Val_3. bim op_Val_3. fam
op_Val_4. bed op_Val_4. bim op_Val_4. fam

c)PLINKソフトウェアを利用して参考グループから同様である個体とサイトを抽出し、上記の4つのファイルにおける.fam情報をファイルに整理しかつ「op_chip_indi.txt」と命名し、「…」はファイルカタログを表し、コマンドは以下のとおりである:
plink --bfile …/Val_ref --keep op_chip_indi.txt --extract common_snps.txt --recode A --out op_rseq
処理を経て、上記4つのファイルが共通となるマーカー数が11、719で、参考グループにおいて検索可能な個体数が95である。
c) Using PLINK software to extract similar individuals and sites from the reference group, . Organize the fam information into a file and name it "op_chip_indi.txt", where "..." represents the file catalog, the command is:
plink --bfile .../Val_ref --keep op_chip_indi. txt--extract common_snps. txt --recode A --out op_rseq
After processing, the number of markers that are common to the above four files is 11,719, and the number of individuals that can be retrieved in the reference group is 95.

d)R統計チップと再配列分類の一致性を利用し、その方法は以下のとおりである:
#必要なRパッケージをアンロードする
library(data.table)
#それぞれチップと再配列が取得した分類情報を読み取る
chip <- fread(“op_chip.raw”)
rseq <- fread(“op_rseq.raw”)
#ファイルにおける個体配列を統一する
fid <- data.frame(rseq$FID)
colnames(fid) <- “FID”
chip <- data.table(merge(fid, chip, sort = F))
#ファイルにおける最初の6列を削除し、かつ遺伝子型を出力する
chip[, c(1: 6):= NULL]
rseq[, c(1: 6):= NULL]
fwrite(chip, “geno_op_chip.csv”, sep = “ ,”, row.names = F, quote = F)
fwrite(rseq, “geno_op_rseq.csv”, sep = “ ,”, row.names = F, quote = F)
#一致性を統計する
sum(chip == rseq) / (nrow(chip) * ncol(chip)) * 100
統計によると、上記95の個体は計1、113、305のマーカーを有し、完全に同様であるマーカー数が1、002、829であることがわかった。そのため、チップと再配列分類の結果は90.08%完全に一致する。
d) Using the agreement of R statistic tip and rearrangement classification, the method is as follows:
# library (data.table) to unload the required R packages
# chip <- fread("op_chip.raw") to read the classification information obtained by chip and rearrangement respectively
rseq <-fread("op_rseq.raw")
# Unify the individual arrays in the file fid <- data. frame(rseq$FID)
colnames(fid) <- “FID”
chip <- data. table(merge(fid, chip, sort=F))
# chip[, c(1:6) := NULL] to delete the first 6 columns in the file and output the genotype
rseq[,c(1:6):= NULL]
fwrite(chip, "geno_op_chip.csv", sep = ",", row.names = F, quote = F)
fwrite(rseq, "geno_op_rseq.csv", sep = ",", row.names = F, quote = F)
# Sum(chip == rseq) / (nrow(chip) * ncol(chip)) * 100 to stat match
Statistics showed that the 95 individuals had a total of 1,113,305 markers and 1,002,829 markers that were completely similar. Therefore, the results of chip and rearrangement classification are 90.08% perfect match.

e)GEBV推定の正確性を保証するために、PLINKソフトウェアを利用して参考グループにおける残りの個体の遺伝子型を抽出し、コマンドは以下のとおりである:
plink --bfile …/Val_ref --remove op_chip_indi.txt --extract common_snps.txt --recode A --out ref
e) To ensure the accuracy of the GEBV estimation, genotype the remaining individuals in the reference group using PLINK software, the command is:
plink --bfile .../Val_ref --remove op_chip_indi. txt--extract common_snps. txt --recode A --out ref

f)Rを利用して参考グループを合併するおよび個体の遺伝子型を検証し、その方法は以下のとおりである:
#必要なRパッケージをアンロードする
library(data.table)
#それぞれチップと再配列が取得した分類情報を読み取る
chip <- as.matrix(fread(“geno_op_chip.csv”))
rseq <- as.matrix(fread(“geno_op_rseq.csv”))
ref <- fread(“ref.raw”)
#ファイルにおける最初の6列を削除する
ref[, c(1: 6):= NULL]
ref <- as.matrix(ref)
#遺伝子型ファイルを組み合わせかつ組み合わせた後の遺伝子型ファイルを出力する
geno_chip <- rbind(chip, ref)
geno_rseq <- rbind(rseq, ref)
write.table(geno_chip, “geno_Val_Chip.csv”, sep = “ ,”, row.names = F, quote = F)
write.table(geno_rseq, “geno_Val_Rseq.csv”, sep = “ ,”, row.names = F, quote = F)
f) Utilizing R to merge reference groups and verify individual genotypes, the method is as follows:
# library (data.table) to unload the required R packages
# read the classification information obtained by chip and rearrange respectively chip <- as. matrix(fread(“geno_op_chip.csv”))
rseq <- as. matrix(fread(“geno_op_rseq.csv”))
ref <-fread("ref.raw")
# Remove the first 6 columns in the file ref[, c(1:6) := NULL]
ref <- as. matrix (ref)
# Combine genotype files and output genotype file after combination geno_chip <- rbind(chip, ref)
geno_rseq <-rbind(rseq, ref)
write. table(geno_chip, "geno_Val_Chip.csv", sep = ",", row.names = F, quote = F)
write. table(geno_rseq, "geno_Val_Rseq.csv", sep = ",", row.names = F, quote = F)

g)g)において取得した2つのxxx.csvファイルがRにおいて加重GBLUP方法を用いてGEBVを推定する。具体的な操作方法は以下のとおりである(Linux環境):
#必要なRパッケージと関数をアンロードする
library(parallel)
library(data.table)
library(asreml)
library(pROC)
source(“ginv.R”)
関数ginvの定義は以下のとおりである:
ginv <- function(invG) {
Ginv <- data.frame(row = rep(1: nrow(invG), nrow(invG)), column = rep(1: nrow(invG), each = nrow(invG)), value = as.numeric(invG), lower.mat = as.logical(lower.tri(invG, diag = T)))
Ginv <- Ginv[Ginv$lower.mat == T, c(“row”, “column”, “value”)]
Ginv <- Ginv[order(Ginv$row, Ginv$column), ]
return(Ginv)

#遺伝子型、表現型情報を読み取る
geno_chip <- as.matrix(fread(“geno_Val_Chip.csv”, nThread = 10))
geno_rseq <- as.matrix(fread(“geno_Val_Rseq.csv”, nThread = 10))
pheno <- asreml.read.table(“pheno_op_Val.csv”, header=T, sep=“ ,”)
rnames <- as.matrix(fread(“pheno_op_Val.csv”))[, 1]
M_1 <- geno_chip
M_2 <- geno_rseq
#各サイトの二次等位遺伝子頻度を計算する
pi_1 <- round(colSums(M_1) / (2 * nrow(M_1)), 3)
pi_2 <- round(colSums(M_2) / (2 * nrow(M_2)), 3)
#Pマトリックスを構築する
P_1 <- matrix(2 * pi_1, byrow = T, nrow = nrow(M_1), ncol = ncol(M_1))
P_2 <- matrix(2 * pi_2, byrow = T, nrow = nrow(M_2), ncol = ncol(M_2))
#Zマトリックスを構築する
Z_1 <- as.matrix(M_1 - P_1)
Z_2 <- as.matrix(M_2 - P_2)
#等式分子の項を構築する
ZZt_1 <- do.call(‘rbind’, mclapply(1: nrow(Z_1), FUN = function(x) {tcrossprod(Z_1[x, ], Zt_1)}, mc.cores = 20))
ZZt_2 <- do.call(‘rbind’, mclapply(1: nrow(Z_2), FUN = function(x) {tcrossprod(Z_2[x, ], Zt_2)}, mc.cores = 20))
#等式分母の項を構築する
denom_1 <- 2 * (sum(pi_1 * (1 - pi_1)))
denom_2 <- 2 * (sum(pi_2 * (1 - pi_2)))
#Gマトリックスを構築する
G_ chip <- ZZt_1 / denom_1
G_ rseq <- ZZt_2 / denom_2
diag(G_chip) <- diag(G_chip) + 0.01
diag(G_rseq) <- diag(G_rseq) + 0.01
#Gマトリックスインバージョン
invG_chip0 <- solve(G_chip)
invG_rseq0 <- solve(G_rseq)
#GマトリックスインバージョンをASRemlが利用可能な三列の形式に変換し、かつ出力する
Ginv_chip <- ginv(invG_chip)
Ginv_rseq <- ginv(invG_rseq)
write.table(Ginv_chip, “Ginv_op_Val_Chip_at_iter_0.csv”, sep = “ ,”, row.names = F, quote = F)
write.table(Ginv_rseq, “Ginv_op_Val_Rseq_at_iter_0.csv”, sep = “ ,”, row.names = F, quote = F)
#GEBVを計算する
attr(Ginv_chip, “rowNames”) <- paste(rnames)
attr(Ginv_rseq, “rowNames”) <- paste(rnames)
WGBLUP1 <- asreml(status ~ 1,
random = ~giv(IID),
ginverse = list(IID = Ginv_chip),
rcov = ~units,
family = asreml.binomial(link = “logit”)
data = pheno,
maxiter = 100,
na.method.X = ‘omit’)
gebv01 <- coef(WGBLUP1)$random
write.table(gebv01, “GEBVs_at_iter_0_chip.csv”, sep = “ ,”, row.names = F, quote = F)

WGBLUP2 <- asreml(status ~ 1,
random = ~giv(IID),
ginverse = list(IID = Ginv_rseq),
rcov = ~units,
family = asreml.binomial(link = “logit”),
data = pheno,
maxiter = 100,
na.method.X = ‘omit’)
gebv02 <- coef(WGBLUP2)$random
write.table(gebv02, “GEBVs_at_iter_0_rseq.csv”, sep = “ ,”, row.names = F, quote = F)

#GBLUPプロセス・チップ部分を加重し、forサイクルを用い、6回の加重反復過程を実行する
#マーカーの効果値を推定する
Zt1 <- t(Z_1)
p1 <- do.call(‘rbind’, mclapply(1: nrow(Zt1), FUN = function(x) {Zt1[x, ] %*% invG01}, mc.cores = 5))
p2 <- p1 %*% gebv01
u01 <- p2 / pp_1
Varu01 <- u01 * u01 * 2 * pi_1 * (1 - pi_1)
write.table(u01, “ markerEff_chip_at_iter_0.csv”, sep = “ ,”, row.names = F, quote = F)
write.table(Varu01, “markerVar_chip_at_iter_0.csv”, sep = “ ,”, row.names = F, quote = F)
#各マーカーの重みを推定する
D <- rep(1, ncol(M_1))
D0 <- rep(1, ncol(M_1))
pre_D <- D0
u <- u01
cal <- 6
inter <- 1
for(t in 1: cal) {
d <- as.vector(u * u * 2 * pi_1 * (1 - pi_1))
for(i in 1: (length(d) / inter)) {
di <- mean(d[(inter * (i - 1) + 1): (inter * i)])
pre_D[(inter * (i - 1) + 1): (inter * i)] <- di

D <- sum(D0) / sum(pre_D) * pre_D
write.table(D, file = paste(“weights_chip_at_iter_”, t, “ .csv”, sep =“ ”), sep=“ ,”, row.names=F, quote = F)
#加重Gマトリックスを推定する
p1 <- do.call(‘cbind’, mclapply(1: ncol(Z_1), FUN = function(x) {Z_1[, x] * D[x]}, mc.cores = 5))
p2 <- do.call(‘rbind’, mclapply(1: nrow(p1), FUN = function(x) {p1[x, ] %*% Zt1}, mc.cores = 5))
G <- p2 / pp_1
write.table(G, file = paste(“G_chip_at_iter_”, t, “.csv”, sep =“ ”), sep=“ ,”, row.names=F, quote = F)
diag(G) <- diag(G) + 0.01
invG <- solve(G)
write.table(invG, file = paste(“invG_chip_at_iter_”, t, “.csv”, sep =“ ”), sep=“,”, row.names=F, quote = F)
Ginv <- ginv(invG)
write.table(Ginv, file = paste(“Ginv_chip_at_iter_”, t, “.csv”, sep =“ ”), sep=“ ,”, row.names=F, quote = F)
attr(Ginv,“rowNames”) <- paste(rnames)
WGBLUP <- asreml(status ~ 1,
random = ~giv(IID),
ginverse = list(IID = Ginv),
rcov = ~units,
family = asreml.binomial(link = “logit”),
data = pheno,
maxiter = 100,
na.method.X =‘omit’)
gebv <- coef(WGBLUP)$random
write.table(gebv, file = paste(“GEBVs_chip_at_iter_”, t, “.csv”, sep =“ ”), sep = “ ,”, row.names = F, quote = F)
#加重後マーカーの効果値を推定する
p1 <- do.call(‘rbind’, mclapply(1: nrow(Zt1), FUN = function(x) {D[x] * Zt1[x, ]}, mc.cores = 5))
p2 <- do.call(‘rbind’, mclapply(1: nrow(p1), FUN = function(x) {p1[x, ] %*% invG}, mc.cores = 5))
p3 <- do.call(‘rbind’, mclapply(1: nrow(p2), FUN = function(x) {p2[x, ] %*% gebv01}, mc.cores = 5))
u <- p3 / pp_1
Varu <- u * u * 2 * pi_1 * (1 - pi_1)
write.table(u, file = paste(“markerEff_chip_at_iter_”, t, “.csv”, sep =“ ”), sep = “ ,”, row.names = F, quote = F)
write.table(Varu, file = paste(“ markerVar_chip_at_iter_”, t, “.csv”, sep =“ ”), sep = “ ,”, row.names = F, quote = F)

#GBLUPプロセス・再配列部分を加重し、forサイクルを用い、6回の加重反復過程を実行する
Zt2 <- t(Z_2)
p1 <- do.call(‘rbind’, mclapply(1: nrow(Zt2), FUN = function(x) {Zt2[x, ] %*% invG02}, mc.cores = 5))
p2 <- p1 %*% gebv02
u02 <- p2 / pp_2
Varu02 <- u02 * u02 * 2 * pi_2 * (1 - pi_2)
write.table(u02, “ markerEff_rseq_at_iter_0.csv”, sep = “ ,”, row.names = F, quote = F)
write.table(Varu02, “ markerVar_rseq_at_iter_0.csv”, sep = “ ,”, row.names = F, quote = F)
#各マーカーの重みを推定する
D <- rep(1, ncol(M_2))
D0 <- rep(1, ncol(M_2))
pre_D <- D0
u <- u02
cal <- 6
inter <- 1
for(t in 1: cal) {
d <- as.vector(u * u * 2 * pi_2 * (1 - pi_2))
for(i in 1: (length(d) / inter)) {
di <- mean(d[(inter * (i - 1) + 1): (inter * i)])
pre_D[(inter * (i - 1) + 1): (inter * i)] <- di

D <- sum(D0) / sum(pre_D) * pre_D
write.table(D, file = paste(“ weights_rseq_at_iter_”, t, “ .csv”, sep =“ ”), sep=“ ,”, row.names=F, quote = F)
#加重Gマトリックスを推定する
p1 <- do.call(‘cbind’, mclapply(1: ncol(Z_2), FUN = function(x) {Z_2[, x] * D[x]}, mc.cores = 5))
p2 <- do.call(‘rbind’, mclapply(1: nrow(p1), FUN = function(x) {p1[x, ] %*% Zt2}, mc.cores = 5))
G <- p2 / pp_2
write.table(G, file = paste(“G_rseq_at_iter_”, t, “.csv”, sep =“ ”), sep=“ ,”, row.names=F, quote = F)
diag(G) <- diag(G) + 0.01
invG <- solve(G)
write.table(invG, file = paste(“invG_rseq_at_iter_”, t, “ .csv”, sep =“ ”), sep=“ ,”, row.names=F, quote = F)
Ginv <- ginv(invG)
write.table(Ginv, file = paste(“Ginv_rseq_at_iter_”, t, “ .csv”, sep =“ ”), sep=“ ,”, row.names=F, quote = F)
attr(Ginv,“ rowNames”) <- paste(rnames)
WGBLUP <- asreml(status ~ 1,
random = ~giv(IID),
ginverse = list(IID = Ginv),
rcov = ~units,
family = asreml.binomial(link = ”logit”),
data = pheno,
maxiter = 100,
na.method.X = ‘omit’)
gebv <- coef(WGBLUP)$random
write.table(gebv, file = paste(“ GEBVs_rseq_at_iter_”, t, “ .csv”, sep =“ ”), sep = “ ,”, row.names = F, quote = F)
#加重後マーカーの効果値を推定する
p1 <- do.call(’rbind’, mclapply(1: nrow(Zt2), FUN = function(x) {D[x] * Zt2[x, ]}, mc.cores = 5))
p2 <- do.call(‘rbind’, mclapply(1: nrow(p1), FUN = function(x) {p1[x, ] %*% invG}, mc.cores = 5))
p3 <- do.call(‘rbind’, mclapply(1: nrow(p2), FUN = function(x) {p2[x, ] %*% gebv02}, mc.cores = 5))
u <- p3 / pp_2
Varu <- u * u * 2 * pi_2 * (1 - pi_2)
write.table(u, file = paste(“markerEff_rseq_at_iter_”, t, “.csv”, sep =“ ”), sep = “ ,”, row.names = F, quote = F)
write.table(Varu, file = paste(“ markerVar_rseq_at_iter_”, t, “ .csv”, sep =“ ”), sep = “ ,”, row.names = F, quote = F)

計算を経て、第4回反復した時に加重GBLUP方法は安定に近づき、そのためこの時の反復結果について後続の研究を実施する。検証に用いられる95の個体GEBVの間における関連係数は0.958で、これらの個体はGEBVは表6による:
g) the two xxx. A csv file estimates the GEBV using the weighted GBLUP method in R. The specific operation method is as follows (Linux environment):
# library (parallel) to unload the required R packages and functions
library(data.table)
library (asreml)
library (pROC)
source("ginv.R")
The definition of the function ginv is:
ginv <- function(invG) {
Ginv <- data. frame(row = rep(1: nrow(invG), nrow(invG)), column = rep(1: nrow(invG), each = nrow(invG)), value = as.numeric(invG), lower.mat = as.logical(lower.tri(invG,diag=T)))
Ginv <- Ginv [Ginv$lower. mat == T, c(“row”, “column”, “value”)]
Ginv <- Ginv [order(Ginv$row, Ginv$column), ]
return (Ginv)
}
# geno_chip to read genotype and phenotype information <- as. matrix(fread("geno_Val_Chip.csv", nThread=10))
geno_rseq <- as. matrix(fread("geno_Val_Rseq.csv", nThread=10))
pheno <- asreml. read. table("pheno_op_Val.csv", header=T, sep=",")
rnames <- as. matrix(fread(“pheno_op_Val.csv”)) [, 1]
M_1 <- geno_chip
M_2 <- geno_rseq
# Calculate the secondary coordinate gene frequency of each site pi_1 <- round(colSums(M_1)/(2*nrow(M_1)), 3)
pi_2 <- round(colSums(M_2)/(2*nrow(M_2)), 3)
# Construct a P matrix P_1 <- matrix(2*pi_1, byrow = T, nrow = nrow(M_1), ncol = ncol(M_1))
P_2<−matrix(2*pi_2, byrow=T, nrow=nrow(M_2), ncol=ncol(M_2))
# Construct Z matrix Z_1 <- as. matrix(M_1 - P_1)
Z_2 <- as. matrix(M_2-P_2)
# Build the terms of the equation numerator ZZt_1 <- do. call('rbind', mclapply(1: nrow(Z_1), FUN = function(x) {tcrossprod(Z_1[x, ], Zt_1)}, mc.cores = 20))
ZZt_2 <- do. call('rbind', mclapply(1: nrow(Z_2), FUN = function(x) {tcrossprod(Z_2[x, ], Zt_2)}, mc. cores = 20))
# Build the terms of the equation denominator denom_1 <- 2 * (sum(pi_1 * (1 - pi_1)))
denom_2<−2*(sum(pi_2*(1−pi_2)))
# Build the G matrix G_chip <- ZZt_1/denom_1
G_ rseq <- ZZt_2 / denom_2
diag(G_chip) <− diag(G_chip) + 0.01
diag(G_rseq) <− diag(G_rseq) + 0.01
# G matrix inversion invG_chip0 <- solve(G_chip)
invG_rseq0 <- solve(G_rseq)
# Ginv_chip <- ginv(invG_chip) which converts and outputs the G matrix inversion to a three-column format that ASReml can use
Ginv_rseq <- ginv(invG_rseq)
write. table(Ginv_chip, "Ginv_op_Val_Chip_at_iter_0.csv", sep = ",", row.names = F, quote = F)
write. table(Ginv_rseq, "Ginv_op_Val_Rseq_at_iter_0.csv", sep = ",", row.names = F, quote = F)
# Calculate GEBV attr(Ginv_chip, "rowNames") <- paste(rnames)
attr(Ginv_rseq, "rowNames") <- paste(rnames)
WGBLUP1 <-asreml(status ~ 1,
random = ~ giv(IID),
ginverse = list(IID = Ginv_chip),
rcov = ~units,
family = asreml. binomial (link = “logit”)
data = pheno,
maxiter = 100,
na. method. X = 'omit')
gebv01 <- coef(WGBLUP1)$random
write. table(gebv01, "GEBVs_at_iter_0_chip.csv", sep = ",", row.names = F, quote = F)

WGBLUP2 <-asreml(status ~ 1,
random = ~ giv(IID),
ginverse = list(IID = Ginv_rseq),
rcov = ~units,
family = asreml. binomial(link = “logit”),
data = pheno,
maxiter = 100,
na. method. X = 'omit')
gebv02 <- coef(WGBLUP2)$random
write. table(gebv02, "GEBVs_at_iter_0_rseq.csv", sep = ",", row.names = F, quote = F)

# Weight the GBLUP process chip part and use the for cycle and run 6 weighted iterations # Estimate the effect value of the marker Zt1 <- t(Z_1)
p1 <- do. call('rbind', mclapply(1: nrow(Zt1), FUN = function(x) {Zt1[x, ] %*% invG01}, mc. cores = 5))
p2 <- p1 %*% gebv01
u01 <- p2/pp_1
Varu01<-u01*u01*2*pi_1*(1-pi_1)
write. table(u01, "markerEff_chip_at_iter_0.csv", sep = ",", row.names = F, quote = F)
write. table(Varu01, "markerVar_chip_at_iter_0.csv", sep = ",", row.names = F, quote = F)
# Estimate the weight of each marker D <- rep(1, ncol(M_1))
D0 <- rep(1, ncol(M_1))
pre_D <- D0
u <-u01
cal <-6
inter <- 1
for(t in 1:cal) {
d <-as. vector(u*u*2*pi_1*(1-pi_1))
for(i in 1: (length(d)/inter)) {
di <−mean(d[(inter*(i−1)+1):(inter*i)])
pre_D[(inter * (i - 1) + 1): (inter * i)] <- di
}
D<−sum(D0)/sum(pre_D)*pre_D
write. table(D, file = paste("weights_chip_at_iter_", t, ".csv", sep = ""), sep = ",", row.names = F, quote = F)
# Estimate the weighted G matrix p1 <- do. call('cbind', mclapply(1: ncol(Z_1), FUN = function(x) {Z_1[, x]*D[x]}, mc. cores = 5))
p2 <- do. call('rbind', mclapply(1: nrow(p1), FUN = function(x) {p1[x, ] %*% Zt1}, mc. cores = 5))
G<-p2/pp_1
write. table(G, file = paste("G_chip_at_iter_", t, ".csv", sep = ""), sep = ",", row.names = F, quote = F)
diag(G) <− diag(G) + 0.01
invG <- solve(G)
write. table(invG, file = paste("invG_chip_at_iter_", t, ".csv", sep = ""), sep = ",", row.names = F, quote = F)
Ginv<-ginv(invG)
write. table(Ginv, file = paste("Ginv_chip_at_iter_", t, ".csv", sep = ""), sep = ",", row.names = F, quote = F)
attr(Ginv, "rowNames") <- paste(rnames)
WGBLUP <-asreml(status ~ 1,
random = ~ giv(IID),
ginverse = list(IID = Ginv),
rcov = ~units,
family = asreml. binomial(link = “logit”),
data = pheno,
maxiter = 100,
na. method. X = 'omit')
gebv <- coef(WGBLUP)$random
write. table(gebv, file = paste("GEBVs_chip_at_iter_", t, ".csv", sep = ""), sep = ",", row.names = F, quote = F)
# Estimate the effect value of the weighted marker p1 <- do. call('rbind', mclapply(1: nrow(Zt1), FUN = function(x) {D[x] * Zt1[x, ]}, mc. cores = 5))
p2 <- do. call('rbind', mclapply(1: nrow(p1), FUN = function(x) {p1[x, ] %*% invG}, mc. cores = 5))
p3 <- do. call('rbind', mclapply(1: nrow(p2), FUN = function(x) {p2[x, ] %*% gebv01}, mc. cores = 5))
u<−p3/pp_1
Varu<−u*u*2*pi_1*(1−pi_1)
write. table(u, file = paste("markerEff_chip_at_iter_", t, ".csv", sep = ""), sep = ",", row.names = F, quote = F)
write. table(Varu, file = paste("markerVar_chip_at_iter_", t, ".csv", sep = ""), sep = ",", row.names = F, quote = F)
}
# GBLUP process - weight the rearrangement part, use the for cycle and perform 6 weighted iterations Zt2 <- t(Z_2)
p1 <- do. call('rbind', mclapply(1: nrow(Zt2), FUN = function(x) {Zt2[x, ] %*% invG02}, mc. cores = 5))
p2 <- p1 %*% gebv02
u02 <- p2/pp_2
Varu02<-u02*u02*2*pi_2*(1-pi_2)
write. table(u02, "markerEff_rseq_at_iter_0.csv", sep = ",", row.names = F, quote = F)
write. table(Varu02, "markerVar_rseq_at_iter_0.csv", sep = ",", row.names = F, quote = F)
# Estimate the weight of each marker D <- rep(1, ncol(M_2))
D0 <- rep(1, ncol(M_2))
pre_D <- D0
u <-u02
cal <-6
inter <- 1
for(t in 1:cal) {
d <-as. vector(u*u*2*pi_2*(1-pi_2))
for(i in 1: (length(d)/inter)) {
di <−mean(d[(inter*(i−1)+1):(inter*i)])
pre_D[(inter * (i - 1) + 1): (inter * i)] <- di
}
D<−sum(D0)/sum(pre_D)*pre_D
write. table(D, file = paste("weights_rseq_at_iter_", t, ".csv", sep = ""), sep = ",", row.names = F, quote = F)
# Estimate the weighted G matrix p1 <- do. call('cbind', mclapply(1: ncol(Z_2), FUN = function(x) {Z_2[, x]*D[x]}, mc.cores = 5))
p2 <- do. call('rbind', mclapply(1: nrow(p1), FUN = function(x) {p1[x, ] %*% Zt2}, mc. cores = 5))
G<-p2/pp_2
write. table(G, file = paste("G_rseq_at_iter_", t, ".csv", sep = ""), sep = ",", row.names = F, quote = F)
diag(G) <− diag(G) + 0.01
invG <- solve(G)
write. table(invG, file = paste("invG_rseq_at_iter_", t, ".csv", sep = ""), sep = ",", row.names = F, quote = F)
Ginv<-ginv(invG)
write. table(Ginv, file = paste("Ginv_rseq_at_iter_", t, ".csv", sep = ""), sep = ",", row.names = F, quote = F)
attr(Ginv, "rowNames") <- paste(rnames)
WGBLUP <-asreml(status ~ 1,
random = ~ giv(IID),
ginverse = list(IID = Ginv),
rcov = ~units,
family = asreml. binomial (link = "logit"),
data = pheno,
maxiter = 100,
na. method. X = 'omit')
gebv <- coef(WGBLUP)$random
write. table(gebv, file = paste("GEBVs_rseq_at_iter_", t, ".csv", sep = ""), sep = ",", row.names = F, quote = F)
# Estimate the effect value of the weighted marker p1 <- do. call('rbind', mclapply(1: nrow(Zt2), FUN = function(x) {D[x]*Zt2[x, ]}, mc. cores = 5))
p2 <- do. call('rbind', mclapply(1: nrow(p1), FUN = function(x) {p1[x, ] %*% invG}, mc. cores = 5))
p3 <- do. call('rbind', mclapply(1: nrow(p2), FUN = function(x) {p2[x, ] %*% gebv02}, mc. cores = 5))
u<−p3/pp_2
Varu<−u*u*2*pi_2*(1−pi_2)
write. table(u, file = paste("markerEff_rseq_at_iter_", t, ".csv", sep = ""), sep = ",", row.names = F, quote = F)
write. table(Varu, file = paste("markerVar_rseq_at_iter_", t, ".csv", sep = ""), sep = ",", row.names = F, quote = F)
}
Through computation, the weighted GBLUP method approaches stability at the fourth iteration, so subsequent studies are performed on the iteration results at this time. The association coefficient among the 95 individuals GEBV used for validation was 0.958, and these individuals had GEBV according to Table 6:

Figure 0007158496000009
Figure 0007158496000010
Figure 0007158496000011
Figure 0007158496000009
Figure 0007158496000010
Figure 0007158496000011

2、「魚チップ1号」遺伝子チップサイトが参考グループにおける検証
発明者の既存の参考グループから「魚チップ1号」遺伝子チップの設計サイトを抽出し、これらのサイト情報を利用して加重GBLUPを実施し、かつ5倍の交差検証方法のランダムな組分けを用いて加重GBLUP予測正確性の評価方法として、被検測者の操作特徴曲線下面積(AUC)を加重GBLUPの評価方法で正確性の指標とする。解析モデルは一般化線形混合モデルを用いる。組分けのランダム誤差を減らすために、データセットに10回の組分けを行い、各組は5回計算する。そのため、延べ50回計算し、50回のAUCの平均値を最終的な評価結果とする。
2. "Fish chip No. 1" gene chip site is verified in the reference group The design site of "fish chip No. 1" gene chip is extracted from the existing reference group of the inventor, and weighted GBLUP is performed using these site information. and using a random grouping of a 5-fold cross-validation method as an assessment method for weighted GBLUP predictive accuracy, the subject's area under the operating characteristic curve (AUC) was measured as a weighted GBLUP assessment method for accuracy. as an indicator of The analytical model uses a generalized linear mixed model. To reduce the random error of grouping, the data set is grouped 10 times and each group is calculated 5 times. Therefore, a total of 50 calculations are performed, and the average value of the 50 AUCs is used as the final evaluation result.

解析結果によると、ヒラメ参考グループにおいてSNPチップと同様であるマーカーを用いてゲノム選択を実施し、AUC(正確性)値が0.885で、従来のBLUP方法によるAUC(0.579)値より高い。そのため、発明者が設計されるチップサイトを用いてはゲノム選択を順調かつ高効率に実施することができる。 According to the analysis results, the genome selection was performed using the same markers as the SNP chip in the flounder reference group, and the AUC (accuracy) value was 0.885, which was higher than the AUC (0.579) value by the conventional BLUP method. high. Therefore, using the chip site designed by the inventor, genome selection can be carried out smoothly and highly efficiently.

具体的な操作方法は以下のとおりである:
fcGENE、BEAGLEとPLINKソフトウェアを利用してヒラメ参考グループから抽出されるチップ設計サイト:欠損サイトを充填しかつ遺伝子型ファイルを出力し、コマンドは以下のとおりである:
fcgene --ped geno_op_Rseq_chip.ped --map geno_op_Rseq_chip.map --oformat beagle --out plink2beagle

java -Xmx5120m -jar beagle.jar unphased=plink2beagle.bgl missing=0 niterations=20 gprobs=true out=imputed_geno

gunzip imputed_geno.plink2beagle.bgl.phased.gz

fcgene --bgl imputed_geno.plink2beagle.bgl.phased --pedinfo plink2beagle_pedinfo.txt --snpinfo plink2beagle_snpinfo.txt --oformat plink --out beagle2plink

plink --file beagle2plink --recode A --out genotype_op_chip_Rseq
さらにRに以下のコマンドを入力する:
library(data.table)
geno <- fread(“genotype_op_chip_Rseq.raw”)
geno[, c(1: 6):= NULL]
fwrite(geno, “genotype_op_chip_Rseq.csv”, sep = “ ,”, row.names = F, quote = F)
The specific operation method is as follows:
Chip design sites extracted from the flounder reference group using fcGENE, BEAGLE and PLINK software: fill in the missing sites and output the genotype file, the commands are as follows:
fcgene --ped geno_op_Rseq_chip. ped --map geno_op_Rseq_chip. map --format beagle --out plink2beagle

java -Xmx5120m -jar beagle. jar unphased=plink2beagle. bgl missing=0 niterations=20 gprobs=true out=imputed_geno

gunzip imputed_geno. plink2beagle. bgl. phased. gz

fcgene--bgl imputed_geno. plink2beagle. bgl. phased --pedinfo plink2beagle_pedinfo. txt --snpinfo plink2beagle_snpinfo. txt --format plink --out beagle2plink

plink --file beagle2plink --record A --out genotype_op_chip_Rseq
Then enter the following command in R:
library(data.table)
geno <-fread("genotype_op_chip_Rseq.raw")
geno[, c(1:6) := NULL]
fwrite(geno, "genotype_op_chip_Rseq.csv", sep = ",", row.names = F, quote = F)

a)Rにおいてa)で取得した遺伝子型ファイルを利用してRにおいて加重GBLUPを実施する。加重GBLUPの具体的な方法は1)におけるh)部分を参照して実施する。 a) Perform a weighted GBLUP in R using the genotype file obtained in a). A specific method of weighted GBLUP is performed with reference to part h) in 1).

b)構築済みの加重GマトリックスをASReml-Rに代入して交差検証方法を行う。交差検証方法前に組分けを実施する必要がある:Rにおいて関数sample(1:931、931)を用いてすべての個体にランダムソートを行い、さらにソート後のデジタルを5列に分け、各列に含まれる要素個数はそれぞれ186・186・186・186と187である。上記の過程を10回繰り返し、延べ10のファイルを取得する。この10のファイルを同一のフォルダに入れて使用に備える。解析は一般化線形混合モデルを用い、異なる検測ロットと個体齢を固定効果として、各個体はランダム効果として適合を実施する。5倍の交差検証方法の具体的な実施方法は以下のとおりである:
#必要なRパッケージと関数をアンロードする
library(parallel)
library(asreml)
library(pROC)
#表現型、Gマトリックスインバージョンの三列形式を読み取る
pheno <- asreml.read.table(“phenotype_931.csv”, header=T, sep=“ ,”, na.string = NA)
ped <- asreml.read.table(“pedigree_931.csv”, header=T, sep=“ ,”, na.string = NA)
ainv <- asreml.Ainverse(ped)$ginv
Ginv <- fread(“…/ Ginv_at_iter_4.csv ”)
attr(Ginv,“rowNames”) <- paste(pheno[, 1])
#外部サイクルの回数を設定する
N <- 10
#結果変数を設定する
res <- matrix(NA, nrow = 5 * N, ncol = 1)
colnames(res) <- c(”auc”)
#交差検証方法を行いかつ検証結果を出力する
for (i in 1: N) {
coor <- read.table(file = paste(“./coor/coor_”, i, “.csv”, sep =“ ”), sep = “ ,”, header = F)
cyc <- ncol(coor)
gebv <- matrix(NA, nrow = nrow(pheno), ncol = cyc)
for (j in 1: cyc) {
y <- pheno
y$status[coor[(1: sum(coor[, j] > 0, na.rm = T)), j]] <- NA
CV <- asreml(status ~ Batch + Age,
random = ~giv(AnimalID),
ginverse = list(AnimalID = Ginv),
rcov = ~units,
family = asreml.binomial(link = “logit”),
data = y,
maxiter = 50)

gebv[, j] <- coef(CV)$random
write.table(gebv, file = paste(“GEBVs_chip_ref_coor_”, i, “ .csv”, sep =“ ”), sep =“ ,”, row.names = F, quote = F)
res[(j + (i - 1) * cyc), ] <- roc(as.vector(pheno$status[coor[(1: sum(coor[, j] > 0, na.rm = T)), j]]), as.vector(gebv[coor[(1: sum(coor[, j] > 0, na.rm = T)), j], j]))$auc
write.table(res, “results_of_roc_op_chip_ref.csv”, sep = “ ,”, row.names = F, quote = F)


#従来のBLUPの推定方法は加重GBLUPと同様で、モデルにおいてainvを用いてGinvを代替すればよい
上記のコードを実行した後、加重GBLUP方法によるAUC平均値は0.885で、従来のBLUP方法によるAUC平均値は0.579である。50回の交差検証方法の結果は表7による:
b) Substitute the pre-constructed weighted G matrix into ASReml-R to perform the cross-validation method. Before the cross-validation method, it is necessary to perform grouping: in R, use the function sample(1:931, 931) to perform random sorting on all individuals, and further divide the digital after sorting into 5 columns, each column are 186, 186, 186, 186 and 187, respectively. The above process is repeated 10 times to obtain a total of 10 files. Place these 10 files in the same folder and prepare for use. The analysis uses a generalized linear mixed model, with different test lots and individual ages as fixed effects and each individual as a random effect for fitting. A specific implementation of the 5-fold cross-validation method is as follows:
# library (parallel) to unload the required R packages and functions
library (asreml)
library (pROC)
# Phenotype, pheno <- asreml. read. table("phenotype_931.csv", header=T, sep=",", na.string=NA)
ped <- asreml. read. table("pedigree_931.csv", header=T, sep=",", na.string=NA)
ainv <- asreml. Ainverse (ped) $ginv
Ginv<-fread(".../Ginv_at_iter_4.csv")
attr(Ginv, "rowNames") <- paste(pheno[, 1])
# Set the number of external cycles N <- 10
# Set result variables res <- matrix(NA, nrow = 5*N, ncol = 1)
colnames(res) <- c("auc")
# Perform cross-validation method and output validation result for (i in 1: N) {
coor <- read. table(file = paste("./coor/coor_", i, ".csv", sep = ""), sep = ",", header = F)
cyc <- ncol(coor)
gebv <- matrix(NA, nrow = nrow(pheno), ncol = cyc)
for (j in 1: cyc) {
y <- pheno
y$status[coor[(1: sum(coor[, j] > 0, na.rm = T)), j]] <- NA
CV<−asreml(status ~ Batch + Age,
random = ~ giv(AnimalID),
ginverse = list(AnimalID = Ginv),
rcov = ~units,
family = asreml. binomial(link = “logit”),
data = y,
maxiter = 50)

gebv[,j] <- coef(CV)$random
write. table(gebv, file = paste("GEBVs_chip_ref_coor_", i, ".csv", sep = ""), sep = ",", row.names = F, quote = F)
res[(j+(i-1)*cyc), ] <- roc(as.vector(pheno$status[coor[(1:sum(coor[, j]>0, na.rm=T)), j]]), as.vector(gebv[coor[(1: sum(coor[, j] > 0, na.rm = T)), j], j])) $auc
write. table(res, "results_of_roc_op_chip_ref.csv", sep = ",", row.names = F, quote = F)
}
}
# The conventional BLUP estimation method is similar to the weighted GBLUP, and we can use ainv to replace Ginv in the model The mean AUC by method is 0.579. The results of the 50-fold cross-validation method are according to Table 7:

Figure 0007158496000012
Figure 0007158496000013
Figure 0007158496000012
Figure 0007158496000013

3、「魚チップ1号」遺伝子チップがヒラメ耐病性育種における応用
候補個体のゲノム推定育種価(GEBV)を推定するために、まず候補個体遺伝子型(「魚チップ1号」遺伝子チップ分類から取得される)を発明者の既存の参考グループ遺伝子型と組み合わせ、さらにRを用いて加重Gマトリックスを構築し、最後に用意済みの加重Gマトリックスと表現型データをASReml-Rに代入し、加重GBLUP方法を用いて各家系成体のGEBVを推定し、さらに成体GEBVの平均値を相応の家系のGEBVとする。各家系を感染生存率に従って高生存率家系(生存率が55%より高い)と低生存率家系(生存率が55%より低い)、かつ各家系GEBVと感染生存率の間のAUC値を計算し、さらに該AUC値を2で取得したAUC値と比較し、近いひいては2で取得したAUCより高い場合、発明者が設計される遺伝子チップがゲノム選択技術の要件を満たしかつヒラメ耐病性選別において優れた応用効果を有することを示す。各家系GEBVと感染生存率の間のAUC値を計算しかつ該AUC値を2で取得したAUC値と比較し、それにより発明者が設計される遺伝子チップとゲノム選択技術がヒラメ耐病性選別における実際の応用効果を検証するために用いられるもの。AUC値を推定する前、子の代の感染生存率を

Figure 0007158496000014
で変換を実施する。変換後、平均値より高い家系の生存率を1と記載し、平均値より低いものを0と記載する。遺伝子チップ方法がマーカーに対する要件を満たし、すべての候補個体の分類結果に組み合わせを行い、各チップの分類結果を単独で参考グループ遺伝子型と合併する必要がある。 3. Application of "fish chip No. 1" gene chip in flounder disease resistance breeding ) is combined with the inventor's existing reference group genotypes, and R is used to construct a weighted G matrix, and finally the prepared weighted G matrix and phenotypic data are substituted into ASReml-R, weighted GBLUP The method is used to estimate the GEBV of each pedigree adult, and the mean of the adult GEBV is taken as the GEBV of the corresponding pedigree. Each family was classified according to infection survival rate, high survival rate families (>55% survival rate) and low survival rate families (less than 55% survival rate), and AUC values between each family's GEBV and infection survival rate. and further compare the AUC value with the AUC value obtained in 2, and if it is close or even higher than the AUC obtained in 2, the gene chip designed by the inventor meets the requirements of genome selection technology and flounder disease resistance It shows that it has excellent application effect in sorting. Calculate the AUC value between each pedigree GEBV and the infection survival rate and compare the AUC value with the AUC value obtained in 2, whereby the gene chip and genome selection technology designed by the inventors is used for flounder disease resistance screening. It is used to verify the actual application effect in Before estimating the AUC value, the infection survival rate of offspring was
Figure 0007158496000014
to perform the conversion. After transformation, family survival rates above the mean are described as 1 and those below the mean as 0. The gene chip method should meet the requirements for markers, combine the classification results of all candidate individuals, and merge the classification results of each chip alone with the reference group genotype.

解析結果によると16の候補群の子の代家系における6の高生存率家系の平均生存率が62.4%であるおよび10の低生存率家系の平均的な生存率が33.47%(表8)であることがわかる。そのうち、高生存率家系成体の平均的なGEBVが2.10で、低生存率家系成体の平均的なGEBVが1.34である。計算はよると、これらのヒラメ家系のGEBV値を利用してはその感染生存率が予測される正確性が0.794に達することができ、理論値に近い。そのため、発明者が設計される遺伝子チップはヒラメ耐病性性状の選別に効果的に応用することができる。 The results of the analysis showed that in the 16 candidate groups of progeny of offspring, 6 high survival families had a mean survival rate of 62.4% and 10 low survival families had a mean survival rate of 33.47% ( Table 8). Of which, the average GEBV for adults in high survival families is 2.10, and the average GEBV for adults in low survival families is 1.34. Calculations show that using the GEBV values of these flounder families, the accuracy of predicting their infection survival rate can reach 0.794, which is close to the theoretical value. Therefore, the gene chip designed by the inventors can be effectively applied to the screening of flounder for disease-resistant traits.

具体的な操作方法は以下のとおりである:
PLINKとRを利用して候補個体分類ファイルを利用し、さらに候補個体から後続検証に用いられる個体を選定し、これらの個体情報を一つのテキストファイルに貯蔵し、家系番号・個体番号・父本番号・母本番号・性別と表現型値の配列に従ってテキストファイルを用意し、各行に一つの個体で、各個体の各項目情報はtable区切り記号を用いて区切りを実施する。発明を実施するための形態:
#各SNPチップの分類結果を読み取り、結果はxxx.bed、xxx.bimとxxx.famに貯蔵される。
plink --vcf op1.vcf --make-bed --out op1
plink --vcf op2.vcf --make-bed --out op2
plink --vcf op3.vcf --make-bed --out op3

Rにおいて各ファイルにおけるSNPの命名方法を変換し、具体的な操作方法は以下のとおりである:
#Rパッケージをアンロードする
library(data.table)
#データを読み取る
op1 <- fread(“op1.bim”, header = F)
op2 <- fread(“op2.bim”, header = F)
op3 <- fread(“op3.bim”, header = F)
#SNP名称を変換する
op1$V2 <- paste(paste(rep(“rs”, nrow(op1)), op1$V1, sep =“ ”), op1$V4, sep = “:”)
op2$V2 <- paste(paste(rep(“rs”, nrow(op2)), op2$V1, sep =“ ”), op2$V4, sep = “:”)
op3$V2 <- paste(paste(rep(“rs”, nrow(op3)), op3$V1, sep =“ ”), op3$V4, sep = “:”)
#各チップが取得した分類情報を出力する
write.table(op1$V2, “snps_op1.txt”, sep = “¥t”, col.names = F, row.names = F, quote = F)
write.table(op2$V2, “snps_op2.txt”, sep = “¥t”, col.names = F, row.names = F, quote = F)
write.table(op3$V2, “snps_op3.txt”, sep = “¥t”, col.names = F, row.names = F, quote = F)
#整理後のbimファイルを出力する
fwrite(op1, “op1.bim”, sep = “¥t”, col.names = F, row.names = F, quote = F)
fwrite(op2, “op2.bim”, sep = “¥t”, col.names = F, row.names = F, quote = F)
fwrite(op3, “op3.bim”, sep = “¥t”, col.names = F, row.names = F, quote = F)
#Rを終了する

検証個体情報を用意し、かつタイトルが「selected_indi.txt」のファイルに入れ、ファイルは以下の方式に従って整理する:
1701.CEL 1701.CEL 0 0 0 -9
1702.CEL 1702.CEL 0 0 0 -9
#PLINKソフトウェアを利用して各SNPチップに含まれる個体を抽出し、その方式は以下のとおりである:
plink --bfile op1 --keep selected_indi.txt --make-bed --out op_can_1
plink --bfile op2 --keep selected_indi.txt --make-bed --out op_can_2
plink --bfile op3 --keep selected_indi.txt --make-bed --out op_can_3

#検証に用いられる個体の遺伝子型ファイルを出力する
plink --bfile op_can_1 --recode A --out op1
plink --bfile op_can_2 --recode A --out op2
plink --bfile op_can_3 --recode A --out op3
さらにRにおいて以下のコマンドを用いて遺伝子型ファイルを出力する:
library(data.table)
op1 <- fread(“op1.raw”)
op2 <- fread(“op2.raw”)
op3 <- fread(“op3.raw”)
op1[, c(1:6):= NULL]
op2[, c(1:6):= NULL]
op3[, c(1:6):= NULL]
fwrite(op1, “geno_op1_can.csv”, sep = “ ,”, row.names = F, quote = F)
fwrite(op2, “geno_op2_can.csv”, sep = “ ,”, row.names = F, quote = F)
fwrite(op3, “geno_op3_can.csv”, sep = “ ,”, row.names = F, quote = F)
a)PLINKとRを利用して参考グループ遺伝子型をそれぞれ各チップ候補個体遺伝子型と組み合わせを実施し、具体的な操作方法は以下のとおりである:
plink --bfile …/op_Ref --extract snps_op1.txt --recode A --out op_Ref_op1
plink --bfile …/op_Ref --extract snps_op2.txt --recode A --out op_Ref_op2
plink --bfile …/op_Ref --extract snps_op3.txt --recode A --out op_Ref_op3
さらにRにおいて以下のコマンドを用い、遺伝子型ファイルを処理し、出力する:
library(data.table)
#参考グループ遺伝子型を読み取りかつそれに処理を実施する
op1_ref <- fread(“op_Ref_op1.raw”)
op2_ref <- fread(“op_Ref_op2.raw”)
op3_ref <- fread(“op_Ref_op3.raw”)
op1_ref[, c(1: 6):= NULL]
op2_ref[, c(1: 6):= NULL]
op3_ref[, c(1: 6):= NULL]
op1_ref <- as.matrix(op1_ref)
op2_ref <- as.matrix(op2_ref)
op3_ref <- as.matrix(op3_ref)
#候補個体遺伝子型を処理する:読み取り、組み合わせ
op1_can <- as.matrix(fread(“geno_op1_can.csv”))
op2_can <- as.matrix(fread(“geno_op2_can.csv”))
op3_can <- as.matrix(fread(“geno_op3_can.csv”))
geno_op1 <- rbind(op1_can, op1_ref)
geno_op2 <- rbind(op2_can, op2_ref)
geno_op3 <- rbind(op3_can, op3_ref)
write.table(geno_op1, “geno_op1.csv”, sep = “ ,”, row.names = F, quote = F)
write.table(geno_op2, “geno_op2.csv”, sep =“ ,”, row.names = F, quote = F)
write.table(geno_op3, “geno_op3.csv”, sep = “ ,”, row.names = F, quote = F)
The specific operation method is as follows:
PLINK and R are used to use the candidate individual classification file, select the individual to be used for subsequent verification from the candidate individual, store this individual information in a single text file, and store the family number, individual number, and father book. A text file is prepared according to the sequence of number, parental book number, sex and phenotype value, one individual per line, and each item information of each individual is separated using a table delimiter. MODES FOR CARRYING OUT THE INVENTION:
# Read the classification result of each SNP chip, the result is xxx. bed, xxx. bim and xxx. Stored in fam.
plink --vcf op1. vcf --make-bed --out op1
plink --vcf op2. vcf --make-bed --out op2
plink --vcf op3. vcf --make-bed --out op3

Convert the SNP naming method in each file in R, the specific operation method is as follows:
# unload R package library(data.table)
# read data op1 <- fread("op1.bim", header = F)
op2 <-fread("op2.bim", header = F)
op3 <-fread("op3.bim", header = F)
# Convert SNP names op1$V2 <- paste (paste(rep(“rs”, nrow(op1)), op1$V1, sep = “ ”), op1$V4, sep = “:”)
op2$V2<-paste(paste(rep("rs", nrow(op2)), op2$V1, sep=""), op2$V4, sep=":")
op3$V2<-paste(paste(rep("rs", nrow(op3)), op3$V1, sep=""), op3$V4, sep=":")
# output the classification information acquired by each chip write. table (op1$V2, "snps_op1.txt", sep = "\t", col.names = F, row.names = F, quote = F)
write. table (op2$V2, "snps_op2.txt", sep = "\t", col.names = F, row.names = F, quote = F)
write. table (op3$V2, "snps_op3.txt", sep = "\t", col.names = F, row.names = F, quote = F)
# fwrite (op1, “op1.bim”, sep = “\t”, col.names = F, row.names = F, quote = F) to output the bim file after organization
fwrite(op2, "op2.bim", sep = "\t", col.names = F, row.names = F, quote = F)
fwrite(op3, "op3.bim", sep = "\t", col.names = F, row.names = F, quote = F)
# Exit R

Prepare verification individual information and put it in a file with the title "selected_indi.txt", and organize the file according to the following method:
1701. CEL 1701. CEL 0 0 0 -9
1702. CEL 1702. CEL 0 0 0 -9
# Using PLINK software to extract the individuals contained in each SNP chip, the method is as follows:
plink --bfile op1 --keep selected_indi. txt --make-bed --out op_can_1
plink --bfile op2 --keep selected_indi. txt --make-bed --out op_can_2
plink --bfile op3 --keep selected_indi. txt --make-bed --out op_can_3

# Output the individual genotype file used for verification plink --bfile op_can_1 --recode A --out op1
plink --bfile op_can_2 --recode A --out op2
plink --bfile op_can_3 --recode A --out op3
In addition, output the genotype file using the following command in R:
library(data.table)
op1 <-fread("op1.raw")
op2 <-fread("op2.raw")
op3 <-fread("op3.raw")
op1[, c(1:6) := NULL]
op2[, c(1:6) := NULL]
op3[, c(1:6) := NULL]
fwrite(op1, "geno_op1_can.csv", sep = ",", row.names = F, quote = F)
fwrite(op2, "geno_op2_can.csv", sep = ",", row.names = F, quote = F)
fwrite(op3, "geno_op3_can.csv", sep = ",", row.names = F, quote = F)
a) Use PLINK and R to combine the genotypes of the reference group with the genotypes of each chip candidate individual, and the specific operation method is as follows:
plink --bfile .../op_Ref --extract snps_op1. txt --recode A --out op_Ref_op1
plink --bfile .../op_Ref --extract snps_op2. txt --recode A --out op_Ref_op2
plink --bfile .../op_Ref --extract snps_op3. txt --recode A --out op_Ref_op3
Further process and output the genotype file in R using the following command:
library(data.table)
# Read the reference group genotype and perform processing on it op1_ref <- fread("op_Ref_op1.raw")
op2_ref <-fread(“op_Ref_op2.raw”)
op3_ref <-fread(“op_Ref_op3.raw”)
op1_ref[, c(1:6) := NULL]
op2_ref[, c(1:6) := NULL]
op3_ref[, c(1:6) := NULL]
op1_ref <- as. matrix (op1_ref)
op2_ref <- as. matrix(op2_ref)
op3_ref <- as. matrix(op3_ref)
# Process candidate individual genotypes: read, combine op1_can <- as. matrix(fread(“geno_op1_can.csv”))
op2_can <- as. matrix(fread(“geno_op2_can.csv”))
op3_can <- as. matrix(fread(“geno_op3_can.csv”))
geno_op1 <-rbind(op1_can, op1_ref)
geno_op2 <- rbind(op2_can, op2_ref)
geno_op3 <-rbind(op3_can, op3_ref)
write. table(geno_op1, "geno_op1.csv", sep = ",", row.names = F, quote = F)
write. table(geno_op2, "geno_op2.csv", sep = ",", row.names = F, quote = F)
write. table(geno_op3, "geno_op3.csv", sep = ",", row.names = F, quote = F)

b)b)における処理済みの4つの遺伝子型ファイルを利用し、Rにおいてそれぞれ加重Gマトリックスを構築し、加重Gマトリックスの構築方法は1)における記述と同様である b) Use the four genotype files processed in b) to construct a weighted G matrix respectively in R, and the construction method of the weighted G matrix is the same as described in 1).

c)ASReml-Rを用いて候補個体のGEBVを推定し、コードは以下のとおりである:
#必要なRパッケージと関数をアンロードする
library(data.table)
library(asreml)
###op1###
pheno <- asreml.read.table(“phenotype_op1.csv”, sep = “ ,”, header = T)
Ginv <- fread(“…/Ginv_op1.csv”)
attr(Ginv, “rowNames”) <- paste(pheno[, 1])
op1 <- asreml(status ~ Batch + Age, random = ~giv(AnimalID), ginverse = list(AnimalID = Ginv), rcov = ~units, family = asreml.binomial(link = “logit”), na.method.X = “omit”, data = pheno, maxiter = 50)
write.table(coef(op1)$random, “gebv_op1.csv”, sep = “ ,”, col.names = F, quote = F)

###op2###
pheno <- asreml.read.table(“ phenotype_op2.csv”, sep = “ ,”, header = T)
Ginv <- fread(“ …/Ginv_op2.csv”)
attr(Ginv, “ rowNames”) <- paste(pheno[, 1])
op2 <- asreml(status ~ Batch + Age, random = ~giv(AnimalID), ginverse = list(AnimalID = Ginv), rcov = ~units, family = asreml.binomial(link = “logit”), na.method.X = “ omit”, data = pheno, maxiter = 50)
write.table(coef(op2)$random, “ gebv_op2.csv”, sep = “ ,”, col.names = F, quote = F)
###op3###
pheno <- asreml.read.table(“ phenotype_op3.csv”, sep = “ ,”, header = T)
Ginv <- fread(“ …/Ginv_op3.csv”)
attr(Ginv, “rowNames”) <- paste(pheno[, 1])
op3 <- asreml(status ~ Batch + Age, random = ~giv(AnimalID), ginverse = list(AnimalID = Ginv), rcov = ~units, family = asreml.binomial(link = “logit”), na.method.X = “omit”, data = pheno, maxiter = 50)
write.table(coef(op3)$random, “gebv_op3.csv”, sep = “ ,”, col.names = F, quote = F)
c) Estimate the candidate individual's GEBV using ASReml-R, the code is:
# library (data.table) to unload the required R packages and functions
library (asreml)
###op1###
pheno <- asreml. read. table("phenotype_op1.csv", sep = ",", header = T)
Ginv<-fread(".../Ginv_op1.csv")
attr(Ginv, "rowNames") <- paste(pheno[, 1])
op1 <- asreml(status ~ Batch + Age, random = ~ giv (AnimalID), ginverse = list (AnimalID = Ginv), rcov = ~ units, family = asreml.binomial (link = "logit"), na.method. X = “omit”, data = pheno, maxiter = 50)
write. table(coef(op1)$random, "gebv_op1.csv", sep = ",", col.names = F, quote = F)

###op2###
pheno <- asreml. read. table("phenotype_op2.csv", sep = ",", header = T)
Ginv<-fread(".../Ginv_op2.csv")
attr(Ginv, "rowNames") <- paste(pheno[, 1])
op2 <- asreml(status ~ Batch + Age, random = ~ giv (AnimalID), ginverse = list (AnimalID = Ginv), rcov = ~ units, family = asreml.binomial (link = "logit"), na.method. X = “omit”, data = pheno, maxiter = 50)
write. table(coef(op2)$random, "gebv_op2.csv", sep = ",", col.names = F, quote = F)
###op3###
pheno <- asreml. read. table("phenotype_op3.csv", sep = ",", header = T)
Ginv<-fread(".../Ginv_op3.csv")
attr(Ginv, "rowNames") <- paste(pheno[, 1])
op3 <- asreml(status ~ Batch + Age, random = ~ giv (AnimalID), ginverse = list (AnimalID = Ginv), rcov = ~ units, family = asreml.binomial (link = "logit"), na.method. X = “omit”, data = pheno, maxiter = 50)
write. table(coef(op3)$random, "gebv_op3.csv", sep = ",", col.names = F, quote = F)

d)推定されるすべての候補個体のGEBVに従って相応の家系GEBVを計算し、かつ公式

Figure 0007158496000015
を用いて各家系の感染生存率に変換を行い、変換後の平均値より高い家系の生存率を1に設定し、平均値より低いものを0に設定する。最後に、各家系GEBVと変換後の生存率の間のAUC値を計算し、AUC値の計算方法は以下のとおりである:
#必要なRパッケージをアンロードする
library(data.table)
library(pROC)
#各家系GEBVおよび相応の感染生存率を一つのファイルに整理し、家系番号・GEBVと変換後の感染生存率の配列に従って配列し、各行は一つの家系情報のみを含み、整理後のファイルを読み取る
res <- fread(“…/gebv_and_sr_op_can.csv”)
#子の代の家系生存率を変換する
res$SR_trans <- exp(res$SR) / (1 + exp(res$SR))
#変換後の生存率を平均値に従ってさらに0と1に変換する
SR_binary <- matrix(NA, nrow = nrow(res), ncol = 1)
SR_binary[which(res$SR_trans > mean(res$SR_trans)), ] <- 1
SR_binary[which(res$SR_trans < mean(res$SR_trans)), ] <- 0
#AUC値を計算する
roc(SR_binary[, 1], res$GEBV)
計算を経て、16のヒラメ家系のGEBVを取得し、計算は各家系GEBVと相応の感染生存率の間のAUC(正確性)が0.794であることを示す。各家系GEBVおよび感染生存率は表8による。 d) Calculate the corresponding pedigree GEBV according to the estimated GEBV of all candidate individuals, and formula
Figure 0007158496000015
is used to convert the infection survival rate of each family, and the survival rate of families higher than the mean after conversion is set to 1, and the survival rate lower than the mean is set to 0. Finally, the AUC value between each pedigree GEBV and post-conversion survival rate was calculated, and the method for calculating the AUC value is as follows:
# library (data.table) to unload the required R packages
library (pROC)
# Organize each family GEBV and corresponding infection survival rate into one file, arranged according to the order of family number/GEBV and infection survival rate after conversion, each line contains only one family information, file after sorting read res <- fread(".../gebv_and_sr_op_can.csv")
# Translate pedigree viability res$SR_trans <- exp(res$SR) / (1 + exp(res$SR))
# Further transform the post-transformed viability to 0 and 1 according to the mean SR_binary <- matrix(NA, nrow = nrow(res), ncol = 1)
SR_binary[which (res$SR_trans > mean(res$SR_trans)), ] <- 1
SR_binary[which (res$SR_trans < mean(res$SR_trans)), ] <- 0
# Calculate AUC value roc(SR_binary[, 1], res$GEBV)
Through calculation, the GEBV of 16 flounder families are obtained, and the calculation shows that the AUC (accuracy) between each family's GEBV and the corresponding infection survival rate is 0.794. Each kindred GEBV and infection survival rate are according to Table 8.

Figure 0007158496000016
候補群における16の子の代家系を感染生存率に従って6の高生存率家系(平均的な生存率が62.4%である)と10の低生存率家系(平均的な生存率が33.47%である)の二大類(表8)に分け、高生存率と低生存率家系の成体のGEBVを比較し、高生存率家系成体の平均的なGEBVが2.10で、低生存率家系成体の平均的なGEBVが1.34であることがわかる。計算はよると、これらのヒラメ家系のGEBV値を使用してはその感染生存率が予測される正確性が0.794に達することができ、理論値に近い。そのため、発明者が設計される遺伝子チップはヒラメ耐病性性状の選別に効果的に応用することができる。
Figure 0007158496000016
The 16 offspring in the candidate group were divided into 6 high-survival families (with an average survival rate of 62.4%) and 10 low-survival families (with an average survival rate of 33.4%) according to infection survival rate. 47%) were divided into two categories (Table 8) and compared the GEBV of adults in high-survival and low-survival families. It can be seen that the average GEBV for an adult family is 1.34. Calculations show that using the GEBV values of these flounder families, the accuracy of predicting their infection survival rate can reach 0.794, which is close to the theoretical value. Therefore, the gene chip designed by the inventors can be effectively applied to the screening of flounder for disease-resistant traits.

上記の結果によると、「魚チップ1号」遺伝子チップを用いてヒラメ候補グループの個体に遺伝子分類を行い、加重GBLUPを用いてゲノム育種値(GEBV)を計算し、GEBV数値の大きさに従ってヒラメ耐病性親魚の選別を行い、これらの親魚を用いて育成される後代種苗の耐感染生存率が顕著に高まり、それにより「魚チップ1号」遺伝子チップがヒラメ耐病性優良品種の育成において普及応用を実行することができることがわかる。 According to the above results, the “Fish Chip No. 1” gene chip was used to perform genetic classification on the individuals of the flounder candidate group, the weighted GBLUP was used to calculate the genomic breeding value (GEBV), and the flounder according to the magnitude of the GEBV number Disease-resistant parent fish are selected, and the infection-resistant survival rate of progeny seedlings grown using these parent fish is remarkably increased. It turns out that it is possible to execute

産業中の実施可能性
本発明より提供されるヒラメ耐病性に関連するSNPローカスの遺伝子チップはヒラメ耐病性個体の選別に利用可能であり、かつ実際の選択正確性は理論値に近いため、ヒラメ耐病性優良品種の選択正確性を高め、育種期間を短縮させ、これにより、ヒラメ耐病性優良品種の選別のための遺伝子チップ技術を提供し、魚類耐病性優良品種の選別のための遺伝子チップ育種の新しい道を切り開くことができる。
Feasibility in Industry The gene chip of the SNP locus associated with flounder disease resistance provided by the present invention can be used for selection of flounder disease-resistant individuals, and the actual selection accuracy is close to the theoretical value. Increase the accuracy of selection of excellent disease-resistant varieties and shorten the breeding period, thereby providing gene chip technology for selecting excellent disease-resistant flounder varieties and gene-chip breeding for selecting excellent disease-resistant fish varieties. can open up new avenues for

Claims (2)

ヒラメ耐病性個体の選別方法であって、上記の耐病性に関する病気はエドワジエラ症であり、前記方法は遺伝子チップを用いて検測し、前記遺伝子チップはヒラメ耐病性に関連する一組のSNPローカスを検測し、前記の一組のSNPローカスは48697個のSNPローカスを含み、前記48697個のSNPローカスは、配列がSEQ ID NO:1-SEQ ID NO:48697である48697個の配列における、各配列の第36位の塩基であることを特徴とするヒラメ耐病性個体の選別方法。 A method for selecting disease-resistant individuals of flounder, wherein the disease related to disease resistance is Edwardziellasis, the method is detected using a gene chip, and the gene chip is a set of SNP locus associated with disease resistance of flounder and the set of SNP loci includes 48697 SNP loci, and the 48697 SNP loci are in 48697 sequences whose sequences are SEQ ID NO: 1-SEQ ID NO: 48697, A method for selecting a disease-resistant individual of Japanese flounder, characterized in that it is the 36th base of each sequence . 請求項に記載のヒラメ耐病性個体の選別方法であって、
前記方法は、
1)候補グループにおける個体ゲノムDNAを抽出し、かつ上記の遺伝子チップを利用して検測して、SNPマーカーの遺伝子型判定の結果を取得するステップと、
2)参考グループのSNP集合より遺伝子チップと同様であるSNPローカスの遺伝子型判定の結果を抽出し、さらに参考グループのSNPの遺伝子型判定の結果とチップを利用して取得した候補グループの遺伝子型判定の結果とを合併するステップ、
3)合併された遺伝子型と参考グループが表現した型とを利用し、加重GBLUP方法を用いて候補グループの推定育種値GEBVを推定し、さらにGEBV値に基づいて被検測個体の耐病性潜在能力を決定するステップと、
を含み、
かつ、参考グループの遺伝子型を利用し、加重最良線形不偏推定方法を用いて予測正確性を推定し、
その中、予測正確性の推定において5倍交差検証方法を利用し、特徴曲線の下の面積AUCを予測正確性を判定する指標とし、AUCが1に近いほど予測正確性は高い
ことを特徴とするヒラメ耐病性個体の選別方法。
The method for selecting disease-resistant flounder individuals according to claim 1 ,
The method includes:
1) A step of extracting individual genomic DNA in the candidate group and performing detection using the gene chip to obtain results of genotyping of SNP markers;
2) From the SNP set of the reference group, the results of genotyping of SNP loci similar to those of the gene chip are extracted, and the results of genotyping of the SNPs of the reference group and genotypes of the candidate group obtained using the chip are obtained. merging the result of the determination;
3) Using the combined genotype and the type represented by the reference group, the weighted GBLUP method is used to estimate the estimated breeding value GEBV of the candidate group, and the disease resistance potential of the tested individuals based on the GEBV value. determining capabilities;
including
and using the genotype of the reference group to estimate the prediction accuracy using the weighted best linear unbiased estimation method,
Among them, a 5-fold cross-validation method is used in estimating prediction accuracy, and the area under the characteristic curve AUC is used as an index for determining prediction accuracy, and the closer the AUC is to 1, the higher the prediction accuracy. A method for selecting disease-resistant flounder individuals.
JP2020556756A 2019-12-17 2019-12-17 Disease resistance breeding gene chip of flounder and its application Active JP7158496B2 (en)

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/CN2019/125857 WO2021119980A1 (en) 2019-12-17 2019-12-17 Gene chip for disease resistance breeding of olive flounder and application thereof

Publications (2)

Publication Number Publication Date
JP2022518304A JP2022518304A (en) 2022-03-15
JP7158496B2 true JP7158496B2 (en) 2022-10-21

Family

ID=71001219

Family Applications (1)

Application Number Title Priority Date Filing Date
JP2020556756A Active JP7158496B2 (en) 2019-12-17 2019-12-17 Disease resistance breeding gene chip of flounder and its application

Country Status (3)

Country Link
JP (1) JP7158496B2 (en)
CN (1) CN111278994B (en)
WO (1) WO2021119980A1 (en)

Families Citing this family (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114015789A (en) * 2021-12-06 2022-02-08 中国水产科学研究院黄海水产研究所 Genome selection method for cultivating disease-resistant improved Dongxiang spots
CN114410746B (en) * 2022-03-29 2022-07-12 中国海洋大学三亚海洋研究院 Dongxiang spot molecule source-tracing selection breeding method and application thereof
CN114990243B (en) * 2022-06-21 2023-06-09 中国水产科学研究院黄海水产研究所 Method for screening vibriosis-resistant cynoglossus semilaevis marker combination and cynoglossus semilaevis disease-resistant individual
CN115992265B (en) * 2023-03-22 2023-07-14 中山大学 Grouper whole genome liquid phase chip and application thereof
KR102602152B1 (en) * 2023-08-04 2023-11-13 제주대학교 산학협력단 A method for producing superior seeds using egg quality evaluation of artificially fertilized eggs of flounder mothers.

Family Cites Families (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101911918B (en) * 2010-03-18 2011-06-15 中国水产科学研究院黄海水产研究所 Selective breeding method for disease-resistant superior strains of paralichthys olivaceus
CN104304096B (en) * 2014-08-26 2016-01-27 中国水产科学研究院黄海水产研究所 The selection of the sick excellent strain of the anti-Edwardsiella tarda of a kind of lefteye flounder
CN104328116A (en) * 2014-11-06 2015-02-04 中国海洋大学 SNP (single-nucleotide polymorphism) locus related to heat resistance of Paralichthys olivaceus and application thereof
CN105624318B (en) * 2016-03-24 2019-05-03 中国水产科学研究院北戴河中心实验站 One kind SNP site relevant to lefteye flounder growth traits, its screening technique and application
CN105713974B (en) * 2016-03-24 2019-04-19 中国水产科学研究院北戴河中心实验站 One kind SNP marker relevant to lefteye flounder quantitative character, its screening technique and application
CN106480189B (en) * 2016-10-18 2018-11-09 中国水产科学研究院黄海水产研究所 A kind of disease-resistant prevalent variety cultivation method of fish based on full-length genome selection

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
Fish and Shellfish Immunology,2017年,vol.66,p.43-49
水産育種,2016年,vol.46, no.1,p.1-14

Also Published As

Publication number Publication date
JP2022518304A (en) 2022-03-15
CN111278994B (en) 2023-02-07
CN111278994A (en) 2020-06-12
WO2021119980A1 (en) 2021-06-24

Similar Documents

Publication Publication Date Title
JP7158496B2 (en) Disease resistance breeding gene chip of flounder and its application
CN111225986B (en) Chicken whole genome SNP chip and application thereof
CN110191965B (en) Pig whole genome 50K SNP chip and application
CN107967409B (en) Pig whole genome low-density SNP chip and manufacturing method and application thereof
Seidel Brief introduction to whole-genome selection in cattle using single nucleotide polymorphisms
CN110484636B (en) Molecular marker related to pig total papilla number characters and application thereof
Naqvi Application of molecular genetic technologies in livestock production: potentials for developing countries
CN110734983A (en) SNP markers related to intramuscular fat traits of Suhuai pigs, and detection method and application thereof
JP2018525985A (en) Cow breeding method to improve milk production
Sabir et al. Applying molecular tools for improving livestock performance: From DNA markers to next generation sequencing technologies
Jones et al. Progress and opportunities through use of genomics in animal production
CN117431324A (en) High-density SNP chip in whole genome of dairy cow and application thereof
CN111944913A (en) Cynoglossus semilaevis disease-resistant breeding gene chip and application thereof
CN110144406B (en) Method for screening DNA bar codes of Kebao broiler chickens and application thereof
KR101825497B1 (en) Kits for Detecting Equine Maternal lineage and Predicting Athletic Ability with Single Nucleotide Polymorphism and Method for Detection of Equine Maternal lineage and Prediction of Athletic Ability thereby
JP2010533491A (en) Methods for improving the genomic marker index of dairy animals and dairy products
Singh et al. Whole-genome selection in livestock
Raschia et al. Quantitative trait loci exploration and characterization of gestation length in Holstein cattle
Forutan et al. A conditional multi-trait sequence GWAS of heifer fertility in tropically adapted beef cattle
Stolpovsky et al. Genomic Selection. I: Latest Trends and Possible Ways of Development
Calboli et al. Genomic selection for survival under naturally occurring Saprolegnia oomycete infection in farmed European whitefish Coregonus lavaretus
BÜLBÜL et al. USE OF SNP CHIPS FOR GENOME-WIDE ASSOCIATION STUDIES IN LIVESTOCK
Van Eenennaam Improving EPD accuracy by combining EPD information with DNA test results
WO2009048344A2 (en) Genetic markers and methods of use as predictive indicators for production traits in animals, particularly bovine animals
McLaren et al. Genome-wide selection for reproductive traits in swine

Legal Events

Date Code Title Description
A621 Written request for application examination

Free format text: JAPANESE INTERMEDIATE CODE: A621

Effective date: 20201103

A131 Notification of reasons for refusal

Free format text: JAPANESE INTERMEDIATE CODE: A131

Effective date: 20220407

A521 Request for written amendment filed

Free format text: JAPANESE INTERMEDIATE CODE: A523

Effective date: 20220701

TRDD Decision of grant or rejection written
A01 Written decision to grant a patent or to grant a registration (utility model)

Free format text: JAPANESE INTERMEDIATE CODE: A01

Effective date: 20221004

A61 First payment of annual fees (during grant procedure)

Free format text: JAPANESE INTERMEDIATE CODE: A61

Effective date: 20221011

R150 Certificate of patent or registration of utility model

Ref document number: 7158496

Country of ref document: JP

Free format text: JAPANESE INTERMEDIATE CODE: R150