CN113930538B - 一种用于鉴定水稻适种区域的试剂盒及其预测方法和预测模型的构建 - Google Patents
一种用于鉴定水稻适种区域的试剂盒及其预测方法和预测模型的构建 Download PDFInfo
- Publication number
- CN113930538B CN113930538B CN202111288744.4A CN202111288744A CN113930538B CN 113930538 B CN113930538 B CN 113930538B CN 202111288744 A CN202111288744 A CN 202111288744A CN 113930538 B CN113930538 B CN 113930538B
- Authority
- CN
- China
- Prior art keywords
- loc
- chr5
- chr9
- chr7
- chr3
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- 235000007164 Oryza sativa Nutrition 0.000 title claims abstract description 83
- 235000009566 rice Nutrition 0.000 title claims abstract description 83
- 238000000034 method Methods 0.000 title claims abstract description 47
- 238000010276 construction Methods 0.000 title abstract description 15
- 240000007594 Oryza sativa Species 0.000 title description 65
- 238000001514 detection method Methods 0.000 claims abstract description 28
- 241000209094 Oryza Species 0.000 claims abstract 19
- 101100240528 Caenorhabditis elegans nhr-23 gene Proteins 0.000 claims description 114
- 238000012549 training Methods 0.000 claims description 52
- 230000006978 adaptation Effects 0.000 claims description 23
- 238000002372 labelling Methods 0.000 claims description 22
- 240000002582 Oryza sativa Indica Group Species 0.000 claims description 11
- 240000008467 Oryza sativa Japonica Group Species 0.000 claims description 11
- 239000003153 chemical reaction reagent Substances 0.000 claims description 10
- 238000012545 processing Methods 0.000 claims description 9
- 238000004590 computer program Methods 0.000 claims description 6
- 108090000623 proteins and genes Proteins 0.000 claims description 4
- 238000002360 preparation method Methods 0.000 claims description 3
- 238000013528 artificial neural network Methods 0.000 claims description 2
- 210000000349 chromosome Anatomy 0.000 claims description 2
- 235000013339 cereals Nutrition 0.000 abstract description 3
- 238000011156 evaluation Methods 0.000 abstract description 3
- 239000000523 sample Substances 0.000 description 53
- 238000012360 testing method Methods 0.000 description 13
- 238000000746 purification Methods 0.000 description 8
- 238000012163 sequencing technique Methods 0.000 description 5
- 238000012795 verification Methods 0.000 description 5
- 101100398587 Oryza sativa subsp. japonica LAC16 gene Proteins 0.000 description 4
- 239000000706 filtrate Substances 0.000 description 4
- 238000012165 high-throughput sequencing Methods 0.000 description 4
- 239000007788 liquid Substances 0.000 description 4
- 101100058805 Oryza sativa subsp. japonica Os12g0515500 gene Proteins 0.000 description 3
- 101100189030 Oryza sativa subsp. japonica PP2C50 gene Proteins 0.000 description 3
- 238000004458 analytical method Methods 0.000 description 3
- 238000010801 machine learning Methods 0.000 description 3
- 230000008569 process Effects 0.000 description 3
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 3
- IJGRMHOSHXDMSA-UHFFFAOYSA-N Atomic nitrogen Chemical compound N#N IJGRMHOSHXDMSA-UHFFFAOYSA-N 0.000 description 2
- HEDRZPFGACZZDS-UHFFFAOYSA-N Chloroform Chemical compound ClC(Cl)Cl HEDRZPFGACZZDS-UHFFFAOYSA-N 0.000 description 2
- 101100381426 Oryza sativa subsp. japonica BADH1 gene Proteins 0.000 description 2
- 101100004828 Oryza sativa subsp. japonica CYP714B1 gene Proteins 0.000 description 2
- 101100230494 Oryza sativa subsp. japonica HAK23 gene Proteins 0.000 description 2
- 101100347964 Oryza sativa subsp. japonica NAP1;2 gene Proteins 0.000 description 2
- 101100219247 Oryza sativa subsp. japonica Os02g0557500 gene Proteins 0.000 description 2
- 238000013473 artificial intelligence Methods 0.000 description 2
- 230000000052 comparative effect Effects 0.000 description 2
- 238000007796 conventional method Methods 0.000 description 2
- 230000000694 effects Effects 0.000 description 2
- 230000006872 improvement Effects 0.000 description 2
- 239000000203 mixture Substances 0.000 description 2
- 238000012986 modification Methods 0.000 description 2
- 230000004048 modification Effects 0.000 description 2
- 229920000936 Agarose Polymers 0.000 description 1
- 101100112111 Caenorhabditis elegans cand-1 gene Proteins 0.000 description 1
- 238000007400 DNA extraction Methods 0.000 description 1
- 101100321716 Oryza sativa subsp. japonica B'KAPPA gene Proteins 0.000 description 1
- 101100272374 Oryza sativa subsp. japonica BGLU7 gene Proteins 0.000 description 1
- 101100326678 Oryza sativa subsp. japonica CRO1 gene Proteins 0.000 description 1
- 101100391853 Oryza sativa subsp. japonica GATA19 gene Proteins 0.000 description 1
- 101100284158 Oryza sativa subsp. japonica GLU10 gene Proteins 0.000 description 1
- 101100015333 Oryza sativa subsp. japonica GME-1 gene Proteins 0.000 description 1
- 101100123071 Oryza sativa subsp. japonica GRC2 gene Proteins 0.000 description 1
- 101100337779 Oryza sativa subsp. japonica GRF4 gene Proteins 0.000 description 1
- 101100451378 Oryza sativa subsp. japonica HOX2 gene Proteins 0.000 description 1
- 101100071480 Oryza sativa subsp. japonica HSFA1 gene Proteins 0.000 description 1
- 101100017776 Oryza sativa subsp. japonica HSP26.7 gene Proteins 0.000 description 1
- 101100346671 Oryza sativa subsp. japonica MT2B gene Proteins 0.000 description 1
- 101100079496 Oryza sativa subsp. japonica NCED1 gene Proteins 0.000 description 1
- 101100460559 Oryza sativa subsp. japonica NINJA1 gene Proteins 0.000 description 1
- 101100242312 Oryza sativa subsp. japonica OSK1 gene Proteins 0.000 description 1
- 101100009093 Oryza sativa subsp. japonica Os02g0773300 gene Proteins 0.000 description 1
- 101100162369 Oryza sativa subsp. japonica Os05g0428700 gene Proteins 0.000 description 1
- 101100438065 Oryza sativa subsp. japonica Os08g0159800 gene Proteins 0.000 description 1
- 101100399571 Oryza sativa subsp. japonica Os09g0533400 gene Proteins 0.000 description 1
- 101100082592 Oryza sativa subsp. japonica PDC3 gene Proteins 0.000 description 1
- 101100522129 Oryza sativa subsp. japonica PTC1 gene Proteins 0.000 description 1
- 101100034487 Oryza sativa subsp. japonica RLCK176 gene Proteins 0.000 description 1
- 101100475715 Oryza sativa subsp. japonica RSH3 gene Proteins 0.000 description 1
- 101100095029 Oryza sativa subsp. japonica SAT3 gene Proteins 0.000 description 1
- 101100095756 Oryza sativa subsp. japonica SHR1 gene Proteins 0.000 description 1
- 101100421812 Oryza sativa subsp. japonica SODCC2 gene Proteins 0.000 description 1
- 101100150127 Oryza sativa subsp. japonica SPO11-4 gene Proteins 0.000 description 1
- 101100258302 Oryza sativa subsp. japonica SUV3 gene Proteins 0.000 description 1
- 101100536783 Oryza sativa subsp. japonica TGAL7 gene Proteins 0.000 description 1
- 101100537123 Oryza sativa subsp. japonica TIFY10C gene Proteins 0.000 description 1
- 101100207163 Oryza sativa subsp. japonica TOP6B gene Proteins 0.000 description 1
- 238000011529 RT qPCR Methods 0.000 description 1
- 102000006382 Ribonucleases Human genes 0.000 description 1
- 108010083644 Ribonucleases Proteins 0.000 description 1
- 150000001413 amino acids Chemical class 0.000 description 1
- 238000003491 array Methods 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- 230000015556 catabolic process Effects 0.000 description 1
- 230000008859 change Effects 0.000 description 1
- 238000010219 correlation analysis Methods 0.000 description 1
- 238000006731 degradation reaction Methods 0.000 description 1
- 238000001962 electrophoresis Methods 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 235000013305 food Nutrition 0.000 description 1
- 238000007672 fourth generation sequencing Methods 0.000 description 1
- 230000002068 genetic effect Effects 0.000 description 1
- 239000012535 impurity Substances 0.000 description 1
- 238000007689 inspection Methods 0.000 description 1
- 238000002156 mixing Methods 0.000 description 1
- 229910052757 nitrogen Inorganic materials 0.000 description 1
- 230000003287 optical effect Effects 0.000 description 1
- 239000000843 powder Substances 0.000 description 1
- 239000000047 product Substances 0.000 description 1
- 102000004169 proteins and genes Human genes 0.000 description 1
- 238000003908 quality control method Methods 0.000 description 1
- 239000002096 quantum dot Substances 0.000 description 1
- 238000011451 sequencing strategy Methods 0.000 description 1
- 239000002689 soil Substances 0.000 description 1
- 210000001519 tissue Anatomy 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q1/00—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
- C12Q1/68—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
- C12Q1/6876—Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes
- C12Q1/6888—Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes for detection or identification of organisms
- C12Q1/6895—Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes for detection or identification of organisms for plants, fungi or algae
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16B—BIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
- G16B20/00—ICT specially adapted for functional genomics or proteomics, e.g. genotype-phenotype associations
- G16B20/20—Allele or variant detection, e.g. single nucleotide polymorphism [SNP] detection
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q2600/00—Oligonucleotides characterized by their use
- C12Q2600/13—Plant traits
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q2600/00—Oligonucleotides characterized by their use
- C12Q2600/156—Polymorphic or mutational markers
Landscapes
- Life Sciences & Earth Sciences (AREA)
- Chemical & Material Sciences (AREA)
- Engineering & Computer Science (AREA)
- Analytical Chemistry (AREA)
- Health & Medical Sciences (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Organic Chemistry (AREA)
- Physics & Mathematics (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Biotechnology (AREA)
- Zoology (AREA)
- Biophysics (AREA)
- Genetics & Genomics (AREA)
- General Health & Medical Sciences (AREA)
- Molecular Biology (AREA)
- Wood Science & Technology (AREA)
- General Engineering & Computer Science (AREA)
- Evolutionary Biology (AREA)
- Microbiology (AREA)
- Mycology (AREA)
- Immunology (AREA)
- Botany (AREA)
- Bioinformatics & Computational Biology (AREA)
- Biochemistry (AREA)
- Medical Informatics (AREA)
- Spectroscopy & Molecular Physics (AREA)
- Theoretical Computer Science (AREA)
- Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
- Cosmetics (AREA)
- Pharmaceuticals Containing Other Organic And Inorganic Compounds (AREA)
- Polyesters Or Polycarbonates (AREA)
Abstract
本发明公开了一种用于鉴定水稻适种区域的试剂盒及其预测方法和预测模型的构建,涉及水稻种植环境检测技术领域,水稻是比较重要的粮食作物,本发明通过高通量测序的方法,检测和分析籼稻和粳稻群体中SNP位点的基因型,获得了特定的目标SNP位点,采用机器学习和人工智能的方法,构建分析模型,用于对待测水稻品种进行适种区预测。综合预测准确率超过90%,相对于常规的适种区域评价方法,具有成本低,时间短等技术优势。
Description
技术领域
本发明涉及水稻种植环境检测技术领域,具体而言,涉及一种用于鉴定水稻适种区域的试剂盒及其预测方法和预测模型的构建。
背景技术
水稻作为主要的主食农作物,在人们日常生活中占据着及其重要的地位。随着人们生活水平的不断提高,人们对水稻的品质追求逐渐提高,影响水稻的品质的因素除了水稻品种本身的遗传特性之外,生态环境也是影响水稻品质优劣的关键因素,例如土壤因素和气候因素等,优质水稻只有在适宜的生态条件下才能实现品种与气候条件的合理匹配。
现有常规的鉴定水稻适种区域的流程为:基于天气因素进行种植区域的选择,通过育秧和插秧,进行适应性观察。整个过程需要耗费120天左右。因此,如何有效地鉴定水稻的适种区域是如今亟待解决的问题之一。
鉴于此,特提出本发明。
发明内容
本发明的目的在于提供一种用于鉴定水稻适种区域的试剂盒及其预测方法和预测模型的构建。
本发明是这样实现的:
第一方面,本发明实施例提供了用于检测目标SNP位点基因型的试剂在制备用于鉴定水稻适种区域的试剂盒中的应用,所述目标SNP位点的表1所述SNP位点中的至少197个位点。
表1 纬度分布相关SNP位点列表
SNP位点 | 染色体 | 位置 | 分组 | 基因 | 籼稻基因型 | 粳稻基因型 | 相关系数 |
1 | Chr1 | 3504090 | iS | LOC_Os01g07390 | G | A/G | -0.982303 |
2 | Chr1 | 3518700 | iS | LOC_Os01g07420 | G | T/G | -0.944999 |
3 | Chr1 | 3533630 | iS | LOC_Os01g07450 | G | A/G | -0.987614 |
4 | Chr1 | 3678883 | iS | LOC_Os01g07640 | C | A/C | -0.974404 |
5 | Chr1 | 4985027 | iS | LOC_Os01g09670 | C | G/C | 0.984276 |
6 | Chr1 | 27651597 | iS | LOC_Os01g48230 | C | T/C | 0.916038 |
7 | Chr2 | 3800381 | iS | LOC_Os02g07360 | C | T/C | -0.962901 |
8 | Chr2 | 17905603 | iS | LOC_Os02g30140 | G | T/G | 0.972697 |
9 | Chr2 | 22144278 | iS | LOC_Os02g36690 | C | T/C | 0.986419 |
10 | Chr2 | 34765406 | iS | LOC_Os02g56700 | A | C/A | -0.97866 |
11 | Chr2 | 34802120 | iS | LOC_Os02g56770 | C | G/C | -0.995321 |
12 | Chr2 | 34802184 | iS | LOC_Os02g56770 | A | C/A | -0.97189 |
13 | Chr2 | 34839061 | iS | LOC_Os02g56850 | A | G/A | -0.963336 |
14 | Chr2 | 35190956 | iS | LOC_Os02g57420 | C | T/C | -0.999645 |
15 | Chr2 | 35257599 | iS | LOC_Os02g57540 | T | A/T | -0.990985 |
16 | Chr3 | 7432537 | iS | LOC_Os03g13720 | G | A/G | -0.906067 |
17 | Chr3 | 7482380 | iS | LOC_Os03g13810 | G | A/G | -0.912055 |
18 | Chr3 | 7482867 | iS | LOC_Os03g13810 | G | A/G | -0.903824 |
19 | Chr3 | 7577047 | iS | LOC_Os03g13960 | G | A/G | -0.950387 |
20 | Chr3 | 7577109 | iS | LOC_Os03g13960 | C | T/C | -0.94452 |
21 | Chr3 | 7641744 | iS | LOC_Os03g14060 | A | T/A | -0.996892 |
22 | Chr3 | 7697508 | iS | LOC_Os03g14180 | C | T/C | -0.905056 |
23 | Chr3 | 7763466 | iS | LOC_Os03g14260 | G | C/G | -0.954439 |
24 | Chr3 | 7816066 | iS | LOC_Os03g14370 | C | A/C | -0.926546 |
25 | Chr3 | 7832252 | iS | LOC_Os03g14400 | G | A/G | -0.929566 |
26 | Chr3 | 8183508 | iS | LOC_Os03g15000 | C | T/C | 0.990826 |
27 | Chr3 | 8219465 | iS | LOC_Os03g15050 | C | T/C | 0.992718 |
28 | Chr3 | 23933754 | jiS | LOC_Os03g42900 | C | T | -0.948204 |
29 | Chr4 | 6915721 | iS | LOC_Os04g12499 | C | T/C | 0.955724 |
30 | Chr4 | 9283358 | jiS | LOC_Os04g16950 | C | T | -0.993127 |
31 | Chr4 | 33518815 | iS | LOC_Os04g56210 | G | T/G | 0.999991 |
32 | Chr5 | 578975 | iS | LOC_Os05g02020 | C | T/C | -0.967705 |
33 | Chr5 | 21041557 | iS | LOC_Os05g35410 | C | T/C | 0.99897 |
34 | Chr5 | 21603285 | jiS | LOC_Os05g36980 | T | C | 0.99997 |
35 | Chr5 | 27101999 | iS | LOC_Os05g46790 | C | T/C | -0.997302 |
36 | Chr5 | 27202478 | iS | LOC_Os05g47480 | C | G/C | 0.915826 |
37 | Chr5 | 27230464 | iS | LOC_Os05g47545 | T | A/T | 0.982394 |
38 | Chr5 | 27378401 | iS | LOC_Os05g47780 | T | A/T | -0.978414 |
39 | Chr5 | 27447153 | iS | LOC_Os05g47860 | A | C/A | -0.997354 |
40 | Chr5 | 27457848 | iS | LOC_Os05g47880 | C | T/C | -0.988848 |
41 | Chr5 | 27562629 | iS | LOC_Os05g48070 | G | A/G | -0.991535 |
42 | Chr5 | 27604248 | iS | LOC_Os05g48150 | G | A/G | -0.99948 |
43 | Chr5 | 27690359 | iS | LOC_Os05g48300 | G | A/G | -0.999182 |
44 | Chr5 | 27715849 | iS | LOC_Os05g48340 | T | A/T | -0.997468 |
45 | Chr5 | 27724718 | iS | LOC_Os05g48360 | A | C/A | -0.999951 |
46 | Chr5 | 27725803 | iS | LOC_Os05g48360 | C | G/C | -0.997908 |
47 | Chr5 | 27725942 | iS | LOC_Os05g48360 | T | C/T | -0.995913 |
48 | Chr5 | 27726919 | iS | LOC_Os05g48360 | C | T/C | -0.999245 |
49 | Chr5 | 27727262 | iS | LOC_Os05g48360 | C | A/C | -0.923878 |
50 | Chr5 | 27727469 | iS | LOC_Os05g48360 | C | T/C | -0.985788 |
51 | Chr5 | 27730517 | iS | LOC_Os05g48370 | C | T/C | -0.998158 |
52 | Chr5 | 27730638 | iS | LOC_Os05g48370 | G | C/G | -0.998709 |
53 | Chr5 | 27730969 | iS | LOC_Os05g48370 | T | G/T | -0.999767 |
54 | Chr5 | 27731335 | iS | LOC_Os05g48370 | A | G/A | -0.998896 |
55 | Chr5 | 27743763 | iS | LOC_Os05g48390 | A | G/A | -0.992472 |
56 | Chr5 | 27749412 | iS | LOC_Os05g48400 | T | C/T | -0.999285 |
57 | Chr5 | 27749912 | iS | LOC_Os05g48400 | G | A/G | -0.989755 |
58 | Chr5 | 27786024 | iS | LOC_Os05g48470 | T | C/T | -0.999259 |
59 | Chr5 | 27787315 | iS | LOC_Os05g48470 | T | C/T | -0.999175 |
60 | Chr5 | 27797053 | iS | LOC_Os05g48500 | A | G/A | -0.998895 |
61 | Chr5 | 27834315 | iS | LOC_Os05g48570 | C | T/C | -0.995407 |
62 | Chr5 | 27868164 | iS | LOC_Os05g48610 | A | G/A | -0.968579 |
63 | Chr5 | 27882703 | iS | LOC_Os05g48640 | C | A/C | -0.990795 |
64 | Chr5 | 27882763 | iS | LOC_Os05g48640 | G | A/G | -0.999937 |
65 | Chr5 | 27883933 | iS | LOC_Os05g48650 | C | T/C | -0.997071 |
66 | Chr5 | 27883935 | iS | LOC_Os05g48650 | T | C/T | -0.999684 |
67 | Chr5 | 27883944 | iS | LOC_Os05g48650 | G | A/G | -0.993837 |
68 | Chr5 | 28240944 | iS | LOC_Os05g49210 | G | C/G | -0.901596 |
69 | Chr5 | 28240950 | iS | LOC_Os05g49210 | G | C/G | -0.902469 |
70 | Chr5 | 28541857 | iS | LOC_Os05g49750 | T | C/T | -0.902361 |
71 | Chr5 | 29226038 | iS | LOC_Os05g50930 | T | A/T | 0.94298 |
72 | Chr6 | 4552400 | iS | LOC_Os06g09040 | T | C/T | 0.977047 |
73 | Chr6 | 4552782 | iS | LOC_Os06g09040 | G | A/G | 0.976179 |
74 | Chr6 | 10958464 | iS | LOC_Os06g19260 | T | C/T | -0.988459 |
75 | Chr6 | 11002220 | iS | LOC_Os06g19340 | C | T/C | -0.988332 |
76 | Chr6 | 11002364 | iS | LOC_Os06g19340 | C | A/C | -0.996656 |
77 | Chr6 | 11002639 | iS | LOC_Os06g19340 | G | A/G | -0.99586 |
78 | Chr6 | 11002804 | iS | LOC_Os06g19340 | C | T/C | -0.996251 |
79 | Chr6 | 11002807 | iS | LOC_Os06g19340 | C | A/C | -0.987677 |
80 | Chr6 | 11002867 | iS | LOC_Os06g19340 | G | A/G | -0.995488 |
81 | Chr6 | 11003365 | iS | LOC_Os06g19340 | A | T/A | -0.999803 |
82 | Chr6 | 11003852 | iS | LOC_Os06g19340 | C | A/C | -0.999871 |
83 | Chr6 | 11003897 | iS | LOC_Os06g19340 | G | T/G | -0.996749 |
84 | Chr6 | 12542169 | jiS | LOC_Os06g21700 | T | C | 0.99941 |
85 | Chr6 | 19206019 | iS | LOC_Os06g32990 | G | A/G | 0.998537 |
86 | Chr6 | 23121286 | iS | LOC_Os06g38960 | T | C/T | 0.983876 |
87 | Chr6 | 27999640 | jS | LOC_Os06g46210 | A/T | T | 0.901835 |
88 | Chr6 | 28004175 | jS | LOC_Os06g46230 | A/G | G | 0.902849 |
89 | Chr6 | 30026712 | iS | LOC_Os06g49610 | T | A/T | 0.98795 |
90 | Chr7 | 4317188 | iS | LOC_Os07g08400 | G | A/G | -0.996894 |
91 | Chr7 | 7936787 | iS | LOC_Os07g13860 | C | T/C | -0.999597 |
92 | Chr7 | 7936880 | iS | LOC_Os07g13860 | C | T/C | -0.999864 |
93 | Chr7 | 8138798 | iS | LOC_Os07g14270 | G | A/G | -0.998728 |
94 | Chr7 | 8260209 | iS | LOC_Os07g14500 | C | T/C | -0.998656 |
95 | Chr7 | 8354450 | iS | LOC_Os07g14650 | A | T/A | -0.997302 |
96 | Chr7 | 10260691 | iS | LOC_Os07g17380 | C | T/C | -0.980703 |
97 | Chr7 | 10922601 | iS | LOC_Os07g18460 | C | A/C | -0.998809 |
98 | Chr7 | 10998571 | iS | LOC_Os07g18610 | G | A/G | -0.968525 |
99 | Chr7 | 11068926 | iS | LOC_Os07g18720 | G | A/G | -0.987192 |
100 | Chr7 | 11291168 | iS | LOC_Os07g19070 | C | T/C | -0.98709 |
101 | Chr7 | 11339307 | iS | LOC_Os07g19160 | C | T/C | -0.955608 |
102 | Chr7 | 11339309 | iS | LOC_Os07g19160 | G | A/G | -0.956164 |
103 | Chr7 | 12679751 | iS | LOC_Os07g22534 | A | T/A | -0.999873 |
104 | Chr7 | 12679881 | iS | LOC_Os07g22534 | C | A/C | -0.999934 |
105 | Chr7 | 12777578 | iS | LOC_Os07g22680 | G | C/G | -0.998502 |
106 | Chr7 | 13104746 | iS | LOC_Os07g23260 | C | A/C | -0.996793 |
107 | Chr7 | 13105316 | iS | LOC_Os07g23260 | G | A/G | -0.988835 |
108 | Chr7 | 13105683 | iS | LOC_Os07g23260 | C | T/C | -0.994167 |
109 | Chr7 | 13470577 | iS | LOC_Os07g23850 | A | G/A | -0.965939 |
110 | Chr7 | 13470711 | iS | LOC_Os07g23850 | C | T/C | -0.99951 |
111 | Chr7 | 13471048 | iS | LOC_Os07g23850 | C | T/C | -0.999911 |
112 | Chr7 | 13471158 | iS | LOC_Os07g23850 | T | A/T | -0.9848 |
113 | Chr7 | 13647625 | iS | LOC_Os07g24070 | G | A/G | -0.990986 |
114 | Chr7 | 13694835 | jiS | LOC_Os07g24130 | G | A | 0.9045 |
115 | Chr7 | 13727204 | iS | LOC_Os07g24160 | C | T/C | -0.976962 |
116 | Chr7 | 13727252 | iS | LOC_Os07g24160 | A | G/A | -0.935738 |
117 | Chr7 | 14517027 | iS | LOC_Os07g25400 | C | T/C | -0.997331 |
118 | Chr7 | 19924307 | iS | LOC_Os07g33330 | T | G/T | -0.951452 |
119 | Chr7 | 22517158 | iS | LOC_Os07g37580 | G | A/G | -0.976934 |
120 | Chr7 | 22602073 | iS | LOC_Os07g37700 | T | C/T | -0.904227 |
121 | Chr7 | 24164555 | iS | LOC_Os07g40300 | G | C/G | 0.978756 |
122 | Chr7 | 24164681 | iS | LOC_Os07g40300 | G | T/G | 0.983382 |
123 | Chr7 | 27553423 | iS | LOC_Os07g46170 | G | T/G | -0.970923 |
124 | Chr7 | 27554776 | iS | LOC_Os07g46170 | T | C/T | -0.969759 |
125 | Chr7 | 27569432 | iS | LOC_Os07g46190 | C | G/C | -0.994529 |
126 | Chr8 | 5346091 | iS | LOC_Os08g09220 | C | A/C | -0.999996 |
127 | Chr8 | 6192821 | iS | LOC_Os08g10540 | C | T/C | -0.94664 |
128 | Chr8 | 14510072 | iS | LOC_Os08g23980 | C | T/C | -0.976184 |
129 | Chr8 | 14976326 | iS | LOC_Os08g24750 | G | A/G | -0.995614 |
130 | Chr8 | 14981781 | iS | LOC_Os08g24760 | C | A/C | -0.9493 |
131 | Chr8 | 14984434 | iS | LOC_Os08g24760 | C | T/C | -0.974555 |
132 | Chr8 | 14991526 | iS | LOC_Os08g24760 | G | A/G | -0.973114 |
133 | Chr8 | 15009268 | iS | LOC_Os08g24790 | G | C/G | -0.953215 |
134 | Chr8 | 15206870 | iS | LOC_Os08g25060 | G | T/G | -0.939676 |
135 | Chr8 | 21755671 | iS | LOC_Os08g34620 | A | T/A | -0.902437 |
136 | Chr9 | 5838790 | iS | LOC_Os09g10710 | C | T/C | -0.931202 |
137 | Chr9 | 5849225 | iS | LOC_Os09g10720 | G | A/G | -0.937803 |
138 | Chr9 | 5878257 | iS | LOC_Os09g10770 | G | T/G | -0.925331 |
139 | Chr9 | 5883681 | iS | LOC_Os09g10780 | T | C/T | -0.940112 |
140 | Chr9 | 5899102 | iS | LOC_Os09g10810 | C | T/C | -0.93708 |
141 | Chr9 | 5922570 | iS | LOC_Os09g10840 | A | G/A | -0.931888 |
142 | Chr9 | 12556552 | jiS | LOC_Os09g20850 | T | G | -0.937599 |
143 | Chr9 | 12607231 | jiS | LOC_Os09g20940 | A | G | -0.91905 |
144 | Chr9 | 12645024 | jiS | LOC_Os09g20990 | G | C | -0.91128 |
145 | Chr9 | 12645392 | jiS | LOC_Os09g20990 | T | C | -0.922538 |
146 | Chr9 | 12665833 | jiS | LOC_Os09g21000 | T | C | -0.932733 |
147 | Chr9 | 12667402 | jiS | LOC_Os09g21000 | C | T | -0.97675 |
148 | Chr9 | 12685817 | jiS | LOC_Os09g21040 | G | C | -0.905109 |
149 | Chr9 | 12686635 | jiS | LOC_Os09g21040 | C | T | -0.9608 |
150 | Chr9 | 12965554 | jiS | LOC_Os09g21440 | G | A | -0.920161 |
151 | Chr9 | 12965575 | jiS | LOC_Os09g21440 | C | T | -0.915386 |
152 | Chr9 | 13016795 | jiS | LOC_Os09g21510 | G | A | -0.906435 |
153 | Chr9 | 14976796 | iS | LOC_Os09g25060 | G | C/G | -0.998159 |
154 | Chr9 | 14993347 | iS | LOC_Os09g25070 | G | C/G | -0.999938 |
155 | Chr9 | 15087997 | iS | LOC_Os09g25170 | G | A/G | 0.975344 |
156 | Chr9 | 15096499 | jiS | LOC_Os09g25200 | C | T | 0.999036 |
157 | Chr9 | 15096735 | jiS | LOC_Os09g25200 | G | A | 0.990989 |
158 | Chr9 | 15097080 | jiS | LOC_Os09g25200 | G | A | 0.994973 |
159 | Chr9 | 15097684 | jiS | LOC_Os09g25200 | G | A | 0.990887 |
160 | Chr9 | 15131526 | jiS | LOC_Os09g25280 | G | A | 0.981258 |
161 | Chr9 | 15131592 | jiS | LOC_Os09g25280 | A | T | 0.999831 |
162 | Chr9 | 15131928 | jiS | LOC_Os09g25280 | G | A | 0.999923 |
163 | Chr9 | 15189303 | iS | LOC_Os09g25340 | G | A/G | 0.968525 |
164 | Chr9 | 15329699 | jiS | LOC_Os09g25550 | C | T | 0.932895 |
165 | Chr9 | 15359983 | iS | LOC_Os09g25590 | T | A/T | 0.941552 |
166 | Chr9 | 16455985 | jiS | LOC_Os09g27050 | G | A | 0.917891 |
167 | Chr9 | 16496241 | iS | LOC_Os09g27120 | G | A/G | 0.980676 |
168 | Chr9 | 16786991 | iS | LOC_Os09g27620 | C | T/C | 0.931718 |
169 | Chr9 | 17093854 | iS | LOC_Os09g28180 | T | C/T | 0.945604 |
170 | Chr9 | 17172557 | iS | LOC_Os09g28280 | G | A/G | 0.952378 |
171 | Chr9 | 17208139 | iS | LOC_Os09g28340 | T | A/T | 0.981792 |
172 | Chr9 | 17208190 | iS | LOC_Os09g28340 | G | A/G | 0.97047 |
173 | Chr9 | 18361804 | iS | LOC_Os09g30170 | A | G/A | -0.918916 |
174 | Chr11 | 1572535 | iS | LOC_Os11g03940 | C | G/C | 0.988903 |
175 | Chr11 | 24251922 | iS | LOC_Os11g40630 | G | A/G | -0.926557 |
176 | Chr11 | 24252359 | iS | LOC_Os11g40630 | G | A/G | -0.90211 |
177 | Chr12 | 2319598 | iS | LOC_Os12g05230 | G | C/G | -0.995367 |
178 | Chr12 | 6909654 | iS | LOC_Os12g12550 | A | G/A | -0.955324 |
179 | Chr12 | 6951340 | iS | LOC_Os12g12610 | A | T/A | -0.978886 |
180 | Chr12 | 7131331 | iS | LOC_Os12g12880 | G | T/G | -0.95253 |
181 | Chr12 | 7193940 | iS | LOC_Os12g12990 | G | A/G | -0.997098 |
182 | Chr12 | 7296789 | jiS | LOC_Os12g13120 | C | A | -0.997095 |
183 | Chr12 | 7729496 | jiS | LOC_Os12g13710 | G | A | -0.989097 |
184 | Chr12 | 8181488 | iS | LOC_Os12g14360 | C | T/C | -0.999436 |
185 | Chr12 | 10463109 | iS | LOC_Os12g18140 | C | T/C | -0.922164 |
186 | Chr12 | 13270701 | iS | LOC_Os12g23460 | G | A/G | -0.94481 |
187 | Chr12 | 13271524 | iS | LOC_Os12g23460 | T | C/T | -0.931038 |
188 | Chr12 | 13431652 | iS | LOC_Os12g23670 | A | G/A | -0.946254 |
189 | Chr12 | 13431680 | iS | LOC_Os12g23670 | A | G/A | -0.904795 |
190 | Chr12 | 13487577 | iS | LOC_Os12g23754 | C | T/C | -0.905409 |
191 | Chr12 | 13879868 | iS | LOC_Os12g24320 | A | T/A | -0.960254 |
192 | Chr12 | 15962749 | iS | LOC_Os12g27200 | G | C/G | -0.928603 |
193 | Chr12 | 16544114 | iS | LOC_Os12g28050 | G | A/G | -0.998384 |
194 | Chr12 | 20020156 | iS | LOC_Os12g33090 | G | A/G | -0.937742 |
195 | Chr12 | 20020178 | iS | LOC_Os12g33090 | A | T/A | -0.929899 |
196 | Chr12 | 20020191 | iS | LOC_Os12g33090 | G | T/G | -0.951613 |
197 | Chr12 | 20040776 | iS | LOC_Os12g33110 | C | T/C | -0.981781 |
198 | Chr12 | 20125271 | iS | LOC_Os12g33270 | G | A/G | -0.984687 |
199 | Chr12 | 20125292 | iS | LOC_Os12g33270 | A | T/A | -0.984745 |
200 | Chr12 | 20806725 | iS | LOC_Os12g34330 | C | T/C | -0.971071 |
201 | Chr12 | 20870266 | iS | LOC_Os12g34460 | A | G/A | -0.999573 |
202 | Chr12 | 20870356 | iS | LOC_Os12g34460 | A | C/A | -0.998666 |
203 | Chr12 | 20870439 | iS | LOC_Os12g34460 | T | G/T | -0.998457 |
204 | Chr12 | 20876904 | iS | LOC_Os12g34480 | A | C/A | -0.999992 |
205 | Chr12 | 20876949 | iS | LOC_Os12g34480 | C | T/C | -0.991352 |
206 | Chr12 | 20879954 | iS | LOC_Os12g34490 | A | G/A | -0.98581 |
207 | Chr12 | 20890306 | iS | LOC_Os12g34524 | T | C/T | -0.993718 |
208 | Chr12 | 20890582 | iS | LOC_Os12g34524 | T | C/T | -0.969697 |
209 | Chr12 | 24682146 | iS | LOC_Os12g39930 | G | C/G | 0.919475 |
210 | Chr12 | 24898957 | iS | LOC_Os12g40220 | T | C/T | -0.92824 |
211 | Chr12 | 25030400 | iS | LOC_Os12g40450 | A | C/A | -0.913568 |
212 | Chr12 | 25030618 | iS | LOC_Os12g40450 | C | G/C | -0.911038 |
213 | Chr12 | 25076283 | iS | LOC_Os12g40520 | C | A/C | -0.92069 |
214 | Chr12 | 25204218 | iS | LOC_Os12g40710 | C | A/C | -0.914294 |
215 | Chr1 | 9137420 | jS | LOC_Os01g16170 | A/T | T | -0.994413 |
216 | Chr1 | 27162194 | jS | LOC_Os01g47520 | A/C | C | 0.980717 |
217 | Chr1 | 30871270 | jS | LOC_Os01g53710 | G/A | A | -0.992389 |
218 | Chr1 | 33041112 | jS | LOC_Os01g57170 | T/G | G | 0.99853 |
219 | Chr1 | 33301236 | jS | LOC_Os01g57590 | T/C | C | 0.999578 |
220 | Chr1 | 43018025 | jS | LOC_Os01g74250 | A/G | G | -0.985189 |
221 | Chr1 | 43018592 | jS | LOC_Os01g74250 | A/C | C | -0.996912 |
222 | Chr1 | 43047621 | jS | LOC_Os01g74300 | T/A | A | -0.995363 |
223 | Chr1 | 43074754 | jS | LOC_Os01g74350 | G/A | A | -0.997904 |
224 | Chr2 | 4186188 | jS | LOC_Os02g07990 | A/G | G | -0.996146 |
225 | Chr2 | 4186214 | jS | LOC_Os02g07990 | A/G | G | -0.997078 |
226 | Chr2 | 4186226 | jS | LOC_Os02g07990 | T/C | C | -0.996639 |
227 | Chr2 | 4186323 | jS | LOC_Os02g07990 | C/G | G | -0.983882 |
228 | Chr2 | 4196320 | jS | LOC_Os02g08010 | A/C | C | -0.986377 |
229 | Chr2 | 4201502 | jS | LOC_Os02g08010 | A/G | G | -0.999799 |
230 | Chr2 | 4208313 | jS | LOC_Os02g08018 | T/C | C | -0.996683 |
231 | Chr2 | 5492898 | jS | LOC_Os02g10460 | T/A | A | -0.987334 |
232 | Chr2 | 5908447 | jS | LOC_Os02g11030 | A/G | G | 0.98325 |
233 | Chr2 | 10070467 | jS | LOC_Os02g17490 | T/C | C | -0.9965 |
234 | Chr2 | 10105273 | jS | LOC_Os02g17560 | A/G | G | -0.999284 |
235 | Chr2 | 10112136 | jS | LOC_Os02g17580 | G/A | A | -0.995944 |
236 | Chr2 | 10112665 | jS | LOC_Os02g17580 | G/A | A | -0.997727 |
237 | Chr2 | 10112690 | jS | LOC_Os02g17580 | C/T | T | -0.999915 |
238 | Chr2 | 13171691 | jiS | LOC_Os02g22120 | A | G | -0.998391 |
239 | Chr2 | 14100675 | jiS | LOC_Os02g24300 | C | T | -0.998562 |
240 | Chr2 | 14139905 | jiS | LOC_Os02g24370 | T | C | -0.996897 |
241 | Chr2 | 21115032 | jS | LOC_Os02g35150 | C/A | A | -0.980861 |
242 | Chr2 | 21115473 | jS | LOC_Os02g35150 | A/T | T | -0.983828 |
243 | Chr2 | 21117159 | jS | LOC_Os02g35160 | T/A | A | -0.985308 |
244 | Chr2 | 27083248 | jiS | LOC_Os02g44700 | A | G | 0.99489 |
245 | Chr2 | 27298500 | jiS | LOC_Os02g45040 | T | C | 0.984856 |
246 | Chr2 | 28073630 | jS | LOC_Os02g46070 | G/T | T | 0.996636 |
247 | Chr2 | 28113218 | jS | LOC_Os02g46120 | A/G | G | 0.981635 |
248 | Chr2 | 28624177 | jS | LOC_Os02g46910 | A/G | G | -0.993445 |
249 | Chr2 | 28866907 | jS | LOC_Os02g47280 | T/C | C | -0.999536 |
250 | Chr2 | 29027942 | jS | LOC_Os02g47510 | C/T | T | -0.997535 |
251 | Chr2 | 29699620 | jS | LOC_Os02g48520 | T/C | C | -0.991588 |
252 | Chr2 | 29719561 | jS | LOC_Os02g48550 | T/C | C | -0.999087 |
253 | Chr2 | 29795712 | jS | LOC_Os02g48660 | A/C | C | -0.999973 |
254 | Chr2 | 29903898 | jS | LOC_Os02g48880 | A/T | T | -0.999657 |
255 | Chr2 | 30596890 | jS | LOC_Os02g50110 | G/C | C | -0.99582 |
256 | Chr2 | 30631898 | jS | LOC_Os02g50140 | A/C | C | -0.992357 |
257 | Chr2 | 30814041 | jiS | LOC_Os02g50470 | T | C | -0.995746 |
258 | Chr2 | 30841358 | jS | LOC_Os02g50490 | G/A | A | -0.99983 |
259 | Chr2 | 31081444 | jS | LOC_Os02g50850 | A/G | G | 0.981315 |
260 | Chr2 | 31081445 | jS | LOC_Os02g50850 | C/T | T | 0.981315 |
261 | Chr2 | 31626320 | jiS | LOC_Os02g51640 | G | A | 0.993381 |
262 | Chr2 | 31634550 | jiS | LOC_Os02g51650 | T | C | 0.998495 |
263 | Chr2 | 32648336 | jiS | LOC_Os02g53330 | T | C | -0.993957 |
264 | Chr3 | 2104759 | jS | LOC_Os03g04520 | A/T | T | 0.997788 |
265 | Chr3 | 2115483 | jS | LOC_Os03g04530 | G/A | A | 0.999041 |
266 | Chr3 | 2119566 | jS | LOC_Os03g04545 | C/G | G | 0.996728 |
267 | Chr3 | 2128578 | jS | LOC_Os03g04560 | C/T | T | 0.992887 |
268 | Chr3 | 2129383 | jS | LOC_Os03g04560 | T/C | C | 0.998757 |
269 | Chr3 | 2140731 | jS | LOC_Os03g04580 | T/C | C | 0.999607 |
270 | Chr3 | 2740113 | jS | LOC_Os03g05520 | T/C | C | -0.989256 |
271 | Chr3 | 4464434 | jS | LOC_Os03g08660 | T/C | C | -0.999937 |
272 | Chr3 | 4796914 | jS | LOC_Os03g09200 | T/C | C | -0.985851 |
273 | Chr3 | 7202925 | jS | LOC_Os03g13350 | T/C | C | 0.986847 |
274 | Chr3 | 10213637 | jS | LOC_Os03g18210 | G/C | C | 0.993949 |
275 | Chr3 | 10910669 | jS | LOC_Os03g19380 | A/G | G | -0.991598 |
276 | Chr3 | 11870304 | jS | LOC_Os03g20920 | G/T | T | 0.981182 |
277 | Chr3 | 12197917 | jS | LOC_Os03g21310 | T/C | C | 0.999989 |
278 | Chr3 | 12273368 | jiS | LOC_Os03g21480 | A | G | -0.991012 |
279 | Chr3 | 17048443 | jS | LOC_Os03g29920 | G/C | C | -0.985271 |
280 | Chr3 | 17221874 | jS | LOC_Os03g30170 | G/A | A | -0.982282 |
281 | Chr3 | 27454570 | jS | LOC_Os03g48250 | C/T | T | 0.999991 |
282 | Chr3 | 27816666 | jS | LOC_Os03g48820 | T/C | C | 0.993201 |
283 | Chr3 | 28244997 | jS | LOC_Os03g49600 | G/C | C | 0.99381 |
284 | Chr3 | 29486342 | jS | LOC_Os03g51550 | T/G | G | -0.992537 |
285 | Chr3 | 29871252 | jiS | LOC_Os03g52070 | A | C | -0.998974 |
286 | Chr3 | 30028194 | jS | LOC_Os03g52284 | T/C | C | -0.984856 |
287 | Chr3 | 30064054 | jS | LOC_Os03g52340 | A/G | G | -0.999443 |
288 | Chr3 | 30068550 | jS | LOC_Os03g52350 | G/A | A | -0.99929 |
289 | Chr3 | 30096391 | jS | LOC_Os03g52450 | T/C | C | -0.999995 |
290 | Chr3 | 30115735 | jiS | LOC_Os03g52475 | T | C | -0.999282 |
291 | Chr3 | 30131372 | jiS | LOC_Os03g52510 | G | C | -0.999376 |
292 | Chr3 | 30132761 | jiS | LOC_Os03g52510 | G | T | -0.999767 |
293 | Chr3 | 30132805 | jiS | LOC_Os03g52510 | C | T | -0.997313 |
294 | Chr3 | 30149972 | jiS | LOC_Os03g52560 | A | C | -0.985721 |
295 | Chr3 | 30307210 | jS | LOC_Os03g52850 | G/C | C | -0.996333 |
296 | Chr3 | 30594571 | jS | LOC_Os03g53300 | G/T | T | 0.988098 |
297 | Chr3 | 30602983 | jS | LOC_Os03g53320 | T/C | C | 0.999968 |
298 | Chr3 | 30610420 | jS | LOC_Os03g53350 | C/T | T | 0.992241 |
299 | Chr3 | 30689013 | jS | LOC_Os03g53500 | T/G | G | -0.986905 |
300 | Chr3 | 30702519 | jS | LOC_Os03g53530 | G/A | A | -0.981891 |
301 | Chr3 | 30753345 | jS | LOC_Os03g53630 | G/A | A | -0.995648 |
302 | Chr3 | 31776134 | jiS | LOC_Os03g55810 | A | G | 0.99978 |
303 | Chr3 | 32535090 | jS | LOC_Os03g57070 | T/C | C | -0.981045 |
304 | Chr3 | 32594052 | jiS | LOC_Os03g57160 | A | G | -0.998038 |
305 | Chr3 | 32594181 | jiS | LOC_Os03g57160 | A | C | -0.999398 |
306 | Chr3 | 35464900 | jS | LOC_Os03g62660 | G/A | A | 0.987252 |
307 | Chr3 | 35991802 | jS | LOC_Os03g63750 | C/T | T | -0.9888 |
308 | Chr4 | 562364 | jS | LOC_Os04g01890 | C/T | T | -0.999096 |
309 | Chr4 | 567066 | jS | LOC_Os04g01900 | T/C | C | -0.996484 |
310 | Chr4 | 1004049 | jS | LOC_Os04g02660 | A/G | G | -0.999323 |
311 | Chr4 | 1004146 | jS | LOC_Os04g02660 | A/G | G | -0.989494 |
312 | Chr4 | 1004219 | jS | LOC_Os04g02660 | A/G | G | -0.999952 |
313 | Chr4 | 19323662 | jiS | LOC_Os04g32250 | C | A | 0.987106 |
314 | Chr4 | 19326618 | jiS | LOC_Os04g32250 | G | C | 0.990613 |
315 | Chr4 | 19326640 | jiS | LOC_Os04g32250 | T | G | 0.999716 |
316 | Chr4 | 19423346 | jiS | LOC_Os04g32380 | C | T | 0.991907 |
317 | Chr4 | 20321102 | jS | LOC_Os04g33570 | G/T | T | -0.997915 |
318 | Chr4 | 20359031 | jS | LOC_Os04g33610 | T/C | C | -0.994479 |
319 | Chr4 | 20368636 | jS | LOC_Os04g33630 | T/C | C | -0.98079 |
320 | Chr4 | 20377050 | iS | LOC_Os04g33640 | T | G/T | -0.997984 |
321 | Chr4 | 20394421 | jS | LOC_Os04g33680 | A/C | C | -0.998985 |
322 | Chr4 | 20450447 | jS | LOC_Os04g33770 | T/C | C | -0.998206 |
323 | Chr4 | 20499434 | jiS | LOC_Os04g33840 | C | T | -0.99954 |
324 | Chr4 | 20499599 | jiS | LOC_Os04g33840 | T | C | -0.998413 |
325 | Chr4 | 20531021 | iS | LOC_Os04g33890 | C | T/C | -0.998415 |
326 | Chr4 | 20595921 | jiS | LOC_Os04g34010 | C | G | -0.999954 |
327 | Chr4 | 21815679 | jS | LOC_Os04g35790 | G/A | A | -0.981915 |
328 | Chr4 | 21835960 | jS | LOC_Os04g35830 | T/C | C | -0.995777 |
329 | Chr4 | 23172728 | jS | LOC_Os04g39020 | T/G | G | -0.993379 |
330 | Chr4 | 23174850 | jS | LOC_Os04g39020 | T/A | A | -0.992865 |
331 | Chr4 | 23215190 | jS | LOC_Os04g39060 | T/C | C | -0.996263 |
332 | Chr5 | 919710 | jS | LOC_Os05g02630 | G/C | C | 0.987537 |
333 | Chr5 | 21494622 | jS | LOC_Os05g36270 | C/T | T | -0.99028 |
334 | Chr5 | 21501717 | jS | LOC_Os05g36290 | T/A | A | -0.983755 |
335 | Chr5 | 23879031 | jS | LOC_Os05g40730 | T/C | C | 0.996216 |
336 | Chr5 | 26274490 | jiS | LOC_Os05g45260 | C | G | -0.996619 |
337 | Chr5 | 26274501 | jiS | LOC_Os05g45260 | C | A | -0.993848 |
338 | Chr5 | 26328697 | jiS | LOC_Os05g45380 | C | T | -0.992049 |
339 | Chr5 | 26328817 | jiS | LOC_Os05g45380 | G | A | -0.991243 |
340 | Chr5 | 26328877 | jiS | LOC_Os05g45380 | G | A | -0.999922 |
341 | Chr5 | 26348030 | jiS | LOC_Os05g45420 | C | T | -0.989979 |
342 | Chr5 | 26467714 | jS | LOC_Os05g45670 | C/G | G | -0.98481 |
343 | Chr5 | 26612265 | jS | LOC_Os05g45920 | A/C | C | -0.992524 |
344 | Chr5 | 26624594 | jiS | LOC_Os05g45940 | T | C | -0.999869 |
345 | Chr5 | 26624654 | jiS | LOC_Os05g45940 | G | A | -0.998581 |
346 | Chr5 | 26632235 | jiS | LOC_Os05g45954 | C | G | -0.985761 |
347 | Chr5 | 26633430 | jiS | LOC_Os05g45954 | T | C | -0.998846 |
348 | Chr5 | 26633568 | jiS | LOC_Os05g45954 | T | A | -0.980013 |
349 | Chr5 | 26633678 | jiS | LOC_Os05g45954 | G | A | -0.996242 |
350 | Chr5 | 26666115 | jS | LOC_Os05g46000 | A/C | C | -0.991552 |
351 | Chr5 | 26682978 | jiS | LOC_Os05g46020 | A | G | -0.999531 |
352 | Chr5 | 26707043 | jiS | LOC_Os05g46040 | T | C | -0.99851 |
353 | Chr5 | 26707294 | jiS | LOC_Os05g46040 | A | T | -0.999452 |
354 | Chr5 | 26707295 | jiS | LOC_Os05g46040 | T | C | -0.99961 |
355 | Chr5 | 26721147 | jiS | LOC_Os05g46070 | G | C | -0.992448 |
356 | Chr5 | 26721236 | jiS | LOC_Os05g46070 | A | G | -0.996352 |
357 | Chr5 | 26721278 | jiS | LOC_Os05g46070 | T | C | -0.999323 |
358 | Chr5 | 26721383 | jiS | LOC_Os05g46070 | G | A | -0.989081 |
359 | Chr5 | 26721869 | jiS | LOC_Os05g46070 | A | C | -0.991111 |
360 | Chr5 | 26749646 | jS | LOC_Os05g46120 | A/T | T | -0.994659 |
361 | Chr5 | 26749724 | jS | LOC_Os05g46120 | A/G | G | -0.999821 |
362 | Chr5 | 26749739 | jS | LOC_Os05g46120 | G/C | C | -0.99162 |
363 | Chr5 | 26749784 | jS | LOC_Os05g46120 | T/C | C | -0.987903 |
364 | Chr5 | 26777001 | jS | LOC_Os05g46170 | T/C | C | -0.995209 |
365 | Chr5 | 26809500 | jiS | LOC_Os05g46230 | A | C | -0.999847 |
366 | Chr5 | 26809632 | jiS | LOC_Os05g46230 | T | C | -0.995423 |
367 | Chr5 | 27418390 | jS | LOC_Os05g47830 | C/T | T | -0.988319 |
368 | Chr5 | 27719954 | iS | LOC_Os05g48350 | T | C/T | 0.993016 |
369 | Chr5 | 27882703 | iS | LOC_Os05g48640 | C | A/C | -0.996842 |
370 | Chr5 | 28641570 | jS | LOC_Os05g49930 | A/G | G | 0.999361 |
371 | Chr5 | 28710628 | jS | LOC_Os05g50100 | T/G | G | 0.994552 |
372 | Chr5 | 28772543 | jS | LOC_Os05g50200 | G/C | C | 0.999952 |
373 | Chr5 | 28895833 | jS | LOC_Os05g50410 | G/C | C | 0.983503 |
374 | Chr6 | 473566 | jS | LOC_Os06g01830 | T/C | C | -0.980549 |
375 | Chr6 | 1784528 | jS | LOC_Os06g04230 | A/C | C | 0.984615 |
376 | Chr6 | 2138513 | jS | LOC_Os06g04870 | T/C | C | -0.997741 |
377 | Chr6 | 2338006 | jS | LOC_Os06g05209 | T/G | G | -0.993766 |
378 | Chr6 | 2405551 | jS | LOC_Os06g05350 | A/C | C | -0.997754 |
379 | Chr6 | 2588134 | jS | LOC_Os06g05730 | G/C | C | -0.988228 |
380 | Chr6 | 2593578 | jS | LOC_Os06g05750 | A/G | G | -0.993076 |
381 | Chr6 | 2769344 | jS | LOC_Os06g06014 | G/C | C | -0.999177 |
382 | Chr6 | 2770160 | jS | LOC_Os06g06014 | T/G | G | -0.999776 |
383 | Chr6 | 2830050 | jS | LOC_Os06g06115 | A/C | C | -0.999999 |
384 | Chr6 | 4648824 | jS | LOC_Os06g09240 | A/C | C | -0.990878 |
385 | Chr6 | 7122887 | jS | LOC_Os06g13000 | C/T | T | -0.995603 |
386 | Chr6 | 7145862 | jS | LOC_Os06g13040 | C/T | T | -0.99467 |
387 | Chr6 | 7146003 | jS | LOC_Os06g13040 | A/G | G | -0.98761 |
388 | Chr6 | 7773164 | jS | LOC_Os06g13960 | G/A | A | 0.99987 |
389 | Chr6 | 7773187 | jS | LOC_Os06g13960 | G/T | T | 0.986931 |
390 | Chr6 | 10438801 | jS | LOC_Os06g17950 | A/T | T | -0.991906 |
391 | Chr6 | 24091717 | jS | LOC_Os06g40450 | A/T | T | -0.99904 |
392 | Chr6 | 24091785 | jS | LOC_Os06g40450 | G/C | C | -0.988101 |
393 | Chr6 | 25406568 | jS | LOC_Os06g42300 | C/T | T | 0.994798 |
394 | Chr6 | 27969100 | jS | LOC_Os06g46149 | C/A | A | -0.989193 |
395 | Chr6 | 28493453 | jS | LOC_Os06g46980 | T/G | G | 0.980861 |
396 | Chr7 | 2711922 | jS | LOC_Os07g05680 | A/G | G | 0.983021 |
397 | Chr7 | 4093339 | jS | LOC_Os07g08080 | A/T | T | -0.98216 |
398 | Chr7 | 5513670 | jiS | LOC_Os07g10270 | A | G | -0.992536 |
399 | Chr7 | 5519707 | jS | LOC_Os07g10290 | A/C | C | -0.983521 |
400 | Chr7 | 7951636 | jS | LOC_Os07g13900 | T/A | A | 0.99998 |
401 | Chr7 | 8982622 | jS | LOC_Os07g15480 | T/C | C | -0.985788 |
402 | Chr7 | 15244771 | jS | LOC_Os07g26500 | T/G | G | 0.999997 |
403 | Chr7 | 16284413 | jiS | LOC_Os07g27920 | A | G | 0.99362 |
404 | Chr7 | 17266009 | jS | LOC_Os07g29400 | T/C | C | -0.984003 |
405 | Chr7 | 20771577 | jS | LOC_Os07g34650 | T/C | C | -0.999798 |
406 | Chr7 | 23772782 | jS | LOC_Os07g39660 | C/T | T | 0.980437 |
407 | Chr7 | 23845357 | jS | LOC_Os07g39800 | T/C | C | 0.985009 |
408 | Chr7 | 23869557 | jS | LOC_Os07g39820 | A/C | C | 0.999834 |
409 | Chr7 | 23953426 | jS | LOC_Os07g39930 | C/A | A | 0.997889 |
410 | Chr7 | 25979498 | jS | LOC_Os07g43380 | A/G | G | 0.996496 |
411 | Chr7 | 27984918 | jS | LOC_Os07g46820 | T/C | C | -0.999876 |
412 | Chr7 | 28076891 | jS | LOC_Os07g46990 | G/A | A | -0.996912 |
413 | Chr7 | 28417860 | jS | LOC_Os07g47520 | G/T | T | 0.991822 |
414 | Chr7 | 28784787 | jS | LOC_Os07g48190 | T/C | C | 0.999952 |
415 | Chr7 | 28829556 | jS | LOC_Os07g48260 | C/G | G | 0.990513 |
416 | Chr7 | 28850524 | jS | LOC_Os07g48280 | T/A | A | 0.99837 |
417 | Chr7 | 28882379 | jS | LOC_Os07g48330 | G/C | C | -0.999413 |
418 | Chr7 | 28882382 | jS | LOC_Os07g48330 | G/C | C | -0.999413 |
419 | Chr7 | 28894819 | jiS | LOC_Os07g48350 | A | G | -0.996039 |
420 | Chr7 | 29491264 | iS | LOC_Os07g49250 | G | A/G | -0.988234 |
421 | Chr7 | 29578472 | iS | LOC_Os07g49380 | C | T/C | -0.996971 |
422 | Chr8 | 3511536 | jS | LOC_Os08g06330 | C/A | A | 0.991047 |
423 | Chr8 | 11678308 | jS | LOC_Os08g19530 | T/C | C | -0.999028 |
424 | Chr8 | 15299321 | jS | LOC_Os08g25180 | T/C | C | -0.985796 |
425 | Chr9 | 3024036 | jiS | LOC_Os09g06420 | G | A | -0.986648 |
426 | Chr9 | 4226063 | jS | LOC_Os09g08150 | A/C | C | 0.990615 |
427 | Chr9 | 4226080 | jS | LOC_Os09g08150 | T/C | C | 0.997057 |
428 | Chr9 | 9368140 | jS | LOC_Os09g15330 | T/C | C | 0.987763 |
429 | Chr9 | 15914968 | jS | LOC_Os09g26330 | G/C | C | -0.996467 |
430 | Chr9 | 16239650 | jiS | LOC_Os09g26730 | T | C | -0.998711 |
431 | Chr9 | 16274847 | jiS | LOC_Os09g26780 | G | T | -0.993966 |
432 | Chr9 | 17827593 | jiS | LOC_Os09g29360 | G | T | -0.998132 |
433 | Chr9 | 17882025 | jiS | LOC_Os09g29430 | T | C | -0.998234 |
434 | Chr9 | 17882040 | jiS | LOC_Os09g29430 | T | C | -0.991242 |
435 | Chr9 | 17882115 | jiS | LOC_Os09g29430 | G | A | -0.99281 |
436 | Chr9 | 17977806 | jiS | LOC_Os09g29560 | T | G | -0.995015 |
437 | Chr9 | 17982727 | jiS | LOC_Os09g29570 | T | C | -0.998893 |
438 | Chr9 | 17982738 | jiS | LOC_Os09g29570 | A | G | -0.991976 |
439 | Chr9 | 18010475 | jiS | LOC_Os09g29610 | A | G | -0.987761 |
440 | Chr9 | 18010965 | jiS | LOC_Os09g29610 | A | T | -0.998364 |
441 | Chr9 | 18039009 | jS | LOC_Os09g29660 | A/G | G | -0.980703 |
442 | Chr9 | 18100366 | jS | LOC_Os09g29780 | T/C | C | -0.990865 |
443 | Chr9 | 18291060 | jS | LOC_Os09g30090 | G/A | A | -0.981759 |
444 | Chr9 | 18294399 | jS | LOC_Os09g30100 | A/G | G | -0.982626 |
445 | Chr9 | 20225250 | jS | LOC_Os09g34260 | A/C | C | -0.996247 |
446 | Chr9 | 20951735 | jS | LOC_Os09g36300 | T/A | A | -0.983258 |
447 | Chr9 | 22288999 | jS | LOC_Os09g38790 | A/G | G | 0.988308 |
448 | Chr9 | 22289880 | jS | LOC_Os09g38790 | T/C | C | 0.992389 |
449 | Chr9 | 22290006 | jS | LOC_Os09g38790 | T/C | C | 0.995365 |
450 | Chr10 | 3833048 | jS | LOC_Os10g07270 | A/T | T | -0.980719 |
451 | Chr10 | 14641648 | jS | LOC_Os10g28200 | A/G | G | -0.987157 |
452 | Chr10 | 14693493 | jS | LOC_Os10g28254 | T/C | C | -0.999947 |
453 | Chr10 | 15186949 | jiS | LOC_Os10g29159 | G | A | -0.982546 |
454 | Chr10 | 15498950 | jiS | LOC_Os10g29810 | G | C | -0.984055 |
455 | Chr10 | 15660014 | iS | LOC_Os10g30140 | C | A/C | -0.999688 |
456 | Chr10 | 15660206 | iS | LOC_Os10g30140 | A | G/A | -0.98086 |
457 | Chr10 | 15660323 | iS | LOC_Os10g30140 | T | C/T | -0.98234 |
458 | Chr10 | 15660920 | iS | LOC_Os10g30140 | A | G/A | -0.984055 |
459 | Chr10 | 22635214 | jiS | LOC_Os10g42080 | G | A | -0.988441 |
460 | Chr10 | 22935522 | jS | LOC_Os10g42520 | C/T | T | -0.992315 |
461 | Chr10 | 22987951 | jiS | LOC_Os10g42630 | G | A | -0.986413 |
462 | Chr10 | 23036407 | jiS | LOC_Os10g42710 | C | A | -0.989914 |
463 | Chr10 | 23053899 | jiS | LOC_Os10g42724 | A | C | -0.995214 |
464 | Chr10 | 23054376 | jiS | LOC_Os10g42724 | T | C | -0.991041 |
465 | Chr10 | 23054552 | jiS | LOC_Os10g42724 | C | A | -0.99231 |
466 | Chr10 | 23075871 | jiS | LOC_Os10g42780 | A | G | -0.992163 |
467 | Chr11 | 2379158 | jS | LOC_Os11g05360 | G/C | C | -0.994112 |
468 | Chr11 | 10754570 | jS | LOC_Os11g18910 | G/A | A | -0.99169 |
469 | Chr11 | 10918376 | jS | LOC_Os11g19140 | A/C | C | -0.983099 |
470 | Chr11 | 11046703 | jS | LOC_Os11g19270 | A/G | G | -0.992657 |
471 | Chr11 | 11541392 | jS | LOC_Os11g20050 | T/C | C | -0.992365 |
472 | Chr11 | 14029186 | jS | LOC_Os11g24570 | G/A | A | -0.982831 |
473 | Chr11 | 14052352 | jS | LOC_Os11g24630 | A/C | C | -0.999611 |
474 | Chr11 | 16690681 | jS | LOC_Os11g28800 | A/C | C | -0.994743 |
475 | Chr11 | 17830266 | jS | LOC_Os11g30620 | C/T | T | -0.995594 |
476 | Chr11 | 17830718 | jS | LOC_Os11g30620 | G/A | A | -0.998938 |
477 | Chr11 | 17831066 | jS | LOC_Os11g30620 | C/T | T | -0.98409 |
478 | Chr12 | 6563995 | jS | LOC_Os12g11980 | A/G | G | -0.988379 |
479 | Chr12 | 23092134 | jS | LOC_Os12g37600 | A/G | G | -0.98055 |
480 | Chr12 | 23100522 | jS | LOC_Os12g37620 | G/C | C | -0.982122 |
481 | Chr12 | 23100726 | jS | LOC_Os12g37620 | C/A | A | -0.98591 |
482 | Chr12 | 23928305 | jS | LOC_Os12g38910 | T/A | A | 0.994533 |
483 | Chr12 | 25663165 | jS | LOC_Os12g41420 | A/G | G | -0.988915 |
484 | Chr12 | 25823002 | jS | LOC_Os12g41710 | C/G | G | -0.982161 |
485 | Chr12 | 25916933 | jiS | LOC_Os12g41840 | A | G | -0.995288 |
486 | Chr12 | 26453094 | jS | LOC_Os12g42590 | A/C | C | -0.999948 |
487 | Chr12 | 26506682 | jS | LOC_Os12g42650 | A/G | G | -0.99509 |
488 | Chr12 | 26506683 | jS | LOC_Os12g42650 | A/C | C | -0.99509 |
489 | Chr12 | 26538978 | jS | LOC_Os12g42710 | C/T | T | -0.999787 |
490 | Chr12 | 26579061 | jS | LOC_Os12g42760 | G/A | A | -0.994127 |
491 | Chr12 | 26670828 | jS | LOC_Os12g42910 | C/G | G | -0.994242 |
492 | Chr12 | 26670929 | jS | LOC_Os12g42910 | G/T | T | -0.998353 |
493 | Chr12 | 26675568 | jS | LOC_Os12g42930 | T/A | A | -0.987157 |
494 | Chr12 | 26677989 | jS | LOC_Os12g42930 | T/C | C | -0.983245 |
备注:水稻参考基因组版本:MSU7.0(http://rice.uga.edu/);iS代表仅籼稻受选的目标位点,表示位点与籼稻的性状相关,jS代表仅粳稻受选的目标位点,表示位点与粳稻的性状相关,jiS代表籼粳稻同时受选的目标位点,表示同时与籼稻和粳稻的性状相关。
第二方面,本发明实施例提供了一种用于鉴定水稻适种区域的试剂盒,其包括用于检测目标SNP位点基因型的试剂,所述目标SNP位点包括如前述实施例所述的应用中的目标SNP位点。
第三方面,本发明实施例提供了一种水稻适种区域的预测模型的训练方法,其包括:获取训练样本的目标SNP位点的检测信息以及对应的标注结果,其中,所述目标SNP位点包括如前述实施例所述的应用中的目标SNP位点;将训练样本的目标SNP位点的检测信息以及对应的标注结果输入预先构建好的预测模型,获得训练样本的预测结果;其中,所述预测模型用于根据样本所述目标SNP位点的基因型信息判断样本的适种区域;基于所述标注结果和所述预测结果对构建的预测模型进行参数更新。
第四方面,本发明实施例提供了一种水稻适种区域的预测模型的预测方法,其包括:将待测样本的目标SNP位点的检测信息输入预先训练好的预测模型中,获得待测样本的预测结果,其中,所述目标SNP位点包括如前述实施例所述的应用中的目标SNP位点,所述预测模型为如前述实施例所述的水稻适种区域的预测模型的训练方法训练获得的。
第五方面,本发明实施例提供了一种水稻适种区域的预测模型的预测装置,其包括获取模块和预测模块。获取模块用于获取待测样本的目标SNP位点的检测信息;预测模块,用于将待测样本的目标SNP位点的检测信息输入预先训练好的预测模型中,获得待测样本的预测结果;其中,所述目标SNP位点包括如前述实施例所述的应用中的目标SNP位点,所述预测模型为如前述实施例所述的水稻适种区域的预测模型的训练方法训练获得的。
第六方面,本发明实施例提供了一种水稻适种区域的预测模型的训练装置,其包括获取模块、处理模块和参数更新模块。获取模块用于获取训练样本的目标SNP位点的检测信息以及对应的标注结果,其中,所述目标SNP位点包括如前述实施例所述的应用中的目标SNP位点。处理模块用于将所述训练样本的目标SNP位点的检测信息以及对应的标注结果输入预先构建好的预测模型,获得训练样本的预测结果;其中,所述预测模型用于根据样本所述目标SNP位点的基因型信息判断样本的适种区域。参数更新模块用于根据所述标注结果和所述预测结果对构建的预测模型进行参数更新。
第七方面,本发明实施例提供了一种电子设备,其包括:处理器和存储器;所述存储器用于存储程序,当所述程序被所述处理器执行时,使得所述处理器实现如前述实施例所述的水稻适种区域的预测模型的训练方法,或,如前述实施例所述的水稻适种区域的预测模型的预测方法。
第八方面,本发明实施例提供了一种计算机可读介质,其上存储有计算机程序,该计算机程序被处理器执行时实现如前述实施例所述的水稻适种区域的预测模型的训练方法,或,如前述实施例所述的水稻适种区域的预测模型的预测方法。
本发明具有以下有益效果:
水稻是比较重要的粮食作物,本发明通过高通量测序的方法,检测和分析籼稻和粳稻群体中SNP位点的基因型,采用机器学习和人工智能的方法,构建分析模型,用于对待测水稻品种进行适种区预测。综合预测准确率超过90%,相对于常规的适种区域评价方法,具有成本低,时间短等技术优势。
附图说明
为了更清楚地说明本发明实施例的技术方案,下面将对实施例中所需要使用的附图作简单地介绍,应当理解,以下附图仅示出了本发明的某些实施例,因此不应被看作是对范围的限定,对于本领域普通技术人员来讲,在不付出创造性劳动的前提下,还可以根据这些附图获得其他相关的附图。
图1为本发明与常规技术选择水稻适种区域的流程示意图,其中A为常规方法,B为本发明的方法;
图2为实施例1的预测准确率统计图;
图3为验证例1的不同类型位点建模预测准确性比较图;
图4为验证例2的不同比例样品作为训练集和测试集的比较图;
图5为验证例2的不同比例样品作为训练集和测试集的比较图。
具体实施方式
为使本发明实施例的目的、技术方案和优点更加清楚,下面将对本发明实施例中的技术方案进行清楚、完整地描述。实施例中未注明具体条件者,按照常规条件或制造商建议的条件进行。所用试剂或仪器未注明生产厂商者,均为可以通过市售购买获得的常规产品。
本发明实施例提供了用于检测目标SNP位点基因型的试剂在制备用于鉴定水稻适种区域的试剂盒中的应用,所述目标SNP位点的表1所述SNP位点中的至少197个位点。
发明人经一系列创造性发现提供了表1的SNP位点,这些位点均是在水稻基因组的编码基因中改变氨基酸编码的SNP位点,且在不同种植区域的水稻,在上述目标SNP位点的基因型具有显著差异,而通过对样本的目标SNP位点的基因型进行检测,能够达到有效鉴定或预测样本适种区域的目的,本发明的鉴定水稻适种区域的流程示意图可参照图1。
在优选的实施例中,所述目标SNP位点包括上表所示SNP位点中的至少296个位点;更优选地,所述目标SNP位点包括上表所示SNP位点中的至少395个位点;进一步优选地,所述目标SNP位点包括上表所示SNP位点,相比上述SNP位点的组合,表1中所有SNP位点的组合能够更全面、稳定且有效地实现预测样本适种区域的效果。
在优选的实施例中,所述水稻选自籼稻和粳稻中的至少一种。
在优选的实施例中,所述适种区域包括基于经纬度划分的地理区域。
在优选的实施例中,所述适种区域包括:纬度范围为北纬20~25度区间的区域R1、纬度范围为北纬25~30度区间的区域R2、纬度范围为北纬30~35度区间的区域R3、纬度范围为北纬35~40度区间的区域R4、纬度范围≥北纬40度区间的区域R5。
在优选的实施方式中,所述区域R1可选自中国广东省和广西壮族自治区中至少一个省份的可种植区域;所述区域R2可选自中国福建省、湖南省、江西省和浙江省中至少一个省份的可种植区域;所述区域R3可选自中国安徽省、河南省、湖北省、江苏省、上海市和四川省中至少一个省份的可种植区域;所述区域R4可选自中国宁夏回族自治区、山东省和天津市中至少一个地方的可种植区域;所述区域R5可选自中国黑龙江省、吉林省和辽宁省中至少一个省份的可种植区域。
本发明实施例还提供了一种用于鉴定水稻适种区域的试剂盒,其包括:用于检测目标SNP位点基因型的试剂,所述目标SNP位点包括如前述任意实施例所述的应用中的目标SNP位点。
在可选的实施例中,所述试剂可选自引物对、探针和芯片中的任意一种或几种的组合。
本发明实施例还提供了一种水稻适种区域的预测模型的训练方法,其包括以下步骤:
获取训练样本的目标SNP位点的检测信息以及对应的标注结果,其中,所述目标SNP位点包括如前述任意实施例所述的应用中的目标SNP位点;
将训练样本的目标SNP位点的检测信息以及对应的标注结果输入预先构建好的预测模型,获得训练样本的预测结果;其中,所述预测模型用于根据样本所述目标SNP位点的基因型信息判断样本的适种区域;
基于所述标注结果和所述预测结果对构建的预测模型进行参数更新。
水稻是比较重要的粮食作物,本发明通过高通量测序的方法,检测和分析籼稻和粳稻群体中SNP位点的基因型,采用机器学习和人工智能的方法,构建分析模型,用于对待测水稻品种进行适种区预测。综合预测准确率超过90%,相对于常规的适种区域评价方法,具有成本低,时间短等技术优势。
需要说明的是,所述标注结果可以为每个训练样本的目标SNP位点的检测信息对应的适种区域或表明适种区域的指代符号。
在可选的实施方式中,所述训练样本的样本量≥10,优选为≥30,更优选为≥50。
本发明实施例还提供了一种水稻适种区域的预测模型的预测方法,其包括:将待测样本的目标SNP位点的检测信息输入预先训练好的预测模型中,获得待测样本的预测结果,其中,所述目标SNP位点包括前述任意实施例所述的应用中的目标SNP位点,所述预测模型为如前述任意实施例所述的水稻适种区域的预测模型的训练方法训练获得的。
本发明实施例还提供了一种水稻适种区域的预测模型的预测装置,其包括获取模块和预测模块。
获取模块用于获取待测样本的目标SNP位点的检测信息。预测模块用于将待测样本的目标SNP位点的检测信息输入预先训练好的预测模型中,获得待测样本的预测结果。其中,所述目标SNP位点包括如前述任意实施例所述的应用中的目标SNP位点,所述预测模型为前述任意实施例所述的水稻适种区域的预测模型的训练方法训练获得的。
本发明实施例还提供了一种水稻适种区域的预测模型的训练装置,其包括获取模块、处理模块和参数更新模块。
获取模块用于获取训练样本的目标SNP位点的检测信息以及对应的标注结果,其中,所述目标SNP位点包括如前述任意实施例所述的应用中的目标SNP位点。处理模块,用于将所述训练样本的目标SNP位点的检测信息以及对应的标注结果输入预先构建好的预测模型,获得训练样本的预测结果;其中,所述预测模型用于根据样本所述目标SNP位点的基因型信息判断样本的适种区域。参数更新模块用于根据所述标注结果和所述预测结果对构建的预测模型进行参数更新。
本发明实施例还提供了一种电子设备,其包括:处理器和存储器;所述存储器用于存储程序,当所述程序被所述处理器执行时,使得所述处理器实现如前述任意实施例所述的水稻适种区域的预测模型的训练方法,或,如前述任意实施例所述的水稻适种区域的预测模型的预测方法。
存储器可以是但不限于,随机存取存储器(Random Access Memory,RAM),只读存储器(Read Only Memory,ROM),可编程只读存储器(Programmable Read-Only Memory,PROM),可擦除只读存储器(Erasable Programmable Read-Only Memory,EPROM),电可擦除只读存储器(Electric Erasable Programmable Read-Only Memory,EEPROM)等。
处理器可以是一种集成电路芯片,具有信号处理能力。该处理器可以是通用处理器,包括中央处理器(Central Processing Unit,CPU)、网络处理器(Network Processor,NP)等;还可以是数字信号处理器(Digital Signal Processing,DSP)、专用集成电路(Application Specific Integrated Circuit,ASIC)、现场可编程门阵列(Field-Programmable Gate Array,FPGA)或者其他可编程逻辑器件、分立门或者晶体管逻辑器件、分立硬件组件。
在实际应用中,该电子设备可以是服务器、云平台、手机、平板电脑、笔记本电脑、超级移动个人计算机(ultra-mobile personal computer,UMPC)、手持计算机、上网本、个人数字助理(personal digital assistant,PDA)、可穿戴电子设备、虚拟现实设备等设备,因此本申请实施例对电子设备的种类不做限制。
本发明实施例还提供了一种计算机可读介质,其上存储有计算机程序,该计算机程序被处理器执行时实现如前述任意实施例所述的水稻适种区域的预测模型的训练方法,或,如前述任意实施例所述的水稻适种区域的预测模型的预测方法。
计算机可读介质包括:U盘、移动硬盘、只读存储器、随机存取存储器、磁碟或者光盘等各种可以存储程序代码的介质。
以下结合实施例对本发明的特征和性能作进一步的详细描述。
实施例1
一种用于鉴定水稻适种区域的预测模型的构建方法,其包括以下步骤。
(1)检测全国各种植区域水稻样品的基因型信息。
SNP位点的检测采用bwa(版本不限,http://bio-bwa.sourceforge.net/)比对软件对双末端序列进行比对,比对策略选择aln或mem方法均可,参数选择默认参数。使用samtools(版本1.9,http://samtools.sourceforge.net/)对比对的序列进行整理排序,使用python编写的计算机程序,获取SNP位点的基因型。
(2)根据纬度分布,将全国主要水稻种植区域划分为5个区域,划分规则如下。
R1纬度范围:约北纬20-25度区间,对应主要水稻种植区:广东省,广西壮族自治区;
R2纬度范围:约北纬25-30度区间,对应主要水稻种植区:福建省,湖南省,江西省,浙江省;
R3纬度范围:约北纬30-35度区间,对应主要水稻种植区:安徽省、河南省、湖北省、江苏省、上海市、四川省;
R4纬度范围:约北纬35~40度区间,对应主要水稻种植区:宁夏回族自治区、山东省、天津市;
R5纬度范围:约北纬40度以上区域,对应主要水稻种植区:黑龙江省、吉林省、辽宁省。
(3)分析水稻区间与核心位点之间的相关性。
采用相关性分析方法,分析SNP位点基因型与纬度坐标之间的相关性。选择相关性大于0.9的494个SNP位点(表1所示)作为目标SNP位点(纬度分布相关SNP位点列表)用于后续的预测模型构建与种植区域预测。
(4)机器学习与预测模型的构建。
将样品集中的样品划分为训练集和测试集。训练集和测试集均包含R1~R5区域的水稻样本。
获取训练样本所述目标SNP位点的基因型信息以及对应的标注结果(种植区域R1~R5中的任意一种);
将训练样本所述目标SNP位点的基因型信息以及对应的标注结果作为输入数据,基于神经网络建预测模型,输出训练样本的预测结果(适种区域)。
基于标注结果和预测结果,不断地完善预测模型的参数,提高模型的预测能力,获得训练好的用于鉴定水稻适种区域的预测模型。
在本实施例中,将741例样品中80%的样品设为训练集,20%的样品作为测试集,在其他实施例中,训练集和测序集的样本量也可以设置为其他比例。将构建好的预测模型用于预测(预测的具体步骤可参照后续的实施例2),包括:将待测样本SNP位点的基因型信息输入训练好的预测模型中,获得样本的预测结果。
实施例1构建的预测模型对籼稻和粳稻的预测结果如图2和表2~3所示。
表2. 籼稻预测区域与实际区域统计表
可见,预测模型对籼稻的平均预测正确率为91.03%。
表3. 籼稻预测区域与实际区域统计表
可见,预测模型对粳稻的平均预测正确率为93.95%。
实施例2
一种水稻适种区域的预测模型的预测方法,其包括采用本发明构建的预测模型对待测水稻进行适种区域的预测,具体如下。
(1)样品测序与预测。
DNA提取:
1.取植物新鲜水稻组(根,茎,叶均可,没有限制)织约100 mg 或干重组织约30mg,加入液氮充分研磨。
2.将研磨好的粉末迅速转移到预先装有700 μL 65℃预热缓冲液GPL 的离心管中,加入1 μL RNase,迅速颠倒混匀后,将离心管放在65℃水浴20 min,水浴过程中颠倒离心管以混匀样品。
3.加入700 μL 氯仿,充分混匀,12,000 rpm (~13,400 ×g )离心5 min。
4.小心将上一步所得上层水相转入一个新的离心管中,加入等体积缓冲液GPD,充分混匀。
5.将混匀的液体转入纯化柱,静置1 min,12,000 rpm 离心30 sec,弃滤液。(吸附柱容积为700 μL 左右,可分次加入离心)。
6.向纯化柱中加入500 μL 去蛋白液PS。12,000 rpm 离心30 sec,弃滤液。
7.向纯化柱中加入500 μL 漂洗液PE。12,000 rpm 离心30 sec,弃滤液。
8.重复步骤7,向纯化柱中加入500 μL 漂洗液PE。12,000 rpm 离心30 sec,弃滤液。
9.离心纯化柱,12,000 rpm 离心2 min,以彻底去除纯化柱中残留的液体。
10.将纯化柱置于新的1.5 ml 离心管中。向纯化柱中央处,悬空滴加40-100μl 纯化液TE。室温放置2 min。12,000 rpm 离心2 min,管底即为高纯度基因组DNA。-20℃保存。
11.采用1%琼脂糖电泳检测DNA样品是否有降解及蛋白杂质,采用NanoDrop 2000spectrophotometer (Thermo Scientific)分光光度计检测样品纯度(OD260/280),采用Invitrogen Qubit Fluorometer进行DNA浓度检测。
(2)文库构建与高通量测序。
检测合格后的样品根据Illumina DNA文库构建标准流程,构建双末端测序文库(不限于Illumina测序平台,BGISeq,Pacbio,Nanopore测序平台均可)。文库构建完成后以qPCR方法和Agilent 2100 Bioanalyzer(Agilent Technologies,USA)进行质控。对质检合格的DNA文库采用 Illumina Novaseq6000( Illumina, USA)高通量测序平台进行测序,测序策略为 PE150(Pair-End 150),测序量为1Gb/样品。
(3)SNP检测。
SNP检测采用bwa(版本:0.7.16a-r1181,http://bio-bwa.sourceforge.net/)比对软件对双末端序列进行比对(Li H. et al. 2009),比对策略选择aln+sampe方法,比对错配率设为0.04,其余参数选择默认参数。使用samtools(版本1.9,http://samtools.sourceforge.net/)对比对的序列进行整理排序,使用超低深度核心SNP位点检测策略,检测目标SNP位点的基因型。
(3)种植区域预测。
将待测样本SNP位点的基因型信息输入实施例1训练好的预测模型中,获得样本的预测结果。
实施例3
一种用于鉴定水稻适种区域的预测模型的构建方法,大致与实施例1相同,区别在于用于构建预测模型的SNP位点不同,本实施例随机从实施例1的目标位点中抽取了40%的位点作为最终的目标SNP位点。
实施例4
一种用于鉴定水稻适种区域的预测模型的构建方法,一种用于鉴定水稻适种区域的预测模型的构建方法,大致与实施例2相同,区别在于用于构建预测模型的SNP位点不同,本实施例随机从实施例1的目标位点中抽取了60%的位点作为最终的目标SNP位点。
实施例5
一种用于鉴定水稻适种区域的预测模型的构建方法,一种用于鉴定水稻适种区域的预测模型的构建方法,大致与实施例2相同,区别在于用于构建预测模型的SNP位点不同,本实施例随机从实施例1的目标位点中抽取了80%的位点作为最终的目标SNP位点。
验证例1
验证不同的SNP位点对预测结果的测试效果。
基于实施例1(全部相关SNP),设置5组对照组,对XX样本进行预测,对照组的信息如下:
对照组1(随机抽取40%相关SNP):采用实施例3的构建方法获得预测模型,用于预测;
对照组2(随机抽取60%相关SNP):采用实施例4的构建方法获得预测模型,用于预测;
对照组3(随机抽取80%相关SNP):采用实施例5的构建方法获得预测模型,用于预测;
对照组4(全基因组随机SNP):在全基因组范围内随机选择SNP位点(500个位点)作为目标位点,进行预测模型构建,构建流程同实施例1;
对照组5(四重简并SNP):在全基因组范围内随机选择SNP位点(500个位点)作为目标位点,进行预测模型构建,构建流程同实施例1。
上述分析重复测试10次,进行差异统计,结果如图3以及表4所示。
表4. 不同类型SNP预测正确率统计表
验证例2
基于实施例1提供的预测模型的构建方法,设置5组试验例,每组试验例设置不同样品比例的训练集和测试集,对预测模型的准确性进行评估。
采用的实施例1的模型构建方法,从样品库中随机选择不同比例的样品作为模型的训练集合测试集741个水稻品种,分析模型预测的准确率。5组试验例分别为:50/50、60/40、70/30、80/20和90/10。其中,50/50代表:定义50%的样品为训练集,50%的样品为测试集,依此类推。每个集合随机抽样3次进行测试,结果如图4~5所示。
以上所述仅为本发明的优选实施例而已,并不用于限制本发明,对于本领域的技术人员来说,本发明可以有各种更改和变化。凡在本发明的精神和原则之内,所作的任何修改、等同替换、改进等,均应包含在本发明的保护范围之内。
Claims (10)
1.用于检测目标SNP位点基因型的试剂在制备用于鉴定水稻适种区域的试剂盒中的应用,其特征在于,所述目标SNP位点包括如下SNP位点列表中所示SNP位点中的至少296个位点;
所述水稻选自籼稻和粳稻中的至少一种,所述适种区域包括:纬度范围为北纬20~25度区间的区域R1、纬度范围为北纬25~30度区间的区域R2、纬度范围为北纬30~35度区间的区域R3、纬度范围为北纬35~40度区间的区域R4、纬度范围≥北纬40度区间的区域R5,R1~R5的经度范围为东经73~135度。
2.根据权利要求1所述的用于检测目标SNP位点基因型的试剂在制备用于鉴定水稻适种区域的试剂盒中的应用,其特征在于,所述目标SNP位点包括所述SNP位点列表中所示SNP位点中的至少395个位点。
3.根据权利要求2所述的用于检测目标SNP位点基因型的试剂在制备用于鉴定水稻适种区域的试剂盒中的应用,其特征在于,所述目标SNP位点包括所述SNP位点列表中的所有位点。
4.一种用于鉴定水稻适种区域的试剂盒,其特征在于,其包括:用于检测目标SNP位点基因型的试剂,所述目标SNP位点包括如权利要求1~3任一项所述的应用中的目标SNP位点。
5.一种水稻适种区域的预测模型的训练方法,其特征在于,其包括:
获取训练样本的目标SNP位点的检测信息以及对应的标注结果,其中,所述目标SNP位点包括如权利要求1~3任一项所述的应用中的目标SNP位点;
将训练样本的目标SNP位点的检测信息以及对应的标注结果输入预先构建好的预测模型,获得训练样本的预测结果;其中,所述预测模型用于根据样本所述目标SNP位点的基因型信息判断样本的适种区域,所述预测模型为神经网络预测模型;
基于所述标注结果和所述预测结果对构建的预测模型进行参数更新。
6.一种水稻适种区域的预测模型的预测方法,其特征在于,其包括:将待测样本的目标SNP位点的检测信息输入预先训练好的预测模型中,获得待测样本的预测结果,其中,所述目标SNP位点包括如权利要求1~3任一项所述的应用中的目标SNP位点,所述预测模型为如权利要求5所述的水稻适种区域的预测模型的训练方法训练获得的。
7.一种水稻适种区域的预测模型的预测装置,其特征在于,其包括:
获取模块,用于获取待测样本的目标SNP位点的检测信息;
预测模块,用于将待测样本的目标SNP位点的检测信息输入预先训练好的预测模型中,获得待测样本的预测结果;其中,所述目标SNP位点包括如权利要求1~3任一项所述的应用中的目标SNP位点,所述预测模型为如权利要求5所述的水稻适种区域的预测模型的训练方法训练获得的。
8.一种水稻适种区域的预测模型的训练装置,其特征在于,其包括:
获取模块,用于获取训练样本的目标SNP位点的检测信息以及对应的标注结果,其中,所述目标SNP位点包括如权利要求1~3任一项所述的应用中的目标SNP位点;
处理模块,用于将所述训练样本的目标SNP位点的检测信息以及对应的标注结果输入预先构建好的预测模型,获得训练样本的预测结果;其中,所述预测模型用于根据样本所述目标SNP位点的基因型信息判断样本的适种区域;
参数更新模块,用于根据所述标注结果和所述预测结果对构建的预测模型进行参数更新。
9.一种电子设备,其特征在于,其包括:处理器和存储器;所述存储器用于存储程序,当所述程序被所述处理器执行时,使得所述处理器实现如权利要求5所述的水稻适种区域的预测模型的训练方法,或,如权利要求6所述的水稻适种区域的预测模型的预测方法。
10.一种计算机可读介质,其上存储有计算机程序,其特征在于,该计算机程序被处理器执行时实现如权利要求5所述的水稻适种区域的预测模型的训练方法,或,如权利要求6所述的水稻适种区域的预测模型的预测方法。
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202111288744.4A CN113930538B (zh) | 2021-11-02 | 2021-11-02 | 一种用于鉴定水稻适种区域的试剂盒及其预测方法和预测模型的构建 |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202111288744.4A CN113930538B (zh) | 2021-11-02 | 2021-11-02 | 一种用于鉴定水稻适种区域的试剂盒及其预测方法和预测模型的构建 |
Publications (2)
Publication Number | Publication Date |
---|---|
CN113930538A CN113930538A (zh) | 2022-01-14 |
CN113930538B true CN113930538B (zh) | 2024-01-23 |
Family
ID=79285277
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202111288744.4A Active CN113930538B (zh) | 2021-11-02 | 2021-11-02 | 一种用于鉴定水稻适种区域的试剂盒及其预测方法和预测模型的构建 |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN113930538B (zh) |
Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN110066886A (zh) * | 2019-05-28 | 2019-07-30 | 广州瑞科基因科技有限公司 | 一种鉴定水稻品种的试剂、方法及应用 |
CN111816303A (zh) * | 2020-07-08 | 2020-10-23 | 深圳承启生物科技有限公司 | 一种基于机器学习的难治性精神分裂症风险的预测方法 |
Family Cites Families (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
MX2018015256A (es) * | 2016-06-08 | 2019-04-25 | Monsanto Technology Llc | Metodos para identificar cruzas para usar en reproduccion de plantas. |
-
2021
- 2021-11-02 CN CN202111288744.4A patent/CN113930538B/zh active Active
Patent Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN110066886A (zh) * | 2019-05-28 | 2019-07-30 | 广州瑞科基因科技有限公司 | 一种鉴定水稻品种的试剂、方法及应用 |
CN111816303A (zh) * | 2020-07-08 | 2020-10-23 | 深圳承启生物科技有限公司 | 一种基于机器学习的难治性精神分裂症风险的预测方法 |
Non-Patent Citations (3)
Title |
---|
Development of Genome-Wide DNA Polymorphism Database for Map-Based Cloning of Rice Genes;Ying-Jia Shen等;《Plant Physiol》;第135卷(第3期);第1198-1205页 * |
The ties of brotherhood between japonica and indica rice for regional adaptation;Man Wang等;《Sci China Life Sci》;第65卷(第7期);第1369-1379页 * |
籼稻和粳稻的高效分子鉴定方法及其在水稻育种和进化研究中的意义;卢宝荣等;《自然科学进展》(第6期);第628-638页 * |
Also Published As
Publication number | Publication date |
---|---|
CN113930538A (zh) | 2022-01-14 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Vogel et al. | Footprints of parasitism in the genome of the parasitic flowering plant Cuscuta campestris | |
Yue et al. | Contrasting evolutionary genome dynamics between domesticated and wild yeasts | |
Tarver et al. | The interrelationships of placental mammals and the limits of phylogenetic inference | |
Tajima et al. | Genetic origins of the Ainu inferred from combined DNA analyses of maternal and paternal lineages | |
Bell et al. | RNA-seq analysis of allele-specific expression, hybrid effects, and regulatory divergence in hybrids compared with their parents from natural populations | |
Streets et al. | Microfluidic single-cell whole-transcriptome sequencing | |
Gentekaki et al. | Large-scale phylogenomic analysis reveals the phylogenetic position of the problematic taxon Protocruzia and unravels the deep phylogenetic affinities of the ciliate lineages | |
Jiang et al. | Comparative and phylogenetic analyses of six Kenya Polystachya (Orchidaceae) species based on the complete chloroplast genome sequences | |
Wielstra et al. | Phylogenomics of the adaptive radiation of Triturus newts supports gradual ecological niche expansion towards an incrementally aquatic lifestyle | |
Liu et al. | Cryptic lineages and potential introgression in a mixed‐ploidy species (Phragmites australis) across temperate China | |
Maca-Meyer et al. | Status and relationships of the extinct giant Canary Island lizard Gallotia goliath (Reptilia: Lacertidae), assessed using ancient mtDNA from its mummified remains | |
Baleka et al. | Estimating the dwarfing rate of an extinct Sicilian elephant | |
CN112951330A (zh) | 一种基于宏组学技术从复杂环境体系中获知全程氨氧化微生物相对丰度和活性的方法 | |
Zou et al. | Coupling between ribotypic and phenotypic traits of protists across life cycle stages and temperatures | |
Zhao et al. | Novel genome sequence of Chinese cavefish (Triplophysa rosa) reveals pervasive relaxation of natural selection in cavefish genomes | |
CN113930538B (zh) | 一种用于鉴定水稻适种区域的试剂盒及其预测方法和预测模型的构建 | |
Kuo et al. | Weak gene–gene interaction facilitates the evolution of gene expression plasticity | |
Zhang et al. | A phylogenomic framework and divergence history of Cephalochordata amphioxus | |
CN113789394A (zh) | 一种鉴定三疣梭子蟹氨氮耐受性状的分子标记c13及其应用 | |
Liao et al. | Phylogeographic analysis reveals two cryptic species of the endangered fern Ceratopteris thalictroides (L.) Brongn.(Parkeriaceae) in China | |
Tripp et al. | Origin of African Physacanthus (Acanthaceae) via wide hybridization | |
Diz et al. | RNA-seq data from mature male gonads of marine mussels Mytilus edulis and M. galloprovincialis | |
Bray et al. | Convergence and novelty in adaptation to whole genome duplication in three independent polyploids | |
Al-Jumaili et al. | The usefulness of maternally inherited genetic markers for phylogeographic studies in village chicken | |
Liu et al. | Detection of genome-wide structural variations in the Shanghai Holstein cattle population using next-generation sequencing |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |