CN110656184B - SNP marker combination for distinguishing Chinese Han nationality, Japan, Korea/Korea population - Google Patents

SNP marker combination for distinguishing Chinese Han nationality, Japan, Korea/Korea population Download PDF

Info

Publication number
CN110656184B
CN110656184B CN201810688113.3A CN201810688113A CN110656184B CN 110656184 B CN110656184 B CN 110656184B CN 201810688113 A CN201810688113 A CN 201810688113A CN 110656184 B CN110656184 B CN 110656184B
Authority
CN
China
Prior art keywords
cjk
snp
korea
population
korean
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201810688113.3A
Other languages
Chinese (zh)
Other versions
CN110656184A (en
Inventor
陈华
赵石磊
石承民
刘琪
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Institute of Genomics of CAS
Original Assignee
Beijing Institute of Genomics of CAS
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Institute of Genomics of CAS filed Critical Beijing Institute of Genomics of CAS
Priority to CN201810688113.3A priority Critical patent/CN110656184B/en
Publication of CN110656184A publication Critical patent/CN110656184A/en
Application granted granted Critical
Publication of CN110656184B publication Critical patent/CN110656184B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/68Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
    • C12Q1/6876Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes
    • C12Q1/6888Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes for detection or identification of organisms
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q2600/00Oligonucleotides characterized by their use
    • C12Q2600/156Polymorphic or mutational markers

Landscapes

  • Chemical & Material Sciences (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Analytical Chemistry (AREA)
  • Proteomics, Peptides & Aminoacids (AREA)
  • Organic Chemistry (AREA)
  • Zoology (AREA)
  • Wood Science & Technology (AREA)
  • Health & Medical Sciences (AREA)
  • Engineering & Computer Science (AREA)
  • Microbiology (AREA)
  • Immunology (AREA)
  • Molecular Biology (AREA)
  • Biotechnology (AREA)
  • Biophysics (AREA)
  • Physics & Mathematics (AREA)
  • Biochemistry (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Health & Medical Sciences (AREA)
  • Genetics & Genomics (AREA)
  • Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)

Abstract

The invention belongs to the field of biotechnology, and particularly discloses an SNP marker combination for distinguishing Chinese Han nationality, Japan and Korean/Korean population, wherein specific information of the contained SNP molecular markers is shown in Table 1. The SNP marker combination provided by the invention can distinguish Chinese Han nationality, Japanese and Korean/Korean people, and the distinguishing accuracy of the SNP marker combination with different capacities can reach 90%, 95%, 99%, 99.9% and 99.99% respectively.

Description

SNP marker combination for distinguishing Chinese Han nationality, Japan, Korea/Korea population
Technical Field
The present invention belongs to the field of biotechnology, and specifically relates to a combination of SNP markers for distinguishing Chinese Han nationality, Japan, Korea/Korea population.
Background
Chinese han nationality population is very similar to japanese and korean/korean in external physical characteristics, and thus it is very difficult to effectively distinguish and identify these types of population. In recent years, people are frequently communicated socioeconomically between China and Korean, and a large number of people leave their places of birth and travel to other countries, and are merged into each class of local society. Thus, identification of such a mixed population presents a significant challenge to forensic investigation. Identity identification of individuals or remains is the most important task of criminal investigation work, is a restrictive step in investigation work, and directly influences the progress of the whole work and the efficiency of the investigation work. Currently, molecular genetic techniques based on DNA markers are the primary means of such identification efforts. However, the reliable identification of the identity currently needs to analyze a large number of genetic markers to ensure the reliability of the identification result, and people involved in the identification need to have perfect genetic database support to be able to make specific judgment on the identity of the individual. Meanwhile, in many cases, the substance to be tested is mostly a trace amount of DNA sample, and the identity of the test sample is difficult to identify and infer according to the materials.
Single Nucleotide Polymorphisms (SNPs) are the most abundant markers in the human genome, with an average of 1 SNP per 1250 bases. With the development of SNP analysis methods at the genome wide level, a large amount of genome SNP data of people are accumulated continuously. Large-scale studies have shown that SNP markers have important application value in inferring the origin of an individual, such as ethnic or geographic ancestral origin. The stability, abundance, and completeness of allelic frequency patterns of SNP markers make them important markers for individual identification. However, genome-wide SNP analysis is costly, requires large DNA samples, and is difficult to adapt to forensic applications. Therefore, the result of the genome-wide SNP analysis is needed, an efficient SNP molecular marker reference system is constructed, and the reference system is used for reliable discrimination and identification of the human population, so as to meet the needs of forensic analysis. The characteristics of forensic identification samples are combined, a set of simple, efficient and accurate molecular markers suitable for analyzing trace samples and deducing individual sources are provided, and the efficiency of investigation work is greatly improved.
Disclosure of Invention
To solve the problems of the prior art, it is an object of the present invention to provide a marker combination for distinguishing chinese han nationality, japan, korean/korean populations.
In order to realize the purpose of the invention, the technical scheme of the invention is as follows:
the invention firstly provides a marker combination (SNP reference system) for distinguishing Chinese Han nationality, Japan and Korea/Korea population, CJK-22panel for short, which comprises the following 22 SNP molecular markers:
rs12143597、rs11124005、rs9826254、rs2035023、rs17599827、rs17145135、rs10088365、rs11774451、rs2976396、rs16934422、rs7095575、rs978605、rs12805212、rs7117447、rs11220012、rs4578397、rs17823795、rs1256519、rs10131476、rs10083689、rs6070701、rs4820428。
the average classification accuracy of CJK-22panel on Chinese Han nationality, Japan, Korea/Korea population can reach 90%.
Further, in order to improve the average classification accuracy of the SNP reference system based on the CJK-22panel, the invention supplements 13 new SNP molecular markers based on the CJK-22panel to form a new marker combination (SNP reference system), called CJK-35panel for short, which comprises the following 35 SNP molecular markers:
rs12143597、rs11124005、rs9826254、rs2035023、rs17599827、rs17145135、rs10088365、rs11774451、rs2976396、rs16934422、rs7095575、rs978605、rs12805212、rs7117447、rs11220012、rs4578397、rs17823795、rs1256519、rs10131476、rs10083689、rs6070701、rs4820428、rs12134013、rs16825115、rs614664、rs40184、rs7721953、rs17172725、rs7006443、rs1436211、rs4939526、rs7958890、rs4533076、rs12596185、rs6016226。
the average classification accuracy of CJK-35panel on Chinese Han nationality, Japan, Korea/Korea population can reach 95%.
Further, on the basis of the CJK-35panel, in order to improve the average classification accuracy of the SNP reference system, the invention supplements 31 new SNP molecular markers on the basis of the CJK-35panel to form a new marker combination (SNP reference system), called CJK-66panel for short, comprising the following 66 SNP molecular markers:
rs12143597、rs11124005、rs9826254、rs2035023、rs17599827、rs17145135、rs10088365、rs11774451、rs2976396、rs16934422、rs7095575、rs978605、rs12805212、rs7117447、rs11220012、rs4578397、rs17823795、rs1256519、rs10131476、rs10083689、rs6070701、rs4820428、rs12134013、rs16825115、rs614664、rs40184、rs7721953、rs17172725、rs7006443、rs1436211、rs4939526、rs7958890、rs4533076、rs12596185、rs6016226、rs7555405、rs12406216、rs12565326、rs1222144、rs16834705、rs1456557、rs9854952、rs6799357、rs7674135、rs4690508、rs1152163、rs1428150、rs17641748、rs12234078、rs160357、rs17152163、rs13264712、rs10961684、rs2183966、rs3181200、rs12768145、rs11001479、rs10861065、rs2298236、rs11846710、rs808226、rs11625485、rs11631923、rs2642066、rs1864307、rs928284。
the average classification accuracy of CJK-66panel on Chinese Han nationality, Japan, Korea/Korea population can reach 99%.
Still further, the present invention is based on CJK-66panel, which is supplemented with 35 new SNP molecular markers to form a new marker combination (SNP reference system), CJK-101panel for short, comprising the following 101 SNP molecular markers:
rs12143597、rs11124005、rs9826254、rs2035023、rs17599827、rs17145135、rs10088365、rs11774451、rs2976396、rs16934422、rs7095575、rs978605、rs12805212、rs7117447、rs11220012、rs4578397、rs17823795、rs1256519、rs10131476、rs10083689、rs6070701、rs4820428、rs12134013、rs16825115、rs614664、rs40184、rs7721953、rs17172725、rs7006443、rs1436211、rs4939526、rs7958890、rs4533076、rs12596185、rs6016226、rs7555405、rs12406216、rs12565326、rs1222144、rs16834705、rs1456557、rs9854952、rs6799357、rs7674135、rs4690508、rs1152163、rs1428150、rs17641748、rs12234078、rs160357、rs17152163、rs13264712、rs10961684、rs2183966、rs3181200、rs12768145、rs11001479、rs10861065、rs2298236、rs11846710、rs808226、rs11625485、rs11631923、rs2642066、rs1864307、rs928284、rs1413899、rs10803014、rs340514、rs17348624、rs10928436、rs1022037、rs10446439、rs12497859、rs312510、rs13171386、rs2879074、rs13291088、rs1452658、rs12346987、rs12006467、rs7038964、rs7032231、rs11598305、rs4918000、rs17121800、rs11035476、rs624307、rs9587526、rs3211752、rs10483991、rs4369599、rs4886694、rs10775266、rs12928871、rs2377962、rs1217374、rs881118、rs2824082、rs2836901、rs4818015。
the average classification accuracy of CJK-101panel on Chinese Han nationality, Japan, Korea/Korea population can reach 99.9%.
Furthermore, the present invention is based on CJK-101panel, supplemented with 94 new SNP molecular markers to form a new marker combination (SNP reference system), CJK-195panel for short, which comprises 195 SNP molecular markers as follows:
rs12143597、rs11124005、rs9826254、rs2035023、rs17599827、rs17145135、rs10088365、rs11774451、rs2976396、rs16934422、rs7095575、rs978605、rs12805212、rs7117447、rs11220012、rs4578397、rs17823795、rs1256519、rs10131476、rs10083689、rs6070701、rs4820428、rs12134013、rs16825115、rs614664、rs40184、rs7721953、rs17172725、rs7006443、rs1436211、rs4939526、rs7958890、rs4533076、rs12596185、rs6016226、rs7555405、rs12406216、rs12565326、rs1222144、rs16834705、rs1456557、rs9854952、rs6799357、rs7674135、rs4690508、rs1152163、rs1428150、rs17641748、rs12234078、rs160357、rs17152163、rs13264712、rs10961684、rs2183966、rs3181200、rs12768145、rs11001479、rs10861065、rs2298236、rs11846710、rs808226、rs11625485、rs11631923、rs2642066、rs1864307、rs928284、rs1413899、rs10803014、rs340514、rs17348624、rs10928436、rs1022037、rs10446439、rs12497859、rs312510、rs13171386、rs2879074、rs13291088、rs1452658、rs12346987、rs12006467、rs7038964、rs7032231、rs11598305、rs4918000、rs17121800、rs11035476、rs624307、rs9587526、rs3211752、rs10483991、rs4369599、rs4886694、rs10775266、rs12928871、rs2377962、rs1217374、rs881118、rs2824082、rs2836901、rs4818015、rs9660866、rs12045644、rs4839460、rs1546161、rs1883905、rs12119983、rs35213472、rs7599196、rs12471916、rs2670740、rs7058、rs1921824、rs16850913、rs16863814、rs6436971、rs17008734、rs1398101、rs4353835、rs11129867、rs1512821、rs775779、rs17043433、rs7642488、rs13078671、rs10935306、rs12488690、rs955013、rs6852170、rs6813625、rs4861532、rs154719、rs2589787、rs17783275、rs26661、rs1609763、rs17107113、rs17676033、rs10277926、rs10249357、rs10265406、rs3923736、rs6979649、rs4831326、rs2410564、rs189509、rs10972006、rs12351269、rs4571809、rs7855025、rs12351851、rs1999432、rs6559935、rs11787920、rs6478966、rs7072215、rs12244293、rs11034709、rs17712923、rs12420406、rs11224805、rs10431079、rs17121524、rs7106815、rs10849181、rs11104947、rs3843636、rs10861462、rs7138673、rs7991339、rs9532080、rs2325476、rs9530168、rs4883926、rs16947714、rs2762148、rs1854743、rs12588915、rs11626859、rs12434392、rs1951486、rs12440686、rs1420288、rs3909546、rs829154、rs16944492、rs2278339、rs5022079、rs1117941、rs16991180、rs6030932、rs1333294、rs181146、rs1524930、rs131864。
the average classification accuracy of CJK-195panel on Chinese Han nationality, Japan, Korea/Korea population can reach 99.99%.
The research of the invention finds that when the total number of the SNPs is 195, the classification accuracy reaches 99.99%. As the last 94 SNPs only increase the classification accuracy by 0.09%, CJK-22panel, CJK-35panel, CJK-66panel and CJK-101panel have been able to meet the basic application requirements in view of the actual application cost.
The specific information of the SNP molecular markers of the present invention is shown in Table 1:
TABLE 1
Figure BDA0001712206250000061
Figure BDA0001712206250000071
Figure BDA0001712206250000081
Figure BDA0001712206250000091
Figure BDA0001712206250000101
Further, the present invention provides the use of the aforementioned SNP marker combination in any one of:
(1) constructing a DNA chip or a multiple PCR genotype analysis or other applicable kit;
(2) identifying the identity of the people in China, Japan and Korean;
(3) genetic analysis of east Asia population.
It should be noted that, those skilled in the art can design a primer (a set of primers) or a gene chip by applying conventional technical means according to the specific information of the SNP molecular markers involved in the SNP marker combination of the invention, which is not limited in the present invention, but the set of primers or the gene chip designed according to the SNP marker combination provided by the present invention for detecting any one of the SNP marker combinations described above belong to the protection scope of the present invention.
The primer set or gene chip can be applied to any one of the following aspects:
(1) constructing a DNA chip or a multiple PCR genotype analysis or other applicable kit;
(2) identifying the identity of the people in China, Japan and Korean;
(3) genetic analysis of east Asia population.
The invention has the beneficial effects that:
the invention extracts SNP marker combinations with high information division from massive genome data to form reference systems which can be used for distinguishing Chinese Han nationality, Japanese and Korean/Korean crowds, and the distinguishing accuracy rates of the reference systems with different sizes can respectively reach 90%, 95%, 99%, 99.9% and 99.99%.
Drawings
FIG. 1 is a graph showing the variation of the average classification accuracy of SNP numbers in example 2.
FIG. 2 is a comparison of the results of PCA plots for all 535330 SNPs in example 2, CJK-22panel, CJK-35panel, CJK-66panel, CJK-101panel, and CJK-195 panel.
Detailed Description
The present invention is further illustrated by the following examples. It is to be understood that the following examples are given for illustrative purposes only and are not intended to limit the scope of the present invention. Various modifications and alterations of this invention will become apparent to those skilled in the art without departing from the spirit and scope of this invention.
The experimental procedures used in the following examples are all conventional procedures unless otherwise specified.
Materials, reagents and the like used in the following examples are commercially available unless otherwise specified.
Example 1
In this embodiment, samples and data information of chinese han nationality people and japanese population are screened from a thousand people genome project (1000GP) database, samples of korea/korean population are screened from a korean individual genome project (KPGP) database, samples of korea/korean population and han nationality people are screened from an asian diversity project database (ADP), the total number of formed samples is 585, the number of chinese han nationality samples is 293, the number of japanese samples is 103, and the number of korean han samples is 189. The sample and data information used in this embodiment are all disclosed in the prior art.
1000GP the 1000genome Project, thousand human genome Project, Altsuhler et al (2015) A global reference for human genetic variation, Nature 526: 68-74.
KPGP: the Korea Personal genome Project, Korea genome Project, Zhang et al (2014) where genome sequencing of 35 induced viruses is involved in the genetic architecture of BMC Bioinformatics 15: S6.
ADP: the Asian Diversity Project, Asian Diversity program, Liu et al.
TABLE 2 sample sources
Figure BDA0001712206250000131
This example illustrates the extraction of CJK-22panel, CJK-35panel, CJK-66panel, CJK-101panel and CJK-195panel from a total of 535330 SNPs in the above sample.
The CJK panel construction method specifically comprises the following steps:
(1) data partitioning
Taking a known genetic marker database as a sample, and segmenting sample data according to genetic information difference to obtain two types: han nationality-japan-korea/korea and japan-korea/korea.
(2) And (3) data filtering:
separately calculating the F of the genetic markers in two classesSTAnd the genetic markers in the two classes are sorted in descending order, and the first 20000 genetic markers arranged in the two classes are reserved;
(3) selecting a genetic marker: the SNPTag method was used to pick 200 each of 20000 genetic markers in the two types of data, making up two MaC pools. The two MaC pools were then fused to obtain the final SNP marker combination. Setting the fusion termination threshold as: 1) AAC of the SNP marker combination reaches 0.9999; 2) the number of SNPs in the SNP marker combination reaches 200. Finally, the termination condition 1) was satisfied when the number of SNPs was 195).
(4) In 195 SNP combinations, the first 22 SNPs are extracted to construct CJK-22panel, and the ancestral inference accuracy rate can reach 90 percent; extracting the first 35 SNPs to construct CJK-35panel, wherein the ancestral inference accuracy can reach 95%; the CJK-66panel is constructed by extracting the first 66 SNPs, and the ancestral inference accuracy can reach 99%; the CJK-101panel is constructed by extracting the first 101 SNPs, and the ancestral inference accuracy can reach 99.9 percent; CJK-195panel is constructed by all 195 SNP, and the ancestral inference accuracy can reach 99.99%.
Example 2
Example 1 a combination of SNPs that is favorable for ancestral inference was extracted from a total of 535330 SNPs. The algorithm can balance the ancestral inference ability of the SNPs themselves and the information overlap between different SNPs to obtain the best combined inference effect. The screened SNPs are sequentially added, and the average classification accuracy AAC is calculated, and the obtained curve is shown in FIG. 1. The classification accuracy AC is defined as the ratio of the number of correctly classified samples to the total number of test samples,
Figure BDA0001712206250000141
the average classification accuracy (AAC) is defined as the average of 1000 repeated AC values under randomly selected test sets.
In this example, the performance evaluation of the SNP reference system obtained in example 1 was carried out in three ways. The first way is to directly compare the true and predicted ancestors; the second way is to calculate the commonly used indexes in the classification problem, including Sensitivity (Sensitivity), Specificity (Specificity), Positive Predictive Value (PPV), and Negative Predictive Value (NPV); the third way is to visually analyze the performance of the SNP reference frame by PCA plot.
(1) Comparing true and predicted ancestors
TABLE 3 CJK-22panel
Figure BDA0001712206250000142
TABLE 4 CJK-35panel
Figure BDA0001712206250000143
TABLE 5 CJK-66panel
Figure BDA0001712206250000151
TABLE 6 CJK-101panel
Figure BDA0001712206250000152
TABLE 7 CJK-195panel
Figure BDA0001712206250000153
(2) General classification performance index
True TP true Positive (A population is identified as A population)
False negative FN false negative (A population is identified as non-A population)
False positive FP false positive (non-A population is identified as A population)
True negative TN (non-A population is identified as non-A population)
Sensitivity: sensitivity TP/(TP + FN)
Specificity: specificity TN/(FP + TN)
Positive predictive value: PPV is TP/(TP + FP)
Negative predictive value: NPV (TN/(TN + FN)
TABLE 8 CJK-22panel performance
Figure BDA0001712206250000154
TABLE 9 CJK-35panel performance
Figure BDA0001712206250000161
TABLE 10 CJK-66panel performance
Figure BDA0001712206250000162
TABLE 11 CJK-101panel performance
Figure BDA0001712206250000163
TABLE 12 CJK-195panel performance
Figure BDA0001712206250000164
(3) Principal component analysis
The effect of the SNP reference system was verified by principal component analysis. Principal component analysis is a commonly used data feature extraction method. When the SNP is used for deducing the ancestral source of an individual, each SNP of the individual is equivalent to one ancestral attribute label of the individual (each SNP of the individual is equivalent to one clue for deducing the ancestral source of the individual), and a plurality of the ancestral attribute labels are combined together to form a panel, so that the ancestral source deduction of the individual can be effectively carried out.
PCA charts of all 535330 SNPs, CJK-195panel, CJK-101panel, CJK-66panel, CJK-35panel and CJK-22panel were compared as shown in FIGS. 2(1) to (6). In the figure, black circle symbols represent Han individuals, gray square symbols represent Japanese individuals, and plus signs represent Korean individuals. In FIG. 2(1), all symbols of the same type are grouped together without crossing between symbols of different types, indicating that all SNPs contain enough information to distinguish samples of different populations; FIGS. 2(3), (4), (5), (6) are similar to FIG. 2(1), and only a very small portion of individuals cross in the first and second principal component spaces between different groups, but the cross individuals can still be clearly distinguished in other principal component spaces, which shows that although only 22-101 SNPs are included, the included information is enough to effectively deduce the group to which the individual belongs. In addition, CJK-195panel performed better than CJK-101panel, CJK-101panel performed better than CJK-66panel, CJK-66panel performed better than CJK-35panel, and CJK-35panel performed better than CJK-22panel, based on four criteria of sensitivity, specificity, positive predictive value, and negative predictive value (tables 8-12).
Although the invention has been described in detail hereinabove with respect to a general description and specific embodiments thereof, it will be apparent to those skilled in the art that modifications or improvements may be made thereto based on the invention. Accordingly, such modifications and improvements are intended to be within the scope of the invention as claimed.

Claims (7)

1. Set of primers or gene chip for discriminating chinese han nationality, japan, korean/korean population, characterized in that the set of primers or gene chip is used for detecting the following SNP marker combination consisting of the following SNP sites: rs12143597, rs11124005, rs9826254, rs2035023, rs17599827, rs17145135, rs10088365, rs11774451, rs2976396, rs 169383 4422, rs7095575, rs978605, rs12805212, rs7117447, rs11220012, rs4578397, rs17823795, rs1256519, rs10131476, rs10083689, rs6070701, rs 4828.
2. The set of primers or gene chip of claim 1, wherein said SNP marker set further comprises the following SNP sites: rs12134013, rs16825115, rs614664, rs40184, rs7721953, rs17172725, rs7006443, rs1436211, rs4939526, rs7958890, rs4533076, rs12596185, rs 6016226.
3. The set of primers or gene chip of claim 2, wherein said SNP marker set further comprises the following SNP sites: rs7555405, rs12406216, rs12565326, rs1222144, rs16834705, rs1456557, rs9854952, rs6799357, rs7674135, rs4690508, rs1152163, rs1428150, rs17641748, rs12234078, rs160357, rs17152163, rs13264712, rs10961684, rs2183966, rs3181200, rs 12745, rs11001479, rs10861065, rs2298236, rs11846710, rs808226, rs11625485, rs11631923, rs2642066, rs1864307 and rs 928284.
4. The set of primers or gene chip of claim 3, wherein said SNP marker set further comprises the following SNP sites: rs1413899, rs10803014, rs340514, rs17348624, rs10928436, rs1022037, rs10446439, rs12497859, rs312510, rs13171386, rs2879074, rs13291088, rs1452658, rs12346987, rs12006467, rs 1027038964, rs7032231, rs11598305, rs4918000, rs17121800, rs11035476, rs 624304307, rs9587526, rs3211752, rs10483991, rs4369599, rs4886694, rs10775266, rs12928871, rs2377962, rs 7371214, rs881118, rs2824082, rs 2801 and rs 4818015.
5. The set of primers or gene chip of claim 4, wherein said SNP marker set further comprises the following SNP sites: rs9660866, rs12045644, rs4839460, rs1546161, rs1883905, rs12119983, rs35213472, rs7599196, rs12471916, rs2670740, rs7058, rs1921824, rs16850913, rs16863814, rs6436971, rs17008734, rs1398101, rs4353835, rs11129867, rs 15128282821, rs 779, rs17043433, rs7642488, rs 130671, rs10935306, rs 124124690, rs955013, rs 6852172172170, rs 681363625, rs 61532, rs154719 719, rs 177817675, rs 267767163, rs 16017107107113, rs 7677926, rs 10257, rs10265406, rs 3977392, rs 3977799, rs 1247943320798, rs 300794354798, rs 72794354779, rs 72798, rs 29563756375637563756375637569, rs 72563756375637563756375637569, rs 729, rs 725637563756375637563756375637563756375637569, rs 729, rs 7256375637563756375637563756375637563756375637563756375637563756375637563756375637563756375637563756375637569, rs 729, rs 725637563756375637563756375637563756375637563756375637563756375637563756375637563756375637563756375637563756375637563756375637563756375637563756375637567, rs 729, rs 7256375637563756375637563756375637563756375637563756375637563756375637563756375637563756375637563756375637563756375637563756375637563756375637563756375637563756375637563756375637563756375637567, rs 729, rs 72563756375637563756375637563756375637563756375637563756375637563756375637563756375637563756375637563756375637563756375637563756375637563756375637563756375637563756375637563756375637563756375637563756375637563756375637563756375637563756375637563756375637563756375637563756375637563756375637563756375637563756375637563756375637563756375637563756375637563756375637563756375637563756375637563756375637563756375637563756375637563756375637563756375637563756375637563756375637563756375637563745, rs 1745, rs 729, rs 1745.
6. Use of a SNP marker set as set forth in any one of claims 1 to 5in any one of:
(1) constructing a DNA chip or a multiple PCR genotype analysis kit;
(2) identifying the identity of the people in China, Japan and Korean;
(3) genetic analysis of east Asia population.
7. Use of a primer set or gene chip according to any one of claims 1 to 5in any one of:
(1) constructing a DNA chip or a multiple PCR genotype analysis kit;
(2) identifying the identity of the people in China, Japan and Korean;
(3) genetic analysis of east Asia population.
CN201810688113.3A 2018-06-28 2018-06-28 SNP marker combination for distinguishing Chinese Han nationality, Japan, Korea/Korea population Active CN110656184B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201810688113.3A CN110656184B (en) 2018-06-28 2018-06-28 SNP marker combination for distinguishing Chinese Han nationality, Japan, Korea/Korea population

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201810688113.3A CN110656184B (en) 2018-06-28 2018-06-28 SNP marker combination for distinguishing Chinese Han nationality, Japan, Korea/Korea population

Publications (2)

Publication Number Publication Date
CN110656184A CN110656184A (en) 2020-01-07
CN110656184B true CN110656184B (en) 2022-04-29

Family

ID=69027389

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201810688113.3A Active CN110656184B (en) 2018-06-28 2018-06-28 SNP marker combination for distinguishing Chinese Han nationality, Japan, Korea/Korea population

Country Status (1)

Country Link
CN (1) CN110656184B (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114634988B (en) * 2022-04-28 2022-09-16 贵州医科大学 SNP (Single nucleotide polymorphism) sites and method for identifying and researching biological geographic sources of east Asia population

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR20110078071A (en) * 2009-12-30 2011-07-07 경상북도(관련부서:경상북도산림생태과학원) Snp marker for identifying korea panax ginseng, kit and identifying method using the same
CN102121046A (en) * 2009-06-25 2011-07-13 中国科学院北京基因组研究所 Chinese population linkage analysis single nucleotide polymorphism (SNP) marker sets and use method and application thereof
CN108060237A (en) * 2018-01-12 2018-05-22 四川大学 Forensic medicine composite detection kit based on 55 Y chromosome SNP genetic markers
CN110885888A (en) * 2018-09-07 2020-03-17 中国科学院北京基因组研究所 SNP marker combination for deducing different geographical region populations of Asia
CN111118168A (en) * 2018-11-01 2020-05-08 中国科学院北京基因组研究所 SNP marker combination for deducing main ethnic group of northwest China and adjacent middle and Asia countries

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102121046A (en) * 2009-06-25 2011-07-13 中国科学院北京基因组研究所 Chinese population linkage analysis single nucleotide polymorphism (SNP) marker sets and use method and application thereof
KR20110078071A (en) * 2009-12-30 2011-07-07 경상북도(관련부서:경상북도산림생태과학원) Snp marker for identifying korea panax ginseng, kit and identifying method using the same
CN108060237A (en) * 2018-01-12 2018-05-22 四川大学 Forensic medicine composite detection kit based on 55 Y chromosome SNP genetic markers
CN110885888A (en) * 2018-09-07 2020-03-17 中国科学院北京基因组研究所 SNP marker combination for deducing different geographical region populations of Asia
CN111118168A (en) * 2018-11-01 2020-05-08 中国科学院北京基因组研究所 SNP marker combination for deducing main ethnic group of northwest China and adjacent middle and Asia countries

Non-Patent Citations (4)

* Cited by examiner, † Cited by third party
Title
27重SNP系统推断种族来源的效能;冯杏玲等;《南方医科大学学报》;20170419;第37卷(第4期);全文 *
A panel of 74 AISNPs: Improved ancestry inference within Eastern Asia;Cai-Xia Li et al.;《Forensic Sci Int Genet》;20160404;全文 *
Genetic structure, divergence and admixture of Han Chinese, Japanese and Korean populations;Yuchen Wang et al.;《Hereditas》;20180406;第155卷(第19期);全文 *
用于五大洲际人群区分的SNP体系研究;郝伟琪等;《南方医科大学学报(自然科学版)》;20180315;第38卷(第3期);全文 *

Also Published As

Publication number Publication date
CN110656184A (en) 2020-01-07

Similar Documents

Publication Publication Date Title
CN102682224B (en) Method and device for detecting copy number variations
CN108220413A (en) The fluorescent composite amplification reagent kit of joint-detection people's Y chromosome STR and Indel locus and its application
CN103114150B (en) The method that storehouse order-checking is identified is built with the mononucleotide polymorphism site of Bayesian statistic based on enzyme action
CN108504749A (en) 29 micro- haplotype sites, screening technique, composite amplification system and application
CN106460045B (en) Common copy number variation of human genome for risk assessment of susceptibility to cancer
CN107419017B (en) Method and system for inferring source of five continental ethnic groups of individuals of unknown origin
Snedecor et al. Fast and accurate kinship estimation using sparse SNPs in relatively large database searches
Ramani et al. Ancestry prediction in Singapore population samples using the Illumina ForenSeq kit
CN110444253B (en) Method and system suitable for mixed pool gene positioning
CN110656184B (en) SNP marker combination for distinguishing Chinese Han nationality, Japan, Korea/Korea population
CN107122625B (en) Method for processing high-throughput sequencing information of human short segment tandem repeat sequence
CN110885888B (en) SNP marker combination for deducing different geographical region populations of Asia
CN111292807B (en) Method for analyzing double cells in single-cell transcriptome data
CN106702010A (en) Genetic marker combination, individual gene identity card, two-dimensional code, kit and application thereof
JP5825790B2 (en) Nucleic acid information processing apparatus and processing method thereof
Sherier et al. Determining informative microbial single nucleotide polymorphisms for human identification
CN111118168B (en) SNP marker combination for deducing main ethnic group of northwest China and adjacent middle and Asia countries
CN113832236A (en) Primer group and kit for identifying sika deer, red deer and hybrid deer and application
CN107862177B (en) Construction method of single nucleotide polymorphism molecular marker set for distinguishing carp populations
CN106566873B (en) Molecular marker and kit for forensic individual identification
CN112233724A (en) Ancestral polymorphism prediction method based on big data artificial intelligence algorithm
Yahya et al. Analysis of the genetic structure of the Malay population: Ancestry-informative marker SNPs in the Malay of Peninsular Malaysia
US20140019062A1 (en) Nucleic Acid Information Processing Device and Processing Method Thereof
CN114530200B (en) Mixed sample identification method based on calculation of SNP entropy
KR102397822B1 (en) Apparatus and method for analyzing cells using chromosome structure and state information

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant