CN110021363A - Device and method for constructing a user-friendly chromosomal gene variation map - Google Patents

Device and method for constructing a user-friendly chromosomal gene variation map

Info

Publication number
CN110021363A
Authority
CN
China
Prior art keywords
gene
submodule
information
genetic mutation
map
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201711423213.5A
Other languages
Chinese (zh)
Other versions
CN110021363B (en
Inventor
玄兆伶
李大为
梁峻彬
陈重建
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
ANNOROAD (YIWU) MEDICAL INSPECTION Co.,Ltd.
Anouta gene technology (Beijing) Co.,Ltd.
Original Assignee
ANNOROAD GENETIC TECHNOLOGY (BEIJING) Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by ANNOROAD GENETIC TECHNOLOGY (BEIJING) Co Ltd
Priority to CN201711423213.5A
Publication of CN110021363A
Application granted
Publication of CN110021363B
Legal status: Active
Anticipated expiration

Classifications

    • G PHYSICS
    • G16 INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16B BIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B25/00 ICT specially adapted for hybridisation; ICT specially adapted for gene or protein expression
    • G PHYSICS
    • G16 INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16B BIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B45/00 ICT specially adapted for bioinformatics-related data visualisation, e.g. displaying of maps or networks

Abstract

The present invention relates to a device and a method for constructing a user-friendly chromosomal gene variation map. The device comprises a data acquisition module, a data preparation module, a gene segment map drawing module, a chromosome map drawing module, and a gene variation information labeling module. The device and method can automatically, accurately, intuitively, and aesthetically display the specific variations of any gene on an entire chromosome.

Description

Device and method for constructing a user-friendly chromosomal gene variation map
Technical field
The invention belongs to the field of genetic testing, and in particular relates to a method and a device for constructing a user-friendly chromosomal gene variation map.
Background art
Genetic testing is a technique that examines DNA obtained from blood, other body fluids, or cells. It can be used to diagnose disease and to predict disease risk. Typically, exfoliated oral mucosal cells or other tissue cells are collected from the person being tested, the genetic material is amplified, and the DNA in the cells is analyzed with dedicated equipment. The results are used to estimate the person's risk of disease and to analyze the genetic characteristics they carry, so that people can understand their own genetic information and avoid or delay the onset of disease by improving their living environment and habits.
With the development of next-generation sequencing (NGS), NGS-based genetic testing has advanced rapidly. It can detect changes in the primary structure of DNA that arise when internal or external factors alter the base composition or base order of a gene's DNA sequence. These changes mainly include single-base changes (single nucleotide variants, SNV), insertions and deletions of large or small sequence fragments (Insertion & Deletion, InDel), copy number variants of sequence fragments (CNV), and structural variants of the sequence (SV).
Genetic testing laboratories usually report results to users in the form of a gene variation map. For whole-genome (chromosome-level) testing, existing map construction methods focus on showing the overall variation of the whole genome or the chromosomal location of a gene. No existing method shows, accurately and intuitively in a genetic test report, which specific variations have occurred in a given gene. As a result, readers of the report cannot directly obtain the relevant information about the test results, which is unfriendly to the user.
Summary of the invention
In view of the above deficiencies of the prior art, the object of the present invention is to provide a device and a method for constructing a user-friendly chromosomal gene variation map that can automatically, accurately, intuitively, and aesthetically display the specific variations of any gene on an entire chromosome. In addition, the test results can be rendered as a color image, so that the data are easier to recognize visually and the results are more readable.
The inventors studied this technical problem intensively and found that, by using optimized information labeling rules, the information to be labeled can be arranged sensibly within a limited display space, which solves the problem.
That is, the present invention includes:
1. A device for constructing a user-friendly chromosomal gene variation map, comprising:
A data acquisition module for obtaining gene variation detection data, gene information, and chromosome G-band data. Here, the gene variation detection data include, for example, the SNP or indel variation information obtained from raw sequencing data after processing, alignment, variant calling, and annotation. The gene information includes, for example, all transcripts of each gene as provided by the RefSeq database, together with the chromosome, exon number, start position, and end position. The chromosome G-band data are defined as follows: after a chromosome is treated with a fluorescent dye, bands of differing width and brightness can be observed along its long axis under a fluorescence microscope; this band information is converted into a file that electronically records the absolute position and interval of each band on the chromosome.
A data preparation module, connected to the data acquisition module, for matching the transcript of the input gene, extracting all gene variation detection data located in the exons of that transcript and within 15 to 30 bp (preferably 20 bp) around each exon, and outputting the data in a specified format. Here, a transcript is one of the one or more mature protein-coding mRNAs that a gene produces by transcription. A gene may have several transcripts and therefore several transcript IDs, so the input data must specify the transcript ID of the gene to be plotted. The specified format means that the extracted information is arranged in the form in which it is to be shown in the figure, for example "Exon 9, heterozygous: c.6513G>C:p.V2171V", and written to a temporary file named, for example, gene_mutpos.txt. All temporary files are deleted once the algorithm has finished running.
A gene segment map drawing module, connected to the data preparation module, for converting all exon lengths into ratios according to the gene segment information and adding an intron segment of equal length after each exon. Here, "drawing" means that a script in the device draws the figure and, once finished, stores it in a file, for example in PNG format; opening the file displays the figure, for example on a monitor. The gene segment information includes, for example, the gene transcript, exon ID, chromosome ID, start position, and end position. Lengths are converted into ratios because the introns of a gene are many times longer than the exons, so if the gene were drawn to scale the exons would be invisible to the naked eye. If only the variations in the exons and within 20 bp around them are to be shown, all intron regions can be replaced by segments of one equal length. If the whole canvas is treated as a 1 × 1 canvas, each position in an exon region can be converted into its ratio on the canvas: actual position / (total length of all exon regions + intron length × number of introns).
A chromosome map drawing module, connected to the gene segment map drawing module, for marking the chromosome G-bands in different colors, converting the lengths of all G-band segments into ratios, determining whether each segment lies on the p arm or the q arm, drawing the chromosome on which the gene is located, and marking the position of the gene on that chromosome. Here, one figure can be made per gene: the chromosome is drawn first, and the position of the gene on the chromosome is then marked. And
A gene variation information labeling module, connected to the data preparation module, the gene segment map drawing module, and the chromosome map drawing module, comprising:
Submodule A, for determining whether the gene has any gene variation sites. If so, it outputs an instruction to start submodule B below; if not, no gene variation information is labeled, and only the chromosome map and the gene segment map are output as the final result.
Submodule B, connected to submodule A, for determining whether the current plotting space is sufficient to hold the information for all gene variation sites. If so, it outputs an instruction to start submodule C below; if not, an error message pops up stating that there are too many gene variation sites to plot. When this error is reported, some sites can be filtered out first and the plot made again. The number of plotted sites is preferably 50 or fewer, and the sites retained after filtering are preferably those the user cares about or those related to disease or prognosis.
Submodule C, connected to submodule B and submodule H, for determining whether the current point lies within the gene. If so, it outputs an instruction to start submodule D below; if not, a warning pops up stating that the current point is not within the range of the gene, and it outputs an instruction to start submodule H below. Here, the space currently used to present the figure for the gene variation detection results is called the current plotting space. In general one figure is made per gene, and the content that one figure can show is limited; that is, the plotting space is limited (the device of the invention emphasizes the user's experience, and the plotting space generally refers to the genetic test report). The upper limit on the number of variation entries to draw can be set according to the pixel size and font size of the map drawn by the device. To keep the content clear, the layout attractive, and the figure legible, at most about 50 variation entries can be drawn in one figure; if there are more than 50, the figure need not be drawn. On the other hand, almost no patient has more than 15 variant sites in a single gene, so a limit of 50 essentially guarantees that all samples can be plotted. During plotting, the variant site entries are drawn one at a time in positional order. The entry being drawn is the current point; the entry drawn most recently before it is the previous point; the entry to be drawn immediately after it is the next point; and the remaining points are all entries that have not yet been drawn.
Submodule D, connected to submodule C and submodule H, for determining whether the current space is sufficient to hold the current point and the remaining points. If so, submodule E is started; if not, the gene variation information of the current point is labeled after moving straight up by the specified distance, and an instruction is output to start submodule H below.
Submodule E, connected to submodule D and submodule H, for determining whether the current point is so close to the previous point that the labels of the two points would overlap. If so, the gene variation information of the current point is labeled at the specified distance below the position of the previous point, and an instruction is output to start submodule H below; if not, an instruction is output to start submodule G below. Here, "especially close" and "especially far" are both determined by comparing the distance between two points with a preset value. If the distance is less than a preset value (for example, 0.01), the points are "especially close"; if it is greater than a preset value (for example, 0.1), they are "especially far". The preset values can be set as needed.
Submodule G, connected to submodule E and submodule H, for determining whether the current point is especially far from the previous point and especially close to the next point. If so, the gene variation information of the current point is labeled after moving up by the specified distance, and an instruction is output to start submodule H below; if not, the gene variation information of the current point is labeled directly at the current position, and an instruction is output to start submodule H below.
And
Submodule H, for determining whether the current point is the last gene variation site. If so, labeling ends and the result of the above decisions is output as the final result; if not, it moves to the next variation site of the gene and outputs an instruction to start submodule C above.
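The decision flow of submodules A to H can be sketched in Python as follows. This is a minimal illustration under stated assumptions, not the patented implementation: positions are ratios on a 0 to 1 axis where "down" means a larger value, the names `Site`, `place_labels`, `NEAR`, `FAR`, `STEP`, and `MAX_SITES` are hypothetical, and the "enough space" test of submodule D is an assumed rule, since the text does not say how it is computed. The threshold values are the example values given in the text.

```python
from dataclasses import dataclass

NEAR = 0.01      # "especially close" threshold (example value from the text)
FAR = 0.1        # "especially far" threshold (example value from the text)
STEP = 0.01      # "specified distance" for shifting a label
MAX_SITES = 50   # preferred upper limit on plotted sites


@dataclass
class Site:
    pos: float   # position of the variant as a ratio along the gene (0..1)
    text: str    # label text, e.g. "Exon 9, heterozygous: ..."


def place_labels(sites, gene_start=0.0, gene_end=1.0):
    """Return (placements, warnings); placements is a list of (text, label_pos)."""
    if not sites:                                   # submodule A: nothing to label
        return [], []
    if len(sites) > MAX_SITES:                      # submodule B: too many sites
        raise ValueError("too many gene variation sites to plot")

    ordered = sorted(sites, key=lambda s: s.pos)    # draw in positional order
    placements, warnings = [], []
    prev_pos = prev_label = None

    for i, cur in enumerate(ordered):
        if not gene_start <= cur.pos <= gene_end:   # submodule C: outside the gene
            warnings.append(f"{cur.text}: not within the gene range")
            continue                                # go straight to submodule H
        n_left = len(ordered) - i - 1               # points still to be drawn
        nxt = ordered[i + 1].pos if n_left else None

        # Submodule D (assumed rule): labels spaced STEP apart must fit
        # between the current position and the end of the axis.
        if cur.pos + n_left * STEP > gene_end:
            label = cur.pos - STEP                  # move straight up
        # Submodule E: too close to the previous point, labels would overlap.
        elif prev_pos is not None and cur.pos - prev_pos < NEAR:
            label = prev_label + STEP               # place below the previous label
        else:
            # Submodule G: far from the previous point, close to the next one.
            # Assumption: the first point counts as "far" from its predecessor.
            far_prev = prev_pos is None or cur.pos - prev_pos > FAR
            near_next = nxt is not None and nxt - cur.pos < NEAR
            label = cur.pos - STEP if far_prev and near_next else cur.pos

        placements.append((cur.text, round(label, 4)))
        prev_pos, prev_label = cur.pos, label       # submodule H: advance
    return placements, warnings
```

With two tight pairs of sites, for example at 0.10 and 0.105, the first label of the pair is shifted up and the second is placed one step below it, so the two labels no longer collide.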
2. A method for constructing a user-friendly chromosomal gene variation map, comprising:
Data acquisition: obtaining gene variation detection data, gene information, and chromosome G-band data. Here, the gene variation detection data include, for example, the SNP or indel variation information obtained from raw sequencing data after processing, alignment, variant calling, and annotation. The gene information includes, for example, all transcripts of each gene as provided by the RefSeq database, together with the chromosome, exon number, start position, and end position. The chromosome G-band data are defined as follows: after a chromosome is treated with a fluorescent dye, bands of differing width and brightness can be observed along its long axis under a fluorescence microscope; this band information is converted into a file that electronically records the absolute position and interval of each band on the chromosome.
Data preparation: matching the transcript of the input gene, extracting all gene variation detection data located in the exons of that transcript and within 15 to 30 bp (preferably 20 bp) around each exon, and outputting the data in a specified format. Here, a transcript is one of the one or more mature protein-coding mRNAs that a gene produces by transcription. A gene may have several transcripts and therefore several transcript IDs, so the input data must specify the transcript ID of the gene to be plotted. The specified format means that the extracted information is arranged in the form in which it is to be shown in the figure, for example "Exon 9, heterozygous: c.6513G>C:p.V2171V", and written to a temporary file named, for example, gene_mutpos.txt. All temporary files are deleted once the algorithm has finished running.
Gene segment map drawing: converting all exon lengths into ratios according to the gene segment information and adding an intron segment of equal length after each exon. For example, in Fig. 1 and Fig. 2 each color-graded rectangle represents one exon, and the blank between two exons represents an intron. The exons are drawn in order of ID as successively lighter rectangles of the same color family, which produces a visible color gradient; using a gradient color scheme aids visual perception and improves the appearance. Here, "drawing" means that a script in the device draws the figure and, once finished, stores it in a file, for example in PNG format; opening the file displays the figure, for example on a monitor. The gene segment information includes, for example, the gene transcript, exon ID, chromosome ID, start position, and end position. Lengths are converted into ratios because the introns of a gene are many times longer than the exons, so if the gene were drawn to scale the exons would be invisible to the naked eye. If only the variations in the exons and within 20 bp around them are to be shown, all intron regions can be replaced by segments of one equal length. If the whole canvas is treated as a 1 × 1 canvas, each position in an exon region can be converted into its ratio on the canvas: actual position / (total length of all exon regions + intron length × number of introns).
Chromosome map drawing: marking the chromosome G-bands in different colors, converting the lengths of all G-band segments into ratios, determining whether each segment lies on the p arm or the q arm, drawing the chromosome on which the gene is located, and marking the position of the gene on that chromosome. For example, the left part of Fig. 1 and Fig. 2 represents the chromosome; each color-graded rectangle or semicircle represents one G-band of the chromosome, and the bands are drawn in order of absolute position. Here, one figure can be made per gene: the chromosome is drawn first, and the position of the gene on the chromosome is then marked. And
Gene variation information labeling, comprising:
Step A: determining whether the gene has any gene variation sites. If so, step B below is carried out; if not, no gene variation information is labeled, and only the chromosome map and the gene segment map are output as the final result.
Step B: determining whether the current plotting space is sufficient to hold the information for all gene variation sites. If so, step C below is carried out; if not, an error message pops up stating that there are too many gene variation sites to plot. When this error is reported, some sites can be filtered out first and the plot made again. The number of plotted sites is preferably 50 or fewer, and the sites retained after filtering are preferably those the user cares about or those related to disease or prognosis.
Step C: determining whether the current point lies within the gene. If so, step D below is carried out; if not, a warning pops up stating that the current point is not within the range of the gene, and step H below is carried out. Here, the space currently used to present the figure for the gene variation detection results is called the current plotting space. In general one figure is made per gene, and the content that one figure can show is limited; that is, the plotting space is limited. To keep the content clear, the layout attractive, and the figure legible, at most about 50 variation entries can be drawn in one figure; if there are more than 50, the figure need not be drawn. On the other hand, almost no patient has more than 15 variant sites in a single gene, so a limit of 50 essentially guarantees that all samples can be plotted. During plotting, the variant site entries are drawn one at a time in positional order. The entry being drawn is the current point; the entry drawn most recently before it is the previous point; the entry to be drawn immediately after it is the next point; and the remaining points are all entries that have not yet been drawn.
Step D: determining whether the current space is sufficient to hold the current point and the remaining points. If so, step E is carried out; if not, the gene variation information of the current point is labeled after moving straight up by the specified distance, and step H below is carried out.
Step E: determining whether the current point is so close to the previous point that the labels of the two points would overlap. If so, the gene variation information of the current point is labeled at the specified distance below the position of the previous point, and step H below is carried out; if not, step G below is carried out. Here, "especially close" and "especially far" are both determined by comparing the distance between two points with a preset value. If the distance is less than a preset value (for example, 0.01), the points are "especially close"; if it is greater than a preset value (for example, 0.1), they are "especially far". The preset values can be set as needed.
Step G: determining whether the current point is especially far from the previous point and especially close to the next point. If so, the gene variation information of the current point is labeled after moving up by the specified distance, and step H below is carried out; if not, the gene variation information of the current point is labeled directly at the current position, and step H below is carried out. And
Step H: determining whether the current point is the last gene variation site. If so, labeling ends and the result of the above decisions is output as the final result; if not, the method moves to the next variation site of the gene and returns to step C above.
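The length-to-ratio conversion used when drawing the gene segment map can be illustrated with a short Python sketch. It assumes inclusive genomic coordinates, one fixed-length intron placeholder between consecutive exons, and a hypothetical function name `exon_ratios`; the text does not specify these details, so treat it as one possible reading of the formula "actual position / (total length of all exon regions + intron length × number of introns)".

```python
def exon_ratios(exons, intron_len):
    """Map each exon to a (start_ratio, end_ratio) pair on a canvas of width 1.

    exons: list of (start, end) genomic coordinates, sorted and inclusive.
    intron_len: the single length that replaces every intron.
    """
    lengths = [end - start + 1 for start, end in exons]
    n_introns = max(len(exons) - 1, 0)
    # Denominator of the formula: all exon lengths plus the placeholder introns.
    total = sum(lengths) + intron_len * n_introns

    ratios = []
    offset = 0  # position in the compressed coordinate system
    for length in lengths:
        ratios.append((offset / total, (offset + length) / total))
        offset += length + intron_len  # skip one equal-length intron
    return ratios


# Three exons of 100, 300, and 100 bp separated by long introns.
print(exon_ratios([(100, 199), (1000, 1299), (5000, 5099)], intron_len=50))
```

Because every intron is replaced by the same short placeholder, the middle exon here occupies half of the canvas even though it covers only a small part of the genomic interval.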
Effects of the invention
The device and method of the present invention for constructing a user-friendly chromosomal gene variation map can automatically, accurately, intuitively, and aesthetically display the specific variations of any gene on an entire chromosome. In addition, the test results can be rendered as a color image, so that the data are easier to recognize visually and the results are more readable.
Brief description of the drawings
Fig. 1 shows the user-friendly chromosomal gene variation map of the BRCA1 variation detection data of sample VB01562 obtained in Example 1.
Fig. 2 shows the user-friendly chromosomal gene variation map of the BRCA2 variation detection data of sample VB01562 obtained in Example 1.
Detailed description of embodiments
A BRCA1/2 user-friendly chromosomal gene variation map construction device according to the present invention was used to construct chromosomal gene variation maps from the variation data of a sample (sample ID VB01562). The device comprises:
A data acquisition module for obtaining gene variation detection data, gene information, and chromosome G-band data.
Here, the gene variation detection data include, for example, the SNP or indel variation information obtained from raw sequencing data after processing, alignment, variant calling, and annotation; the gene information includes, for example, all transcripts of each gene as provided by the RefSeq database, together with the chromosome, exon number, start position, and end position.
A data preparation module, connected to the data acquisition module, for matching the transcript of the input gene, extracting all gene variation detection data located in the exons of that transcript and within 20 bp around each exon, and outputting the data in a specified format.
Here, the specified format means that the extracted information is arranged in the form in which it is to be shown in the figure, for example "Exon 9, heterozygous: c.6513G>C:p.V2171V", and written to a temporary file named, for example, gene_mutpos.txt. All temporary files are deleted once the algorithm has finished running.
A gene segment map drawing module, connected to the data preparation module, for drawing the gene segment map, wherein all exon lengths are converted into ratios according to the gene segment information and an intron segment of equal length is added after each exon.
A chromosome map drawing module, connected to the data preparation module and to the gene segment map drawing module, for drawing the chromosome map, wherein the chromosome G-bands are marked in different colors, the lengths of all G-band segments are converted into ratios, each segment is determined to lie on the p arm or the q arm, the chromosome on which the gene is located is drawn, and the position of the gene is marked on that chromosome. And
A gene variation information labeling module, connected to the data preparation module, the gene segment map drawing module, and the chromosome map drawing module, comprising:
Submodule A, for determining whether the gene has any gene variation sites. If so, it outputs an instruction to start submodule B below; if not, no gene variation information is labeled, and only the chromosome map and the gene segment map are output as the final result.
Submodule B, connected to submodule A, for determining whether the current plotting space is sufficient to hold the information for all gene variation sites. If so, it outputs an instruction to start submodule C below; if not, it reports that there are too many gene variation sites to plot.
Submodule C, connected to submodule B and submodule H, for determining whether the current point lies within the gene. If so, it outputs an instruction to start submodule D below; if not, a warning pops up stating that the current point is not within the range of the gene, and it outputs an instruction to start submodule H below.
Submodule D, connected to submodule C and submodule H, for determining whether the current space is sufficient to hold the current point and the remaining points. If so, it outputs an instruction to start submodule E below; if not, the gene variation information of the current point is labeled after moving straight up by the specified distance, and an instruction is output to start submodule H below.
Submodule E, connected to submodule D and submodule H, for determining whether the current point is so close to the previous point that the labels of the two points would overlap. If so, the gene variation information of the current point is labeled at the specified distance below the position of the previous point, and an instruction is output to start submodule H below; if not, an instruction is output to start submodule G below.
Here, "especially close" and "especially far" are both determined by comparing the distance between two points with a preset value: if the distance is less than the preset value 0.01, the points are "especially close"; if it is greater than the preset value 0.1, they are "especially far". The specified distance is 0.01.
Submodule G, connected to submodule E and submodule H, for determining whether the current point is especially far from the previous point and especially close to the next point. If so, the gene variation information of the current point is labeled after moving up by the specified distance, and an instruction is output to start submodule H below; if not, the gene variation information of the current point is labeled directly at the current position, and an instruction is output to start submodule H below. And
Submodule H, for determining whether the current point is the last gene variation site. If so, labeling ends and the result produced by the above submodules is output as the final result; if not, it moves to the next gene variation site and outputs an instruction to start submodule C above.
After the chromosomal gene variation maps were constructed with the above BRCA1/2 user-friendly chromosomal gene variation map construction device of the invention, we obtained the visualization files VB01562_BRCA1.png (see Fig. 1) and VB01562_BRCA2.png (see Fig. 2) for the BRCA1 and BRCA2 variation detection data of sample VB01562. File VB01562_BRCA1.png shows that no variation was detected in the BRCA1 exons or within 20 bp around them, so only the chromosome map and the gene segment map were output as the final result. File VB01562_BRCA2.png shows that 6 variations were detected in the BRCA2 exons and within 20 bp around them.
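The data preparation used in this embodiment, which keeps only variants in an exon or within 20 bp of one and formats them as label lines, can be sketched as follows. The record layout (dictionary keys `pos`, `zygosity`, `cdna`, `protein`), the function name `variants_near_exons`, and the coordinates in the usage example are all hypothetical; the patent does not describe the input file format, so this is only an illustration of the filtering rule and of the example label format.

```python
FLANK = 20  # bp kept on each side of an exon (preferred value in the text)


def variants_near_exons(variants, exons, flank=FLANK):
    """Return one label line per variant lying in an exon or within `flank` bp of it.

    variants: list of dicts with keys 'pos', 'zygosity', 'cdna', 'protein'.
    exons: list of (exon_number, start, end) for the chosen transcript.
    """
    lines = []
    for v in sorted(variants, key=lambda v: v["pos"]):  # positional order
        for exon_no, start, end in exons:
            if start - flank <= v["pos"] <= end + flank:
                lines.append(
                    f"Exon {exon_no}, {v['zygosity']}: {v['cdna']}:{v['protein']}"
                )
                break  # report each variant once
    return lines


# Made-up coordinates; only the label text follows the example in the description.
exons = [(9, 1000, 1200)]
variants = [
    {"pos": 1210, "zygosity": "heterozygous", "cdna": "c.6513G>C", "protein": "p.V2171V"},
    {"pos": 1500, "zygosity": "heterozygous", "cdna": "c.1A>T", "protein": "p.M1L"},
]
for line in variants_near_exons(variants, exons):
    print(line)
```

In a full pipeline these lines would be written to the temporary file (for example gene_mutpos.txt) that the labeling module reads.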
Industrial applicability
The present invention provides a device and a method for constructing a user-friendly chromosomal gene variation map that can automatically, accurately, intuitively, and aesthetically display the specific variations of any gene on an entire chromosome.

Claims (6)

1. A device for constructing a user-friendly chromosomal gene variation map, comprising:
a data acquisition module, for obtaining gene variation detection data, gene information, and chromosome G-banding data;
a data preparation module, connected to the data acquisition module, for matching the transcript of the input gene, extracting all gene variation detection data within the exons of the transcript and within 15 to 30 bp, preferably 20 bp, around the exons, and arranging and outputting the gene variation detection data in a specified format;
a gene segment figure drawing module, connected to the data preparation module, for converting the lengths of all exons into proportions according to the gene segment information, while adding one equal-length intron segment after each exon as the exons are accumulated;
a chromosome map drawing module, connected to the gene segment figure drawing module, for marking the chromosome G-bands in different colors, converting the lengths of all G-band segments into proportions, judging whether each segment lies on the p arm or the q arm, drawing a figure of the chromosome on which the gene is located, and marking the position of the gene on that figure;
and
a gene variation information labeling module, connected to the data preparation module, the gene segment figure drawing module, and the chromosome map drawing module, comprising:
submodule A, for judging whether the gene has any gene variation site; if so, outputting an instruction to start submodule B below; if not, labeling no gene variation information and outputting only the chromosome map and the gene segment figure as the final result;
submodule B, connected to submodule A, for judging whether the current drawing space is sufficient to place the information of all gene variation sites; if so, outputting an instruction to start submodule C below; if not, popping up an error message stating that there are too many gene variation sites for the map to be drawn;
submodule C, connected to submodule B and submodule H, for judging whether the current point lies within the gene; if so, outputting an instruction to start submodule D below; if not, popping up a warning stating that the current point is not within the range of the gene, and outputting an instruction to start submodule H below;
submodule D, connected to submodule C and submodule H, for judging whether the current space is sufficient to place the current point and the remaining points; if so, proceeding to submodule E; if not, moving directly up by a specified distance to label the gene variation information of the current point, and outputting an instruction to start submodule H below;
submodule E, connected to submodule D and submodule H, for judging whether the current point is so close to the previous point that the label information of the two points would overlap; if so, moving down by a specified distance from the position of the previous point to label the gene variation information of the current point, and outputting an instruction to start submodule H below; if not, outputting an instruction to start submodule G below;
submodule G, connected to submodule E and submodule H, for judging whether the current point is very far from the previous point and very close to the next point; if so, moving up by a specified distance to label the gene variation information of the current point, and outputting an instruction to start submodule H below; if not, labeling the gene variation information of the current point directly at the current position, and outputting an instruction to start submodule H below;
and
submodule H, for judging whether the current point is the last gene variation site; if so, labeling ends and the result obtained from the above judging process is output as the final result; if not, skipping to the next variation site of the gene and outputting an instruction to start submodule C above.
2. The device for constructing a user-friendly chromosomal gene variation map according to claim 1, wherein the data acquisition module is used to obtain gene variation detection data, gene information, and chromosome G-banding data,
the gene variation detection data comprise SNP or indel variation information obtained from raw sequencing data after processing, alignment, variation detection by a mutation algorithm, and annotation;
the gene information comprises all transcripts of each gene provided by the RefSeq database, together with the chromosome on which each is located, the exon numbers, and the start and end positions.
3. The device for constructing a user-friendly chromosomal gene variation map according to claim 1, wherein "very close" means that the distance between two points is less than 0.01, and "very far" means that the distance between two points is greater than 0.1.
4. A method for constructing a user-friendly chromosomal gene variation map, comprising:
data acquisition: obtaining gene variation detection data, gene information, and chromosome G-banding data;
data preparation: matching the transcript of the input gene, extracting all gene variation detection data within the exons of the transcript and within 15 to 30 bp, preferably 20 bp, around the exons, and arranging and outputting the gene variation detection data in a specified format;
gene segment figure drawing: converting the lengths of all exons into proportions according to the gene segment information, while adding one equal-length intron segment after each exon as the exons are accumulated;
chromosome map drawing: marking the chromosome G-bands in different colors, converting the lengths of all G-band segments into proportions, judging whether each segment lies on the p arm or the q arm, drawing a figure of the chromosome on which the gene is located, and marking the position of the gene on that figure;
and
gene variation information labeling, comprising:
step A: judging whether the gene has any gene variation site; if so, carrying out step B below; if not, labeling no gene variation information and outputting only the chromosome map and the gene segment figure as the final result;
step B: judging whether the current drawing space is sufficient to place the information of all gene variation sites; if so, carrying out step C below; if not, popping up an error message stating that there are too many gene variation sites for the map to be drawn;
step C: judging whether the current point lies within the gene; if so, carrying out step D below; if not, popping up a warning stating that the current point is not within the range of the gene, and carrying out step H below;
step D: judging whether the current space is sufficient to place the current point and the remaining points; if so, carrying out step E; if not, moving directly up by a specified distance to label the gene variation information of the current point, and carrying out step H below;
step E: judging whether the current point is so close to the previous point that the label information of the two points would overlap; if so, moving down by a specified distance from the position of the previous point to label the gene variation information of the current point, and carrying out step H below; if not, carrying out step G below;
step G: judging whether the current point is very far from the previous point and very close to the next point; if so, moving up by a specified distance to label the gene variation information of the current point, and carrying out step H below; if not, labeling the gene variation information of the current point directly at the current position, and carrying out step H below; and
step H: judging whether the current point is the last gene variation site; if so, labeling ends and the result obtained from the above judging process is output as the final result; if not, skipping to the next variation site of the gene and carrying out step C above.
5. The method for constructing a user-friendly chromosomal gene variation map according to claim 4, wherein the data acquisition module is used to obtain gene variation detection data, gene information, and chromosome G-banding data,
the gene variation detection data comprise SNP or indel variation information obtained from raw sequencing data after processing, alignment, variation detection by a mutation algorithm, and annotation;
the gene information comprises all transcripts of each gene provided by the RefSeq database, together with the chromosome on which each is located, the exon numbers, and the start and end positions.
6. The device for constructing a user-friendly chromosomal gene variation map according to claim 4, wherein "very close" means that the distance between two points is less than 0.01, and "very far" means that the distance between two points is greater than 0.1.
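The labeling decisions of steps A to H (submodules A to H) can be read as a single loop over the variation sites. The sketch below is one possible reading under stated assumptions, not the implementation disclosed in the patent. Assumptions introduced here: positions are normalized offsets from the top of the gene figure in the range 0 to 1 and are sorted in ascending order, "up" means a smaller offset, the "specified distance" is a hypothetical `step` parameter, the space checks in steps B and D use a simple `count * step` rule, and the first point is treated as very far from a nonexistent previous point. Only the 0.01 and 0.1 thresholds come from the claims.

```python
import math
import warnings
from typing import List, Sequence, Tuple

NEAR = 0.01  # "very close" threshold stated in claims 3 and 6
FAR = 0.1    # "very far" threshold stated in claims 3 and 6


def place_labels(points: Sequence[float], step: float = 0.03,
                 near: float = NEAR, far: float = FAR) -> List[Tuple[float, float]]:
    """Return (point, label_position) pairs for sorted, normalized points."""
    if not points:                        # Step A: no variation sites, no labels
        return []
    if len(points) * step > 1.0:          # Step B: assumed capacity rule
        raise ValueError("too many gene variation sites to draw the map")

    placed: List[Tuple[float, float]] = []
    prev_t, prev_label = -math.inf, -math.inf
    for i, t in enumerate(points):
        if not 0.0 <= t <= 1.0:           # Step C: point outside the gene
            warnings.warn(f"point {t} is not within the range of the gene")
            continue                      # Step H: go on to the next site
        remaining = len(points) - i       # current point plus the ones left
        nxt = points[i + 1] if i + 1 < len(points) else math.inf
        if (1.0 - t) < remaining * step:  # Step D: not enough space (assumed rule)
            label = t - step              # move up by the specified distance
        elif t - prev_t < near:           # Step E: labels would overlap
            label = prev_label + step     # move down from the previous label
        elif t - prev_t > far and nxt - t < near:  # Step G
            label = t - step              # move up to leave room for the next label
        else:
            label = t                     # label directly at the current position
        placed.append((t, label))
        prev_t, prev_label = t, label     # Step H: advance to the next site
    return placed
```

For example, calling `place_labels([0.10, 0.105, 0.40, 0.405, 0.70])` shifts the first label of each close pair upward and places the second label one `step` below it, while the isolated last point is labeled in place.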
CN201711423213.5A 2017-12-25 2017-12-25 Device and method for constructing user-friendly chromosome gene variation map Active CN110021363B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201711423213.5A CN110021363B (en) 2017-12-25 2017-12-25 Device and method for constructing user-friendly chromosome gene variation map

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201711423213.5A CN110021363B (en) 2017-12-25 2017-12-25 Device and method for constructing user-friendly chromosome gene variation map

Publications (2)

Publication Number Publication Date
CN110021363A (en) 2019-07-16
CN110021363B (en) 2021-01-15

Family

ID=67187019

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201711423213.5A Active CN110021363B (en) 2017-12-25 2017-12-25 Device and method for constructing user-friendly chromosome gene variation map

Country Status (1)

Country Link
CN (1) CN110021363B (en)

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20040241730A1 (en) * 2003-04-04 2004-12-02 Zohar Yakhini Visualizing expression data on chromosomal graphic schemes
US20130102476A1 (en) * 2010-01-08 2013-04-25 Douglas Hurd Combined cgh & allele specific hybridisation method
CN103955630A (en) * 2014-03-26 2014-07-30 田埂 Method for preparing reference database and performing target area sequence alignment on to-be-tested free nucleic acid samples
US20140229117A1 (en) * 2010-10-13 2014-08-14 Complete Genomics, Inc. Methods for estimating genome-wide copy number variations
CN106520940A (en) * 2016-11-04 2017-03-22 深圳华大基因研究院 Chromosomal aneuploid and copy number variation detecting method and application thereof
CN107133494A (en) * 2017-04-21 2017-09-05 天津大学 A kind of new analysis biological genome copies the method for visualizing of number variation
CN107194208A (en) * 2017-04-25 2017-09-22 北京荣之联科技股份有限公司 A kind of genetic analysis annotates method and apparatus

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
JIN ZHANG ET AL.: ""INTEGRATE-Vis: a tool for comprehensive gene fusion visualization"", 《SCIENTIFIC REPORTS》 *
Su Shan: "Fractal Graphics and Gene Sequence Visualization", China Master's Theses Full-text Database, Basic Sciences *

Also Published As

Publication number Publication date
CN110021363B (en) 2021-01-15

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
TA01 Transfer of patent application right

Effective date of registration: 20201201

Address after: 322000 3rd floor, building 9, standard factory building, No. 10, Gaoxin Road, chuojiang street, Yiwu City, Jinhua City, Zhejiang Province

Applicant after: ANNOROAD (YIWU) MEDICAL INSPECTION Co.,Ltd.

Applicant after: Anouta gene technology (Beijing) Co.,Ltd.

Address before: 100176 Beijing City, Daxing District branch of Beijing economic and Technological Development Zone Street 88 Hospital No. 8 Building 2 unit 701 room

Applicant before: Anouta gene technology (Beijing) Co.,Ltd.

GR01 Patent grant
TA01 Transfer of patent application right

Effective date of registration: 20210104

Address after: 322000 3rd floor, building 9, standard workshop, No.10 Gaoxin Road, Houjiang street, Yiwu City, Jinhua City, Zhejiang Province

Applicant after: ANNOROAD (YIWU) MEDICAL INSPECTION Co.,Ltd.

Applicant after: Anouta gene technology (Beijing) Co.,Ltd.

Address before: Room 701, unit 2, building 8, yard 88, Kechuang 6th Street, Daxing District, Beijing 100176

Applicant before: Anouta gene technology (Beijing) Co.,Ltd.
