CN107133494A - A kind of new analysis biological genome copies the method for visualizing of number variation - Google Patents

A kind of new analysis biological genome copies the method for visualizing of number variation Download PDF

Info

Publication number
CN107133494A
CN107133494A CN201710266656.1A CN201710266656A CN107133494A CN 107133494 A CN107133494 A CN 107133494A CN 201710266656 A CN201710266656 A CN 201710266656A CN 107133494 A CN107133494 A CN 107133494A
Authority
CN
China
Prior art keywords
gene
chromosome
visualizing
number variation
sample
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201710266656.1A
Other languages
Chinese (zh)
Inventor
宋凯
毕家豪
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Tianjin University
Original Assignee
Tianjin University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Tianjin University filed Critical Tianjin University
Priority to CN201710266656.1A priority Critical patent/CN107133494A/en
Publication of CN107133494A publication Critical patent/CN107133494A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B20/00ICT specially adapted for functional genomics or proteomics, e.g. genotype-phenotype associations
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B45/00ICT specially adapted for bioinformatics-related data visualisation, e.g. displaying of maps or networks

Landscapes

  • Life Sciences & Earth Sciences (AREA)
  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Health & Medical Sciences (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • General Health & Medical Sciences (AREA)
  • Spectroscopy & Molecular Physics (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Biotechnology (AREA)
  • Evolutionary Biology (AREA)
  • Biophysics (AREA)
  • Medical Informatics (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Chemical & Material Sciences (AREA)
  • Analytical Chemistry (AREA)
  • Genetics & Genomics (AREA)
  • Molecular Biology (AREA)
  • Proteomics, Peptides & Aminoacids (AREA)
  • Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)

Abstract

The invention discloses the method for visualizing that a kind of new analysis biological genome copies number variation, all genes of each chromosome, abscissa is used as according to the ucstart positions of each gene, the average or median of the corresponding all sample copy numbers of the gene are used as the longitudinal axis, then scatterplot is drawn with MATLAB, a kind of color represents a kind of cancer.If contrasting kinds cancer type, we know it is which kind of cancer types according to what color can just will be apparent that, and the comparison effect of visualization between them is optimal.If contrasting the tumor sample and normal sample under same cancer types, tumor sample is expressed as a kind of color, and normal sample is expressed as another color, and the comparison effect of visualization between them is also optimal.Method for visualizing compared to before, our this method for visualizing, can be apparent from the copy number variation of the different cancer types of clear contrast or the copy number variation of different type sample is contrasted under same cancer types.

Description

A kind of new analysis biological genome copies the method for visualizing of number variation
Technical field
The invention belongs to bioinformatics or the technical field of gene technology, it is related to a kind of offer biological gene visual Method.
Background technology
With the development of bioscience, after the completion of the Human Genome Project, exploration of the mankind to biological phenomena is more and more deeper Enter.It is known that human body gene gene more than 30,000, they are made up of about 3,000,000,000 base-pairs, are distributed in nucleus 23 pairs of chromosomes in.
DNA sequence dna total amount is an astronomical figure really, but the function macro-molecular protein of researching DNA and its coding, Day by day the sequence and structure increased is collected, arranged, stored, issues, extracts and processed.And therefrom analyze and find newly Sequence, so as to constantly disclose the molecular basis of Human Physiology and pathologic process, is provided for the prevention of human diseases, diagnosis and treatment Foundation, its meaning is very great, and has formd " bioinformatics " new branch of science.
Full-length genome copies the visual research of number variation, is also an important content.Current full-length genome copy number The visualization method that makes a variation is relatively simple, is characterized in intuitive and convenient, but copy number variation is carried out under multiple cancer types Contrast, or the effect of the contrast of the copy number variation of different type sample is not fine under same cancer types.
The content of the invention
In order to solve problems of the prior art, the present invention provides a kind of new analysis biological genome copy number and become Different method for visualizing (Mountain plot), overcomes and copy number variation contrast is carried out under multiple cancer types in the prior art, Or under same cancer types different type sample copy number variation contrast effect it is bad the problem of.
The technical scheme is that:A kind of new analysis biological genome copies the method for visualizing of number variation, including Following steps:
(1) determination of transverse and longitudinal coordinate:Human body has 22 autosomes, and gene gene more than 30,000, each gene has Which specific corresponding chromosome and corresponding specific original position, all genes of each chromosome (22 chromosomes, Namely 22 figures), according to the original position of each gene as abscissa, with the mean value function in matlab softwares or in Function of the location number travels through the corresponding all sample copy numbers of the gene, regard the average or I d median of the gene as ordinate;
(2) draw:With the plot functions in matlab, graphical pointv;Because the gene number on every chromosome is a lot, institute It is that point is very close to have the result drawn out;
(3) there is specific position coordinates in the centromere of every chromosome in biology, with the line functions in matlab, uses The perpendicular thick stick of black shows centromere, while by chromosome p arms, the gene on q arms is separated.
The beneficial effects of the invention are as follows:
Method for visualizing compared to before, our this method for visualizing, can be apparent from clear contrast several The copy number variation of different cancer types copy number variation of different type sample either under same cancer types Contrast.The black in centromere erects thick stick, and two arms of chromosome are separated.It is autosomal 22 with a kind of very intuitive way The copy number state value of all genes is showed.
Brief description of the drawings
Fig. 1 is adenocarcinoma of lung, squamous cell lung carcinoma tumor sample and non-tumour tuberculosis sample copying in the range of full-length genome Shellfish number variation distribution map;
Wherein red part represents the variation distribution of adenocarcinoma of lung tumour copy number, and blue portion represents lung squamous cancer tumour and copied The variation distribution of shellfish number variation;Green portion represents the distribution of non-malignant tuberculosis sample copy number, and how many sample wherein n represents Number.
Embodiment
The present invention will be described in detail with reference to the accompanying drawings and detailed description.
For a kind of cancer types, human body gene gene more than 30,000, each gene has which specific corresponding dye Colour solid and corresponding specific original position ucstart, we are each chromosome (22 chromosomes, that is, 22 figures) All genes, according to the ucstart positions of each gene as abscissa, the average of the corresponding all sample copy numbers of the gene Or then median draws scatterplot, a kind of color represents a kind of cancer as the longitudinal axis with MATLAB plot functions.As above scheme It is shown:The gene copy number state value of the sample of three types, can intuitively be contrasted very much.
Generally speaking, if to contrast kinds cancer type, we know which kind of is according to what color can just will be apparent that Cancer types, the comparison effect of visualization between them is optimal.If contrast tumor sample under same cancer types and Normal sample, tumor sample is expressed as a kind of color, and normal sample is expressed as another color, and the comparison between them is visual It is also optimal to change effect.And we are labelled with the perpendicular thick stick of black according to the position in centromere, by chromosome p arms, on q arms Gene is separated.This method for visualizing that we invent, carries out the contrast of copy number variation under multiple cancer types, or The contrast of person's copy number variation of different type sample under same cancer types, both of these case can be done.
Described above is only the preferred embodiment of the present invention, it is noted that for the ordinary skill people of the art For member, under the premise without departing from the principles of the invention, some improvements and modifications can also be made, these improvements and modifications also should It is considered as protection scope of the present invention.

Claims (1)

1. a kind of new analysis biological genome copies the method for visualizing of number variation, it is characterised in that comprise the following steps:
(1) determination of transverse and longitudinal coordinate:Human body has 22 autosomes, and gene gene more than 30,000, each gene has specifically Which corresponding chromosome and corresponding specific original position, all genes of each chromosome, according to each gene Original position travels through the gene with the mean value function in matlab softwares or median function corresponding all as abscissa Sample copy number, regard the average or I d median of the gene as ordinate;
(2) draw:With the plot functions in matlab, graphical pointv;Because the gene number on every chromosome is a lot, Suo Youhui The result made is that point is very close;
(3) there is specific position coordinates in the centromere of every chromosome in biology, with the line functions in matlab, uses black Perpendicular thick stick centromere is showed, while by chromosome p arms, the gene on q arms is separated.
CN201710266656.1A 2017-04-21 2017-04-21 A kind of new analysis biological genome copies the method for visualizing of number variation Pending CN107133494A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201710266656.1A CN107133494A (en) 2017-04-21 2017-04-21 A kind of new analysis biological genome copies the method for visualizing of number variation

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201710266656.1A CN107133494A (en) 2017-04-21 2017-04-21 A kind of new analysis biological genome copies the method for visualizing of number variation

Publications (1)

Publication Number Publication Date
CN107133494A true CN107133494A (en) 2017-09-05

Family

ID=59715071

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201710266656.1A Pending CN107133494A (en) 2017-04-21 2017-04-21 A kind of new analysis biological genome copies the method for visualizing of number variation

Country Status (1)

Country Link
CN (1) CN107133494A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110021363A (en) * 2017-12-25 2019-07-16 安诺优达基因科技(北京)有限公司 For constructing the device and method of user friendly chromosomal gene variation map

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
ANA H.BARRAGÁN LID: ""Profiling of DNA copy number in sarcomas by array comparative genomic hybridization and identification of candidate cancer genes"", 《UNIVERSITY OF OSLO》 *
DARREN A ABBEY等: ""YMAP: a pipeline for visualization of copy number variation and loss of heterozygosity in eukaryotic pathogens"", 《GENOME MEDICINE》 *
ZHE-WEI QIU等: ""Genome-wide copy number variation pattern analysis and a classification signature for non-small cell lung cancer"", 《HTTPS://DOI.ORG/10.1002/GCC.22460》 *

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110021363A (en) * 2017-12-25 2019-07-16 安诺优达基因科技(北京)有限公司 For constructing the device and method of user friendly chromosomal gene variation map
CN110021363B (en) * 2017-12-25 2021-01-15 安诺优达(义乌)医学检验有限公司 Device and method for constructing user-friendly chromosome gene variation map

Similar Documents

Publication Publication Date Title
Zhang et al. Dissecting transcriptional heterogeneity in primary gastric adenocarcinoma by single cell RNA sequencing
CN101100672B (en) Modified HPV E6-E7 fusion gene and coding protein thereof
CN113257350B (en) ctDNA mutation degree analysis method and device based on liquid biopsy and ctDNA performance analysis device
CN107169314A (en) Analyze biological genome gene expression, copy the method for visualizing of number variation
Xie et al. Evaluating cancer-related biomarkers based on pathological images: a systematic review
Das et al. Comparative evaluation of grading of breast carcinomas in fine needle aspirates by two methods
CN101691614A (en) Method for rapidly identifying rice black-streaked dwarf virus and southern rice black-streaked dwarf virus
CN103914257A (en) Number inputting and editing component, system and method based on touch screen
CN107133494A (en) A kind of new analysis biological genome copies the method for visualizing of number variation
CN105052830A (en) Fatty-liver-related liver cancer model building method based on knockout mice
Donadini et al. Oral cancer genesis and progression: DNA near-diploid aneuploidization and endoreduplication by high resolution flow cytometry
Luo et al. Deep learning-based fully automated differential diagnosis of eyelid basal cell and sebaceous carcinoma using whole slide images
Shirasawa et al. Structure and expression of an integrated human papillomavirus type 16 genome amplified in a cervical carcinoma cell line
Gu et al. Physical mapping of the 18S-26S rDNA by fluorescent in situ hybridization (FISH) in Camellia reticulata polyploid complex (Theaceae)
CN109825498A (en) For the preparation method of the probe of target nucleic acid target
CN113946730B (en) Gene data-based visual method for analyzing chromatin hierarchical structure
Carvalho et al. Mitochondrial DNA mapping of social-biological interactions in Brazilian Amazonian African-descendant populations
Liu et al. Patrilineal background of esophageal cancer and gastric cardia cancer patients in a Chaoshan high-risk area in China
Azimi et al. Breast cancer histopathology image analysis pipeline for tumor purity estimation
Menon et al. Exploring histological similarities across cancers from a deep learning perspective
Zhou et al. Late Ordovician (Hirnantian)'Mucronaspis (Songxites')-dominant trilobite fauna from northwestern Zhejiang, China
Macerola et al. Digital gene expression analysis on cytology smears can rule out malignancy in follicular-patterned thyroid tumors
Carder et al. Pleomorphic lobular carcinoma in situ
CN105177168A (en) Reagent kit for screening early-stage cervical cancers based on cervical liquid-based cytology and DNA methylation
Guo et al. Prediction of lymph node metastasis in primary gastric cancer from pathological images and clinical data by multimodal multiscale deep learning

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
WD01 Invention patent application deemed withdrawn after publication
WD01 Invention patent application deemed withdrawn after publication

Application publication date: 20170905