CN107609341A - Method and system for extracting a sub-network from a global protein-protein interaction network based on shortest paths - Google Patents

Info

Publication number
CN107609341A
CN107609341A CN201710703560.7A CN201710703560A CN107609341A CN 107609341 A CN107609341 A CN 107609341A CN 201710703560 A CN201710703560 A CN 201710703560A CN 107609341 A CN107609341 A CN 107609341A
Authority
CN
China
Prior art keywords
network
shortest path
interactions
protein
sub
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201710703560.7A
Other languages
Chinese (zh)
Inventor
郝彤
王倩
孙金生
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Tianjin University
Tianjin Normal University
Original Assignee
Tianjin Normal University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Tianjin Normal University
Priority to CN201710703560.7A
Publication of CN107609341A
Legal status: Pending

Landscapes

  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

The present invention provides a method and system for extracting a sub-network from a global protein-protein interaction network based on shortest paths. Target nodes are paired two by two and the shortest paths between each pair are extracted; the shortest paths are split to obtain all protein interaction pairs; and a visualized sub-network is obtained from the protein interaction pair data. The method is simple, fast, and easy to understand and operate. It allows proteins that do not interact directly in the global protein-protein interaction network to find the shortest paths connecting them, and it forms a sub-network from the nodes lying on those shortest paths.

Description

Method and system for extracting a sub-network from a global protein-protein interaction network based on shortest paths
Technical field
The invention belongs to the field of bioinformatics, and in particular relates to a method and system for extracting a sub-network from a global protein-protein interaction network based on shortest paths.
Background
A global protein-protein interaction network is an important structural description in bioinformatics and directly reflects the functions of a living organism. In practical research, however, it is often necessary to study one particular function in depth, such as immune function or a signal transduction pathway. This requires selecting, from the global protein-protein interaction network, the sub-network related to the target function for further study. That is, target nodes of interest are chosen in the global network according to the research purpose, and the sub-network formed by these nodes is studied in depth.
To obtain the paths connecting target nodes, the prior art typically extracts all paths related to the target nodes from the global protein-protein interaction network and forms a sub-network from them, as the built-in sub-network extraction function of the Cytoscape software does. From a biological point of view, however, a shorter information-flow path is more efficient and more consistent with the process of biological evolution, in which organisms evolve in the direction most favorable to their own survival. How to build a more reasonable and credible sub-network of target nodes has therefore become the main problem to be solved.
Summary of the invention
The problem to be solved by the invention is to provide a method that finds the nodes lying on the shortest paths between any two target nodes in a global protein-protein interaction network and forms a sub-network from these nodes.
To achieve this, the technical solution adopted by the invention is a method for extracting a sub-network from a global protein-protein interaction network based on shortest paths, comprising: pairing the target nodes two by two and extracting the shortest paths; splitting the shortest paths to obtain all protein interaction pairs; and obtaining a visualized sub-network from the protein interaction pair data.
Further, the target nodes are selected as follows: target nodes of interest are chosen arbitrarily from the global protein-protein interaction network, and the number of target nodes does not exceed the total number of nodes in the global network.
Further, the shortest paths are extracted as follows:
(1) all selected target nodes are paired two by two, and all pairings are obtained by combinatorial enumeration;
(2) the two proteins in each pairing are taken as the start point and end point, and the shortest path between the two points is calculated;
(3) all shortest paths found for every pairing are extracted.
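Steps (1) to (3) can be sketched as follows. This is a minimal illustration, not the implementation used by the inventors: it assumes the global network is unweighted and undirected and is held as an adjacency dictionary, and the names `build_adjacency`, `all_shortest_paths`, and `extract_shortest_paths` are hypothetical. A breadth-first search records every predecessor that lies on a shortest path, so that all equally short paths between a pair are recovered rather than just one.

```python
from collections import deque
from itertools import combinations


def build_adjacency(edges):
    """Build an undirected adjacency dictionary from (u, v) interaction pairs."""
    adj = {}
    for u, v in edges:
        adj.setdefault(u, set()).add(v)
        adj.setdefault(v, set()).add(u)
    return adj


def all_shortest_paths(adj, source, target):
    """Return every shortest path between source and target (unweighted graph)."""
    dist = {source: 0}
    preds = {source: []}  # predecessors of each node on some shortest path
    queue = deque([source])
    while queue:
        node = queue.popleft()
        for nxt in adj.get(node, ()):
            if nxt not in dist:
                dist[nxt] = dist[node] + 1
                preds[nxt] = [node]
                queue.append(nxt)
            elif dist[nxt] == dist[node] + 1:
                preds[nxt].append(node)  # another equally short route
    if target not in dist:
        return []  # the two proteins are not connected

    # Rebuild the paths by walking the predecessor lists back from the target.
    paths = []
    stack = [[target]]
    while stack:
        partial = stack.pop()
        current = partial[-1]
        if current == source:
            paths.append(partial[::-1])
        else:
            for p in preds[current]:
                stack.append(partial + [p])
    return paths


def extract_shortest_paths(adj, targets):
    """Steps (1)-(3): pair the targets and collect all shortest paths per pair."""
    return {
        (a, b): all_shortest_paths(adj, a, b)
        for a, b in combinations(targets, 2)
    }


# Toy network (assumed data, for illustration only).
toy_adj = build_adjacency(
    [("A", "B"), ("B", "C"), ("A", "D"), ("D", "C"), ("C", "E")]
)
toy_result = extract_shortest_paths(toy_adj, ["A", "C", "E"])
```

In the toy network, A and C are joined by two equally short paths (through B and through D), and both are kept, which matches the requirement in step (3) that all shortest paths be extracted.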
Further, the shortest paths are split as follows: every extracted shortest path is split into protein interaction pairs, each consisting of two directly interacting proteins; after splitting, duplicate interaction pairs obtained from different shortest paths are deleted, which yields all the protein interaction pairs that make up the protein interaction sub-network.
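The splitting and de-duplication step can be sketched as below. The function name `split_into_pairs` is hypothetical, and the sketch assumes that interactions are undirected, so that A-E and E-A count as the same pair; the patent text does not state this explicitly.

```python
def split_into_pairs(paths):
    """Split paths into consecutive protein pairs and drop duplicates.

    Returns (unique_pairs, total_pairs_before_deduplication).
    """
    unique_pairs = set()
    total = 0
    for path in paths:
        # Consecutive nodes on a path form one interaction pair.
        for u, v in zip(path, path[1:]):
            total += 1
            # Sort the two names so that (u, v) and (v, u) collapse together.
            unique_pairs.add(tuple(sorted((u, v))))
    return sorted(unique_pairs), total


example_paths = [
    ["A", "E", "F", "Z"],
    ["A", "G", "H", "Z"],
    ["A", "E", "K"],  # shares the pair A-E with the first path
]
example_pairs, example_total = split_into_pairs(example_paths)
```

Here the three paths yield eight pairs before de-duplication and seven afterwards, because the pair A-E occurs in two different paths.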
Further, the visualized sub-network is obtained as follows: the resulting protein interaction pair data are imported into the Cytoscape software to obtain a visualized sub-network graph.
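One way to prepare the pair data for import is to write it as a Simple Interaction Format (SIF) file, which Cytoscape can read. The patent does not say which file format the inventors used, so the choice of SIF, the interaction label `pp`, and the helper names `format_sif` and `write_sif` are all assumptions made for this sketch.

```python
def format_sif(pairs, interaction="pp"):
    """Render interaction pairs as SIF text: source<TAB>type<TAB>target."""
    return "".join(f"{u}\t{interaction}\t{v}\n" for u, v in pairs)


def write_sif(pairs, path, interaction="pp"):
    """Write the pairs to a .sif file that can be imported into Cytoscape."""
    with open(path, "w", encoding="utf-8") as handle:
        handle.write(format_sif(pairs, interaction))


sif_text = format_sif([("A", "E"), ("E", "F"), ("F", "Z")])
```

Each line of the resulting file describes one edge, so the de-duplicated pair list maps directly onto the edges of the visualized sub-network.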
The invention also provides a system for extracting a sub-network from a global protein-protein interaction network based on shortest paths, comprising:
a shortest path extraction module, for pairing the target nodes two by two and extracting the shortest paths;
a shortest path splitting module, for splitting the shortest paths to obtain all protein interaction pairs;
a sub-network module, for obtaining a visualized sub-network from the protein interaction pair data.
Further, the system also comprises a target node selection module, for arbitrarily choosing target nodes of interest from the global protein-protein interaction network, the number of target nodes not exceeding the total number of nodes in the global network.
Further, the shortest path extraction module comprises:
a pairing unit, for pairing all selected target nodes two by two and obtaining all pairings by combinatorial enumeration;
a computing unit, for taking the two proteins in each pairing as the start point and end point and calculating the shortest path between the two points;
an extraction unit, for extracting all shortest paths found for every pairing.
Further, the shortest path splitting module comprises:
a splitting unit, for splitting every extracted shortest path into protein interaction pairs, each consisting of two directly interacting proteins;
a deletion unit, for deleting, after splitting, the duplicate interaction pairs obtained from different shortest paths.
Further, the sub-network module comprises:
a Cytoscape software unit, for importing the resulting protein interaction pair data into the Cytoscape software to obtain a visualized sub-network graph.
Compared with the prior art, the method and system proposed by the invention for extracting a sub-network from a global protein-protein interaction network based on shortest paths have the following beneficial effects:
(1) the invention allows proteins that do not interact directly in the global protein-protein interaction network to find the shortest paths connecting them, and forms a sub-network from the nodes lying on those shortest paths;
(2) the method is simple, fast, and easy to understand and operate;
(3) by extracting shortest paths, the method obtains protein interaction paths that are closer to the real information transfer process in an organism; by then splitting and visualizing the shortest paths, it obtains a sub-network that corresponds more closely to the target function. Compared with the prior art, the method is more consistent with the theory of biological evolution, and the resulting sub-network is more reasonable, more credible, and more biologically meaningful.
Brief description of the drawings
Fig. 1 is a flow chart of the method of the invention;
Fig. 2 is a schematic diagram of shortest path selection according to the invention;
Fig. 3 shows the Wnt signaling pathway immune sub-network of the embodiment of the invention.
Detailed description
The invention is further described below with reference to a specific embodiment.
Bioinformatics research often requires choosing target nodes from a global protein-protein interaction network, but these target nodes may not interact with one another directly. The method of the invention finds the nodes lying on the shortest paths between any two target nodes and forms a sub-network from these nodes. As shown in Fig. 1, the method is divided into four steps: target node selection, shortest path extraction, path splitting, and sub-network visualization.
(1) Target node selection: according to the user's research purpose, target nodes of interest may be chosen arbitrarily from the global protein network; the number of target nodes does not exceed the total number of nodes in the global network.
(2) Shortest path extraction: all selected target nodes are paired two by two, and all pairings are obtained by combinatorial enumeration. The two proteins in each pairing are taken as the start point and end point, and the shortest path between them is calculated. As shown in Fig. 2, target proteins A and Z do not interact directly but are connected by the following four routes: A-B-C-D-Z; A-E-F-Z; A-G-H-Z; A-W-Q-M-N-Z. The shortest path rule extracts the route or routes from A to Z that pass through the fewest nodes, which in Fig. 2 are A-E-F-Z and A-G-H-Z. There may be more than one shortest path between two points, and all of them must be extracted. In this way, the shortest paths between the two proteins of every protein pair are obtained.
(3) Path splitting: every shortest path extracted in step (2) is split into pairwise protein interactions. For example, the path A-E-F-Z in Fig. 2 is split into three protein interaction pairs: A-E, E-F, and F-Z. After splitting, duplicate interaction pairs obtained from different shortest paths are deleted, which yields all the protein interaction pairs that make up the protein interaction sub-network.
(4) Sub-network visualization: the protein interaction pair data obtained in step (3) are imported into the Cytoscape software to obtain a visualized sub-network graph.
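The A-to-Z example of step (2) can be checked with a short script. The sketch below rebuilds a graph from the four routes listed in the text only; Fig. 2 itself may contain further edges that are not reproduced here, and the helper name `simple_paths` is hypothetical. For clarity it enumerates every loop-free route by depth-first search and then keeps those with the fewest nodes, which is practical only for small graphs. A breadth-first search is the usual choice on a network with thousands of nodes.

```python
from collections import defaultdict

# The four routes between A and Z named in the description of Fig. 2.
routes = [
    ["A", "B", "C", "D", "Z"],
    ["A", "E", "F", "Z"],
    ["A", "G", "H", "Z"],
    ["A", "W", "Q", "M", "N", "Z"],
]

# Undirected adjacency built from consecutive nodes of each route.
adj = defaultdict(set)
for route in routes:
    for u, v in zip(route, route[1:]):
        adj[u].add(v)
        adj[v].add(u)


def simple_paths(adj, source, target, path=None):
    """Yield every loop-free path from source to target (depth-first)."""
    path = (path or []) + [source]
    if source == target:
        yield path
        return
    for nxt in sorted(adj[source]):
        if nxt not in path:  # never revisit a node on the current path
            yield from simple_paths(adj, nxt, target, path)


all_routes = list(simple_paths(adj, "A", "Z"))
fewest_nodes = min(len(p) for p in all_routes)
shortest = [p for p in all_routes if len(p) == fewest_nodes]

for p in shortest:
    print("-".join(p))
```

Under these assumptions the script finds the same four routes as the text and selects A-E-F-Z and A-G-H-Z, each passing through four nodes.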
The method is illustrated below by applying it to the extraction of the Wnt signaling pathway immune sub-network of Litopenaeus vannamei:
The experiment uses as the global network the Litopenaeus vannamei immune protein interaction network built in the laboratory's earlier research, which contains 2,446 nodes and 32,815 edges. A target protein must be contained in the global interaction network and be annotated in the KEGG database with the Wnt signaling pathway. The method described in this patent is then used to extract, from the L. vannamei global immune protein interaction network, the shortest paths between the immune-function proteins of the Wnt signaling pathway, and to build the Wnt signaling pathway immune sub-network. The specific procedure is as follows:
(1) Target node selection: proteins that are contained in the L. vannamei global immune protein interaction network and whose pathway annotation in the KEGG database is the Wnt signaling pathway are chosen as target nodes. Six target nodes are obtained: P09615, P18289, Q9V3Q6, Q7JWS8, P49768, and P18537. Not every pair of these target nodes interacts directly;
(2) Shortest path extraction: all selected target nodes are paired two by two by combinatorial enumeration, giving 15 pairs in total. Only two of these pairs interact directly; the others do not. Taking the two proteins of each pair as the start point and end point, the shortest paths between them are calculated, giving 59 shortest paths;
(3) Path splitting: the shortest paths obtained in step (2) are split into pairwise interacting protein pairs, giving 132 protein interaction pairs, of which 56 are duplicates; after the duplicates are removed, 76 protein interaction pairs remain;
(4) Sub-network visualization: the protein interaction pair data obtained in step (3) are imported into the Cytoscape software to obtain the visualized Wnt signaling pathway immune sub-network, which contains 41 nodes and 76 edges, as shown in Fig. 3.
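The counts reported in this embodiment can be cross-checked with a few lines of arithmetic. The sketch below uses only the six accession numbers and the figures quoted above; it does not reproduce the 59 shortest paths, which would require the global network data that the patent text does not include.

```python
from itertools import combinations
from math import comb

# The six target nodes listed in step (1).
targets = ["P09615", "P18289", "Q9V3Q6", "Q7JWS8", "P49768", "P18537"]

# Step (2): unordered pairings of six targets, C(6, 2).
pairs = list(combinations(targets, 2))
assert len(pairs) == comb(len(targets), 2)

# Step (3): pairs left after removing the reported duplicates.
pairs_before = 132
duplicates = 56
pairs_after = pairs_before - duplicates

print(len(pairs), pairs_after)
```

The 15 pairings and the 76 remaining interaction pairs agree with the text, and the 76 pairs match the 76 edges reported for the sub-network in step (4).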
The foregoing is only a specific embodiment of the invention and is not intended to limit its scope of protection. Any modification, equivalent substitution, or improvement made within the spirit and principles of the invention shall fall within the scope of protection of the invention.

Claims (10)

  1. A method for extracting a sub-network from a global protein-protein interaction network based on shortest paths, characterized by comprising: pairing the target nodes two by two and extracting the shortest paths; splitting the shortest paths to obtain all protein interaction pairs; and obtaining a visualized sub-network from the protein interaction pair data.
  2. The method according to claim 1, characterized in that the target nodes are selected as follows: target nodes of interest are chosen arbitrarily from the global protein-protein interaction network, and the number of target nodes does not exceed the total number of nodes in the global protein-protein interaction network.
  3. The method according to claim 1, characterized in that the shortest paths are extracted as follows:
    (1) all selected target nodes are paired two by two, and all pairings are obtained by combinatorial enumeration;
    (2) the two proteins in each pairing are taken as the start point and end point, and the shortest path between the two points is calculated;
    (3) all shortest paths found for every pairing are extracted.
  4. The method according to claim 1, characterized in that the shortest paths are split as follows: every extracted shortest path is split into protein interaction pairs, each consisting of two directly interacting proteins; after splitting, duplicate interaction pairs obtained from different shortest paths are deleted, which yields all the protein interaction pairs that make up the protein interaction sub-network.
  5. The method according to claim 1, characterized in that the visualized sub-network is obtained as follows: the resulting protein interaction pair data are imported into the Cytoscape software to obtain a visualized sub-network graph.
  6. A system for extracting a sub-network from a global protein-protein interaction network based on shortest paths, characterized by comprising:
    a shortest path extraction module, for pairing the target nodes two by two and extracting the shortest paths;
    a shortest path splitting module, for splitting the shortest paths to obtain all protein interaction pairs;
    a sub-network module, for obtaining a visualized sub-network from the protein interaction pair data.
  7. The system according to claim 6, characterized by further comprising a target node selection module, for arbitrarily choosing target nodes of interest from the global protein-protein interaction network, the number of target nodes not exceeding the total number of nodes in the global protein-protein interaction network.
  8. The system according to claim 6, characterized in that the shortest path extraction module comprises:
    a pairing unit, for pairing all selected target nodes two by two and obtaining all pairings by combinatorial enumeration;
    a computing unit, for taking the two proteins in each pairing as the start point and end point and calculating the shortest path between the two points;
    an extraction unit, for extracting all shortest paths found for every pairing.
  9. The system according to claim 6, characterized in that the shortest path splitting module comprises:
    a splitting unit, for splitting every extracted shortest path into protein interaction pairs, each consisting of two directly interacting proteins;
    a deletion unit, for deleting, after splitting, the duplicate interaction pairs obtained from different shortest paths.
  10. The system according to claim 6, characterized in that the sub-network module comprises:
    a Cytoscape software unit, for importing the resulting protein interaction pair data into the Cytoscape software to obtain a visualized sub-network graph.
CN201710703560.7A 2017-08-16 2017-08-16 Method and system for extracting a sub-network from a global protein-protein interaction network based on shortest paths Pending CN107609341A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201710703560.7A CN107609341A (en) 2017-08-16 2017-08-16 Method and system for extracting a sub-network from a global protein-protein interaction network based on shortest paths

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201710703560.7A CN107609341A (en) 2017-08-16 2017-08-16 Method and system for extracting a sub-network from a global protein-protein interaction network based on shortest paths

Publications (1)

Publication Number Publication Date
CN107609341A (en) 2018-01-19

Family

ID=61065136

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201710703560.7A Pending CN107609341A (en) 2017-08-16 2017-08-16 Method and system for extracting a sub-network from a global protein-protein interaction network based on shortest paths

Country Status (1)

Country Link
CN (1) CN107609341A (en)

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20040059522A1 (en) * 2002-09-23 2004-03-25 Kyungsook Han Method for partitioned layout of protein interaction networks
CN101246520A (en) * 2008-03-18 2008-08-20 中南大学 Protein complex recognizing method based on range estimation
CN102339362A (en) * 2011-11-08 2012-02-01 苏州大学 Method for extracting protein interaction relationship
CN103093119A (en) * 2013-01-24 2013-05-08 南京大学 Method for recognizing significant biologic pathway through utilization of network structural information
CN106874708A (en) * 2017-01-23 2017-06-20 陕西师范大学 The method that key protein matter is recognized using the artificial bee colony optimized algorithm of the mechanism of looking for food

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
GUI,TUANTUAN 等: ""Identification of Hepatocellular Carcinoma–Related Genes with a Machine Learning and Network Analysis"", 《JOURNAL OF COMPUTATIONAL BIOLOGY》 *

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20180119