CN100390537C - Method for predicting ion molecular formula utilizing fragmental ion is otopic peak in tandem mass-spectrum - Google Patents

Method for predicting ion molecular formula utilizing fragmental ion is otopic peak in tandem mass-spectrum Download PDF

Info

Publication number
CN100390537C
CN100390537C CNB2004100908060A CN200410090806A CN100390537C CN 100390537 C CN100390537 C CN 100390537C CN B2004100908060 A CNB2004100908060 A CN B2004100908060A CN 200410090806 A CN200410090806 A CN 200410090806A CN 100390537 C CN100390537 C CN 100390537C
Authority
CN
China
Prior art keywords
fragmention
isotopic
molecular formula
formula
peak
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
CNB2004100908060A
Other languages
Chinese (zh)
Other versions
CN1773276A (en
Inventor
高文
张京芬
蔡津津
贺思敏
曾嵘
陈润生
王海鹏
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Institute of Computing Technology of CAS
Original Assignee
Institute of Computing Technology of CAS
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Institute of Computing Technology of CAS filed Critical Institute of Computing Technology of CAS
Priority to CNB2004100908060A priority Critical patent/CN100390537C/en
Publication of CN1773276A publication Critical patent/CN1773276A/en
Application granted granted Critical
Publication of CN100390537C publication Critical patent/CN100390537C/en
Anticipated expiration legal-status Critical
Expired - Fee Related legal-status Critical Current

Links

Landscapes

  • Other Investigation Or Analysis Of Materials By Electrical Means (AREA)

Abstract

The present invention discloses a method for predicting an ion molecular formula by using isotopic peaks of fragment ions in a tandem mass spectrum. In the method, the single isotopic quality of a fragment ion and the relative abundance of each isotopic peak relative to a single isotopic element are respectively obtained from the tandem mass spectrum and a universal molecular formula with undetermined atom number of each element; then the obtained quality and relative abundance are respectively matched to obtain the nonnegative integer solution of the atom number of each undetermined element in the universal molecular formula; finally the molecular formula of the fragment ion can be obtained. The method of the present invention uses the isotopic peak information of the fragment ion in the tandem mass spectrum to calculate the corresponding molecular formula of the fragment ion in a mode of the isotopic peak of the fragment ion in the tandem mass spectrum. The method of the present invention can provide the accurate molecular formula information of the fragment ion, identify a candidate sequence provided by a database searching method used for identifying a polypeptide sequence and provide references for a de novo method used for solving the polypeptide sequence to generate a high-reliability candidate sequence.

Description

Predict the method for ionic molecule formula with the isotopic peak of fragmention in the tandem mass spectrum
Technical field
The present invention relates to a kind of proteomic analysis methods, specifically, relate to the method for the molecular formula of the cracked back fragmention that produces of a kind of predicted polypeptide sequence.
Background technology
Utilizing peptide fingerprint mass spectrum and tandem mass spectrum technology and database search at present and directly separating preface (de novo) method and identify that the pre-service of mass spectrometric data and the aftertreatment of qualification result are extremely important in the research of peptide sequence and protein.
Certified polypeptide in mass spectrometer by cracked be fragmention, the quality and the abundance of these fragmentions are measured by mass spectrometer, form tandem mass spectrum.Each fragmention with and isotope ion all in tandem mass spectrum, form corresponding spectrum peak.The isotope summit of considering fragmention causes for the qualification process of peptide or protein to obscure, be about 0.34 such as of poor quality between some amino acid residue, 1 and 1.5da, and the monovalence of same fragmention, divalence, the peak-to-peak mass-to-charge ratio of the isotope of trivalent (m/z) difference is respectively 1,0.5 and 0.333, the m/z difference of these amino acid residue quality differences and isotopic peak is overlapping, and causing needs in qualification process to judge that a spectrum peak in the tandem mass spectrum is certain fragment ion peak or the isotopic peak of another fragmention; In addition, the overlapping phenomenon of the isotopic peak of a plurality of amino acid masses summation backs and certain fragmention can be more.Therefore, one of traditional data preprocessing tasks is to identify the isotopic peak of a fragmention and rejected.
Yet in fact, it is closely-related that the distribution pattern of the isotopic peak of the fragmention that shows in the mass spectrum and the atom of this fragmention are formed (being molecular formula).Therefore can utilize the isotopic peak of fragmention to predict the molecular formula of this fragmention with regard to needing a kind of method, like this, the molecular formula of the fragmention that dopes can provide more information more accurately for database search and the de novo method that peptide is identified on the one hand, on the other hand, for carrying out aftertreatment, qualification result provides more foundation.
Summary of the invention
The object of the present invention is to provide a kind of isotopic peak that utilizes the fragmention in the tandem mass spectrum to predict the method for the molecular formula of this fragmention.
To achieve these goals, the invention provides a kind of method of using the isotopic peak prediction ionic molecule formula of fragmention in the tandem mass spectrum, comprising:
Step 1): from tandem mass spectrum, obtain single isotope and at least one isotopic spectrum peak thereof of a fragmention, calculate the peak-to-peak relative abundance of at least one isotopic spectrum of the list isotopic spectrum peak and the described fragmention of the isotopic quality of list of described fragmention, described fragmention;
Step 2) provide a general molecular formula of fragmention, each atoms of elements number is undetermined in the described general molecular formula;
Step 3): obtain the isotopic quality of theoretic list of fragmention, single isotope and its at least one abundance ratio of isotopes of fragmention with described general molecular formula; The single isotope of the isotopic quality of described theoretic list, fragmention and the relative abundance of its at least one isotope ion are the function of atom number undetermined in the described general molecular formula;
Step 4): will do coupling from tandem mass spectrum quality and relative abundance in the quality that obtain in the step 3) and relative abundance and the step 1), separate with the nonnegative integer that obtains each atoms of elements number undetermined in the described general molecular formula, thereby obtain the molecular formula of described fragmention.
In technique scheme, at least one isotope of the fragmention described in step 1) and the step 3) comprises first isotope and second isotope of fragmention.
In technique scheme, the peak-to-peak relative abundance of at least one isotopic spectrum of the list isotopic spectrum peak of the isotopic quality of list of the described fragmention that obtains in the step 1), described fragmention and described fragmention is constituted the isotopic distribution vector of an experiment; The isotopic quality of theoretic list, single isotope of fragmention and the isotopic distribution vector that its at least one abundance ratio of isotopes constitutes a theory with the fragmention that obtains in the step 3); Described coupling in the step 4) is as the coupling mark with the Euclidean distance between the isotopic distribution vector of the isotopic distribution vector of described experiment and described theory.
In technique scheme, comprise that also the chemical rule constraint condition of using the molecular formula that makes acquisition to meet chemical sense retrains described coupling.
In technique scheme, the nonnegative integer of each atoms of elements number undetermined is separated and is comprised in the described general molecular formula by described coupling acquisition: the real solution that obtains each atoms of elements number undetermined in the described general molecular formula by described coupling; Searching for the nonnegative integer that obtains each atoms of elements number undetermined in the described general molecular formula in the field of described real solution separates.
In technique scheme, also comprise the nonnegative integer of each atoms of elements number undetermined in the described general molecular formula that obtains in the step 4) is separated the step of filtering.Described filtration comprises average isotopic distribution mode method, and this method is filtered described nonnegative integer with single isotope of the isotopic quality of theoretic list of fragmention, fragmention and the statistical relationship between its at least one abundance ratio of isotopes and separated.Described filtration comprises that the chemical rule constraint condition that meets chemical sense with the molecular formula that makes acquisition filters described nonnegative integer and separate.Described filtration comprises separating with the nonnegative integer of two fragmentions to be carried out cross validation and separates with the nonnegative integer of filtering described two fragmentions.
The invention has the advantages that:
1) this method is that the isotope of fragmention in the tandem mass spectrum is composed making full use of of peak information;
2) this method can be composed the pattern at peak by the isotope of tandem mass spectrum fragmention, calculates the molecular formula (order of accuarcy is relevant with mass spectral precision, and precision is high more, and the molecular formula that calculates is reliable more) of this fragmention correspondence rapidly and accurately;
3) this method can provide the information of molecular formula accurately of fragmention, and the candidate sequence that can provide the database search method of identifying peptide sequence is differentiated;
4) the ionic molecule formula that calculates of this method can instruct the de novo method of finding the solution peptide sequence to produce highly reliable candidate's sequence.
Embodiment
Below in conjunction with the drawings and specific embodiments the present invention is described in further detail.
Single isotope of a fragmention is designated as P, and first isotope of this fragmention is designated as P 1, the second isotope fragmention is designated as P 2, the rest may be inferred, and the N isotope ion is designated as P NHere, fragmention list isotope P is meant that the various components at this ion are single isotope (being that proton number is identical with neutron number).And the isotope of fragmention be meant with single isotope fragmention have identical molecular formula, but have than single isotope and more to many ion of extra neutrons, for example first isotope P of fragmention 1Single isotope P than fragmention has an extra neutron more, the second isotope P 2Have two extra neutrons than single isotope P, the rest may be inferred more.In the present invention, the isotope of fragmention is the ion that has extra neutron on the whole than single isotope of fragmention.
Peptide sequence enters mass spectrometer and is ionized, and in mass spectrometer, (Collision-InducedDissociation CID) is cracked into a plurality of fragmentions under the effect to have the separation that the peptide ion (these peptide ions also have identical amino acid sequence usually) of specific mass-to-charge ratio (m/z) colliding-inducing.Thereby tested the measuring of the m/z of these fragmentions forms tandem mass spectrum, and in a tandem mass spectrum, its horizontal ordinate is represented the mass-to-charge ratio (m/z) of fragmention, and its ordinate is the abundance of detected fragmention.
In tandem mass spectrum, the single isotope P that picks out a fragmention with and isotope P 1~P NIn the spectrum peak of at least one correspondence, target of the present invention then is to predict the molecular formula of single isotope P correspondence of fragmention by the distribution situation of these isotopic peaks.In one embodiment of the invention, the single isotope P that from tandem mass spectrum, only picks out this fragmention with and the first isotope P 1With the second isotope P 2Those skilled in the art will readily appreciate that in the description from behind, in other embodiments of the invention,, also can only pick out an isotopic spectrum peak of fragmention for the isotope fragmention---the first isotope fragmention P for example 1, perhaps also can pick out more isotopic spectrum peak, choosing of different number isotope fragmentions can realize method of the present invention, complexity and the precision calculated in the time of still can having influence on the invention process.
From tandem mass spectrum, can also obtain the mass of ion M of single isotope fragmention P e, this is that those skilled in the art is known.
In following calculating, at first define the isotopic distribution vector eIPV=(M of an experiment for convenience e, I 1, I 2), wherein, M eBe the mass of ion of single isotope P of the fragmention that from tandem mass spectrum, obtains, I 1And I 2The first isotope P of the corresponding fragmention of difference 1With the second isotope P 2The spectrum peak with respect to the relative abundance at the spectrum peak of single isotope P, these data all can obtain from tandem mass spectrum.
Then, define isotopic distribution vector tIPV=(M, the T of a theory again 1, T 2), this theory isotopic distribution vector tIPV can obtain from the general molecular formula of fragmention.If the general molecular formula of fragmention is C N1H N2N N3O N4S N5, wherein n1~the n5 of each atom composition number of expression is a undetermined parameter in this molecular formula.Like this, in theoretical isotopic distribution vector tIPV, M is the quality from the fragmention of general molecular formula acquisition, T 1And T 2Be respectively the first isotope fragmention that obtains from the general molecular formula and the second isotope fragmention about single isotope fragmention relative abundance.Theoretical isotopic distribution vector tIPV can specifically can obtain by formula:
M=V×X (1)
T 1=n 1q C+n 2q H+n 3q N+n 4q O1+n 5q S1 (2)
T 2 = n 4 q O 2 + n 5 q S 2 + 1 2 T 1 2 - 1 2 ( n 1 q C 2 + n 2 q H 2 + n 3 q N 2 + n 4 q O 1 2 + n 5 q S 1 2 ) - - - ( 3 )
V=[12 wherein, 1,14,16,32], the numeral among the V is each atoms of elements amount, X=[n1, n2, n3, n4, n5] Tq C, q HAnd q NIt is respectively occurring in nature 13C with respect to 12C, D with respect to H, 14N with respect to 15The relative abundance of N, q 01And q 02It then is respectively occurring in nature 17O with respect to 16O, 18O with respect to 16The relative abundance of O, q S1And q S2It is occurring in nature 33S with respect to 32S, 34S with respect to 32The relative abundance of S, these relative abundances are known numeric value.
As seen, for the isotopic distribution of theory vectorial tIPV=(M, T 1, T 2), M wherein, T 1And T 2Be X=[n1, n2, n3, n4, n5] function.
In the present invention, with the isotopic distribution of theory vectorial tIPV=(M, T 1, T 2) and experiment isotopic distribution vector eIPV=(M e, T 1, T 2) do coupling, so that obtain the molecular formula that the isotopic distribution vector with experiment mates most, also be the atom composition of vector X=[n1 in the general molecular formula, n2, n3, n4, n5] and a nonnegative integer separate.
In one embodiment of the invention, with the coupling mark of the Euclidean distance E between the isotopic distribution vector eIPV of theoretical isotopic distribution vector tIPV and experiment as tIPV and eIPV:
E = δ m 2 + δ 1 2 + δ 2 2 = ( M - M e ) 2 + ( T 1 - I 1 ) 2 + ( T 2 - I 2 ) 2 - - - ( 4 )
With formula (1)~(3) substitution (4), obtain
δ m=n 1*12+n 2*1+n 3*14+n 4*16+n 5*32-M e1, (5)
δ 1=n 1*q C+n 2*q H+n 3*q N+n 4*q O1+n 5*q S1-I 1, (6)
δ 2 = n 4 * q O 2 + n 5 * q S 2 - 1 2 ( n 1 * q C 2 + n 2 * q H 2 + n 3 * q N 2 + n 4 * q O 1 2 + n 5 * q S 1 2 )
+ ( n 1 * q C + n 2 * q H + n 3 * q N + n 4 * q O + n 5 * q S ) * I 1 - 1 2 I 1 2 - I 2 + 1 2 δ 1 2 . - - - ( 7 )
Ignore in the formula (7)
Figure C20041009080600084
, [δ is then arranged mδ 1δ 2]=AX+B obtains
Q ( X ) = E 2 = δ m δ 1 δ 2 δ m δ 1 δ 2 = X T A T AX + 2 B T AX + B T B , - - - ( 8 )
Then have:
E = Q ( X ) = X T A T AX + 2 B T AX + B T B - - - ( 9 )
Here in formula (9), X=[n1, n2, n3, n4, n5] TBe the atom composition of vector of fragmention undetermined, A and B are the constant matricess that is made of known quantity, and known quantity comprises the M that obtains from tandem mass spectrum here e, I 1And I 2, and V=[12,1,14,16,32] and formula (2) and (3) in each abundance ratio of isotopes.
Described Euclidean distance E minimizes with formula (9), separates for one that can obtain X.Usually,, preferably also to some chemical rule constraint conditions be set to formula (9) for the molecular formula that makes acquisition meets chemical sense, for example:
● the fragmention quality of the molecular formula correspondence that obtains with X must be at scope [M e-δ, M e+ δ] in, δ is the maximum magnitude of m/z error, δ can be determined by mass spectrometric measuring accuracy.Just to satisfy | VX-M e|≤δ.
● for certain element in the fragmention molecular formula, the m/z that uses ion is divided by the minimum isotopic mass number of this element quality, and the integral part of getting the gained result is exactly the upper limit of this element number.For example the atomic weight of element O is 16, if the mass-to-charge ratio of ion is m/z, then in the fragmention number of O element on be limited to
Figure C20041009080600087
Promptly in X
Figure C20041009080600088
Similarly, also can obtain similar constraint condition for other element in the fragmention.
● in fragmention, the number of C necessarily less than the number of H (promptly in X the number of n1<n3), O and N necessarily less than the number of C (promptly at n4<n1 and n3<n1) or the like.These constraint conditions lie in the composition mode of the molecular composition mode of amino acid residue and leading ion type, and those skilled in the art is easy to sum up out according to their characteristics.
● in the ion with an electronics, the number sum of H and N is an odd number.Reason is if ion has an electric charge, so just has unsatisfied chemical bond to exist, and H and N have the odd number quantivalency and C, O, S have the even number quantivalency.
Should be appreciated that purpose that those skilled in the art also can meet chemical sense from the molecular formula that makes fragmention constructs other constraint condition.
In above-mentioned constraint condition or other constraint condition part or all can be expressed as a linear inequality DX≤G.Like this,, can solve this minimization problem of Euclidean distance E by the QUADRATIC PROGRAMMING METHOD FOR of standard in conjunction with formula (9), as shown in Equation (10):
Figure C20041009080600091
The optimum solution of the X that obtains with QUADRATIC PROGRAMMING METHOD FOR from formula (10) is the X that separates in the real number field R, in order to seek real molecular formula, can be with X RBe used as starting point, then the nonnegative integer candidate solution of Local Search X in its neighborhood.Exactly, be exactly to each and X RExist one to give a mark, use formula (9) to estimate the matching degree of these nonnegative integer candidate solutions in other words apart from the nonnegative integer candidate's candidate solution molecular formula in the scope of d.The value of d adapts with mass range of ions.Avoided enumerating all possible molecular formula like this, can in big mass range, predict the ionic molecule formula and guarantee higher reliability and operational efficiency.
Through Local Search, still can produce the candidate molecules formula of some, comprising some illegal and with experiment tandem mass spectrum unmatched molecular formula (can be called invalid with impossible molecular formula), in order to improve the degree of accuracy of prediction, preferably need eliminating as much as possible they.Can utilize one or more methods that comprise in average isotopic distribution pattern, chemical rule constraint and the cross validation to filter the candidate molecules formula in the present invention.These methods specifically describe as follows:
A. average isotopic distribution pattern
Said average isotopic distribution pattern is theoretical isotopic distribution vector tIPV=(M, T 1, T 2) in ingredient M, T 1And T 2Between statistical relationship.In order to seek the theoretical average isotopic distribution pattern of fragmention, the isotopic of theoretical fragmention of polypeptide that the inventor has calculated the trypsin hydrolysis correspondence of all proteins in the existing Protein Data Bank is evenly distributed and standard deviation, disclosed ingredient M, the T of tIPV 1And T 2Between relation.Specifically, the inventor at first carries out the protein among the SWISS-PROT theoretical enzyme and cuts and calculate polypeptide; Select quality in that (polypeptide in the 60u~3000u), this scope correspondence Q-TOF MS/MS and tested mass spectral critical field then.In addition, it should be noted that the isotope of S + 2Very high (probability of appearance is 0.04210 to S, approximately is at the content of occurring in nature 18And in most cases can to comprise the polypeptide of the S more than five very rare 20 times of O).Therefore, we can be divided into six classes: S with above-mentioned molecular formula 0, S 1, S 2, S 3, S 4And S 5+, the number of corresponding contained S is 0,1,2,3 respectively, the peptide section more than 4 and 5 and 5.The inventor by these six classifications to adding up.Statistical result showed T 1Linear with mass M, T 2Then be the secondary relation with M, and T 2Along with T 1Increase and increase and and T 1Become quadratic function relation.
Like this, pass through T 1, T 2Can filter the candidate molecules formula with the above-mentioned distribution relation of M, to get rid of those invalid and/or impossible molecular formula.
B. chemical rule constraint
The chemical rule constraint here is similar with the constraint condition DX≤G in the formula (10), and its difference is: in formula (10), constraint condition DX≤G is used for constraint formulations
E = Q ( X ) = X T A T AX + 2 B T AX + B T B
So that obtain the X that separates in the real number field of X under this constraint condition RAnd here, these constraint conditions are used to constrain in X RThe field in the nonnegative integer that obtains of search separate the candidate molecules formula so that these candidate molecules formulas are filtered.
C. cross validation
Especially, the fragmention of the b series of a peptide section all is a homology, comprises b-, a-, and b*-, a*-, b °-, a ° of type ion, they share an identical original amino acid, can infer that thus their isotopic distribution pattern is very similar.Y series ion also is like this.If the M of certain two fragmention in the mass spectrum eDiffer 28,17 or 18, and the I of these two fragmentions 1And I 2Very approaching, just can think the eIPV homology of two fragmention correspondences.Then, we just can use the eIPV of homology to carry out cross validation to predicting the outcome.For example, in the candidate molecules formula tabulation in a fragmention C is arranged for two fragmentions of homology A1H A2N A3O A4S A5If, C A1-1H A2N A3O A4-1S A5Do not appear in the candidate molecules formula tabulation of another fragmention, so just can think candidate molecules formula C A1H A2N A3O A4S A5Be the result of mating at random and it is got rid of.

Claims (8)

1. method with the isotopic peak of fragmention in tandem mass spectrum prediction ionic molecule formula comprises:
Step 1): from tandem mass spectrum, obtain single isotope and at least one isotopic spectrum peak thereof of a fragmention, calculate the isotopic distribution vector of the peak-to-peak relative abundance of at least one isotopic spectrum of the list isotopic spectrum peak of the isotopic quality of list of described fragmention, described fragmention and described fragmention as experiment;
Step 2): a general molecular formula of fragmention is provided, and each atoms of elements number is undetermined in the described general molecular formula;
Step 3): obtain single isotope of the isotopic quality of theoretic list, fragmention of fragmention and its at least one abundance ratio of isotopes isotopic distribution vector with described general molecular formula as theory; Single isotope of the isotopic quality of described theoretic list, fragmention and its at least one abundance ratio of isotopes are the function of atom number undetermined in the described general molecular formula;
Step 4): the Euclidean distance of the isotopic distribution vector of the described experiment that obtains from tandem mass spectrum in the isotopic distribution vector of the described theory that obtains in the step 3) and the step 1) is minimized, obtain the molecular formula of the real number field of described fragmention;
Step 5): Local Search obtains the candidate molecules formula in nonnegative integer territory in the neighborhood of the molecular formula of described real number field, and obtains the predictive molecule formula of described fragmention from described candidate molecules formula;
Wherein, at least one isotope of the fragmention described in described step 1) and the step 3) comprises first isotope and second isotope of fragmention; And
The isotopic distribution vector eIPV=(M of described experiment e, I 1, I 2), wherein, M eBe the isotopic mass of ion of list of the fragmention that from tandem mass spectrum, obtains, I 1And I 2First isotope of corresponding fragmention and the second isotopic spectrum peak are with respect to the relative abundance at single isotopic spectrum peak respectively;
Isotopic distribution vector tIPV=(M, the T of described theory 1, T 2) obtain in the following way:
M=V×X (1)
T 1=n 1q C+n 2q H+n 3q N+n 4q O1+n 5q S1 (2)
T 2 = n 4 q O 2 + n 5 q S 2 + 1 2 T 1 2 - 1 2 ( n 1 q C 2 + n 2 q H 2 + n 3 q N 2 + n 4 q O 1 2 + n 5 q S 1 2 ) - - - ( 3 )
Wherein, described general molecular formula is C N1H N2N N3O N4S N5, n1~n5 is that each atom is formed number in the described general molecular formula of described fragmention, M be the quality from the fragmention of described general molecular formula acquisition, T 1And T 2Be respectively the first isotope fragmention that obtains from described general molecular formula and the second isotope fragmention relative abundance about single isotope fragmention, the row that V is made up of each atoms of elements amount in the described general molecular formula is vectorial, X=[n1, n2, n3, n4, n5] Tq C, q HAnd q NIt is respectively occurring in nature 13C with respect to 12C, D with respect to H, 14N with respect to 15The relative abundance of N, q 01And q 02It then is respectively occurring in nature 17O with respect to 18O, 18O with respect to 16The relative abundance of O, q S1And q S2It is occurring in nature 33S with respect to 32S, 34S with respect to 32The relative abundance of S.
2. the method for predicting the ionic molecule formula with the isotopic peak of fragmention in the tandem mass spectrum according to claim 1, it is characterized in that, with the coupling mark of the Euclidean distance E between the isotopic distribution vector eIPV of theoretical isotopic distribution vector tIPV and experiment as tIPV and eIPV:
E = δ m 2 + δ 1 2 + δ 2 2 = ( M - M e ) 2 + ( T 1 - I 1 ) 2 + ( T 2 - I 2 ) 2 - - - ( 4 )
3. the method for predicting the ionic molecule formula with the isotopic peak of fragmention in the tandem mass spectrum according to claim 1 and 2, it is characterized in that, in described step 4), also comprise with making the molecular formula of acquisition meet the chemical rule of chemical sense as the minimized constraint condition of described Euclidean distance.
4. the method for predicting the ionic molecule formula with the isotopic peak of fragmention in the tandem mass spectrum according to claim 1, it is characterized in that Local Search described in the step 5) is meant the candidate molecules formula of searching for the nonnegative integer territory in there is the scope of a distance in the molecular formula with described real number field.
5. the method for using the isotopic peak prediction ionic molecule formula of fragmention in the tandem mass spectrum according to claim 1 is characterized in that, also comprises the step that the candidate molecules formula in described nonnegative integer territory is filtered in step 5).
6. the method for predicting the ionic molecule formula with the isotopic peak of fragmention in the tandem mass spectrum according to claim 5, it is characterized in that, described filtration step comprises average isotopic distribution mode method, and this method is with the isotopic quality of theoretic list of fragmention, single isotope of fragmention and the candidate molecules formula that the statistical relationship between its at least one abundance ratio of isotopes filters described nonnegative integer territory.
7. the method for predicting the ionic molecule formula with the isotopic peak of fragmention in the tandem mass spectrum according to claim 5, it is characterized in that described filtration step comprises that the chemical rule constraint condition that meets chemical sense with the molecular formula that makes acquisition filters the candidate molecules formula in described nonnegative integer territory.
8. the method for predicting the ionic molecule formula with the isotopic peak of fragmention in the tandem mass spectrum according to claim 5, it is characterized in that described filtration step comprises with the candidate molecules formula in the nonnegative integer territory of two fragmentions and carries out the candidate molecules formula of cross validation with the nonnegative integer territory of filtering described two fragmentions.
CNB2004100908060A 2004-11-12 2004-11-12 Method for predicting ion molecular formula utilizing fragmental ion is otopic peak in tandem mass-spectrum Expired - Fee Related CN100390537C (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CNB2004100908060A CN100390537C (en) 2004-11-12 2004-11-12 Method for predicting ion molecular formula utilizing fragmental ion is otopic peak in tandem mass-spectrum

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CNB2004100908060A CN100390537C (en) 2004-11-12 2004-11-12 Method for predicting ion molecular formula utilizing fragmental ion is otopic peak in tandem mass-spectrum

Publications (2)

Publication Number Publication Date
CN1773276A CN1773276A (en) 2006-05-17
CN100390537C true CN100390537C (en) 2008-05-28

Family

ID=36760343

Family Applications (1)

Application Number Title Priority Date Filing Date
CNB2004100908060A Expired - Fee Related CN100390537C (en) 2004-11-12 2004-11-12 Method for predicting ion molecular formula utilizing fragmental ion is otopic peak in tandem mass-spectrum

Country Status (1)

Country Link
CN (1) CN100390537C (en)

Families Citing this family (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CA2810473C (en) * 2010-09-15 2018-06-26 Dh Technologies Development Pte. Ltd. Data independent acquisition of product ion spectra and reference spectra library matching
CN102445544B (en) * 2010-10-15 2013-10-30 中国科学院计算技术研究所 Method and system for increasing judgment accuracy of monoisotopic peaks
EP2741078A4 (en) * 2011-08-03 2015-03-11 Shimadzu Corp Method and equipment for mass spectrometry data analysis
CN103389335A (en) * 2012-05-11 2013-11-13 中国科学院大连化学物理研究所 Analysis device and method for identifying biomacromolecules
CN103792275A (en) * 2013-09-24 2014-05-14 中国科学院成都生物研究所 High-resolution mass spectrum accurate molecular formula forecasting method
EP3293754A1 (en) 2016-09-09 2018-03-14 Thermo Fisher Scientific (Bremen) GmbH Method for identification of the monoisotopic mass of species of molecules
US10615015B2 (en) * 2017-02-23 2020-04-07 Thermo Fisher Scientific (Bremen) Gmbh Method for identification of the elemental composition of species of molecules
CN111089928A (en) * 2020-01-16 2020-05-01 贵州理工学院 Method, system, device and medium for analyzing mass spectrum ion peak of organic matter
CN111524549B (en) * 2020-03-31 2023-04-25 中国科学院计算技术研究所 Integral protein identification method based on ion index
CN113514531B (en) * 2021-04-27 2022-10-25 清华大学 Fragment ion prediction method and application of compound
CN115439752B (en) * 2022-09-22 2023-04-18 上海市环境科学研究院 Method for identifying atmospheric organic species, computer device and storage medium

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1251615A (en) * 1997-02-14 2000-04-26 乔治华盛顿大学 Assay for measurement of DNA synthesis rates
US6391649B1 (en) * 1999-05-04 2002-05-21 The Rockefeller University Method for the comparative quantitative analysis of proteins and other biological material by isotopic labeling and mass spectroscopy
US6537432B1 (en) * 1998-02-24 2003-03-25 Target Discovery, Inc. Protein separation via multidimensional electrophoresis

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1251615A (en) * 1997-02-14 2000-04-26 乔治华盛顿大学 Assay for measurement of DNA synthesis rates
US6537432B1 (en) * 1998-02-24 2003-03-25 Target Discovery, Inc. Protein separation via multidimensional electrophoresis
US6391649B1 (en) * 1999-05-04 2002-05-21 The Rockefeller University Method for the comparative quantitative analysis of proteins and other biological material by isotopic labeling and mass spectroscopy

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
有机质谱中应用计算机推算分子式的方法. 苏跃增等.西北师范大学学报(自然科学版),第33卷第3期. 1997 *
计算机据低分辨率质谱数据推算分子式的方法研究. 苏跃增等.新疆师范大学学报(自然科学版),第16卷第1期. 1997 *

Also Published As

Publication number Publication date
CN1773276A (en) 2006-05-17

Similar Documents

Publication Publication Date Title
US9312110B2 (en) System and method for grouping precursor and fragment ions using selected ion chromatograms
Hernandez et al. Automated protein identification by tandem mass spectrometry: issues and strategies
CN100390537C (en) Method for predicting ion molecular formula utilizing fragmental ion is otopic peak in tandem mass-spectrum
CN101871945B (en) Spectrum library generating method and spectrogram identifying method of tandem mass spectrometry
US8193485B2 (en) Method and apparatus for identifying proteins in mixtures
CN103698447B (en) A kind of method utilizing energetic encounter to induce the cracked technical appraisement albumen of ionization
CN104034792B (en) Secondary protein mass spectrum identification method based on mass-to-charge ratio error recognition capability
CN108140060A (en) For handling the technology of mass spectrometric data
JP2010117377A (en) Mass spectrometer system
JP2005091344A (en) Mass spectrometry system
US8694264B2 (en) Mass spectrometry system
CN101055558B (en) Mass spectrum effective peak selection method based on data isotope mode
CN104182658A (en) Tandem mass spectrogram identification method
JP2008281411A (en) Protein database retrieval method and recording medium
Zou et al. Charge state determination of peptide tandem mass spectra using support vector machine (SVM)
Datta et al. Spectrum fusion: using multiple mass spectra for de novo peptide sequencing
Zhang et al. PeakSelect: preprocessing tandem mass spectra for better peptide identification
US10825672B2 (en) Techniques for mass analyzing a complex sample based on nominal mass and mass defect information
Yuan et al. Features‐based deisotoping method for tandem mass spectra
AU2004300084A1 (en) Protein analysis method
CN115436347A (en) Physicochemical property scoring for structure identification in ion spectroscopy
US11600359B2 (en) Methods and systems for analysis of mass spectrometry data
Kouvaraki Chatzilampou A new software platform for peptide and protein mass spectra interpretation
Oh et al. Peptide identification by tandem mass spectra: an efficient parallel searching
Lee et al. Comparative Analysis of Disulfide Bond Determination Using Computational-Predictive Methods and Mass Spectrometry-Based Algorithmic Approach

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant
CF01 Termination of patent right due to non-payment of annual fee
CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20080528

Termination date: 20201112