WO2015110018A1 - Detection device and detection method for detecting pre-disease state - Google Patents

Detection device and detection method for detecting pre-disease state Download PDF

Info

Publication number
WO2015110018A1
WO2015110018A1 PCT/CN2015/071237 CN2015071237W WO2015110018A1 WO 2015110018 A1 WO2015110018 A1 WO 2015110018A1 CN 2015071237 W CN2015071237 W CN 2015071237W WO 2015110018 A1 WO2015110018 A1 WO 2015110018A1
Authority
WO
WIPO (PCT)
Prior art keywords
index
dnb
sample data
distance
disease
Prior art date
Application number
PCT/CN2015/071237
Other languages
French (fr)
Chinese (zh)
Inventor
陈洛南
刘锐
合原一幸
Original Assignee
中国科学院上海生命科学研究院
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 中国科学院上海生命科学研究院 filed Critical 中国科学院上海生命科学研究院
Publication of WO2015110018A1 publication Critical patent/WO2015110018A1/en

Links

Images

Classifications

    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B5/00ICT specially adapted for modelling or simulations in systems biology, e.g. gene-regulatory networks, protein interaction networks or metabolic networks
    • G16B5/20Probabilistic models
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B20/00ICT specially adapted for functional genomics or proteomics, e.g. genotype-phenotype associations
    • G16B20/20Allele or variant detection, e.g. single nucleotide polymorphism [SNP] detection
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B20/00ICT specially adapted for functional genomics or proteomics, e.g. genotype-phenotype associations
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B40/00ICT specially adapted for biostatistics; ICT specially adapted for bioinformatics-related machine learning or data mining, e.g. knowledge discovery or pattern finding
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B5/00ICT specially adapted for modelling or simulations in systems biology, e.g. gene-regulatory networks, protein interaction networks or metabolic networks

Definitions

  • the present invention relates to a detecting device and a detecting method for detecting a pre-disease state.
  • Some diseases are relatively flat, such as chronic inflammation, which can usually be controlled by drug intervention and health care; but many diseases have sudden deterioration, such as liver cancer, its condition The deterioration is very fast, there is generally no discomfort before the onset of the disease, and once the symptoms appear to go to the hospital for treatment, often the patient is in the middle and late stage, and the survival time after the onset is not much.
  • This type of disease with a sudden deterioration of the disease has a very similar feature, that is, there is a "critical point" or a key node in the course of the disease.
  • Non-Patent Documents 1-5 Before the arrival of this critical point, the condition is not particularly obvious, which often causes the patient to ignore the condition and delay the optimal timing of treatment; and after the critical point, the condition does not develop smoothly. However, it suddenly deteriorates from the stable period in a short period of time and becomes a serious illness. It is for this reason that the diagnosis of such diseases is often not timely, which makes the treatment in the critical illness period difficult, the curative effect is poor, and the survival time after the onset is short, so it is very harmful.
  • the "pre-disease state” is a critical state before the "critical point” of disease progression reaches.
  • Appropriate treatment at this stage can return the disease to the "normal state”, so it is called the reversible phase.
  • the treatment is very difficult, and it is difficult to return the disease to a relatively normal state, so it is called the irreversible phase.
  • Pre-disease state The period is the key time node, the molecules that drive the disease state are the key factors, and their regulatory networks are also the key network leading to rapid disease deterioration.
  • early prediction and diagnosis of the pre-existing disease state is particularly important in the development of the disease, which is the last chance for patients with many diseases to be effectively controlled.
  • the normal state is not significantly different from the pre-disease state. Therefore, for many complex diseases, early prediction or pre-diagnosis is a very difficult problem, and there is no effective method.
  • the increasingly mature high-throughput bio-big data provides a valuable opportunity to fully understand biological processes and their anomalous mechanisms.
  • We can conduct more extensive research on the pathological processes of complex diseases especially by developing new theories and new methods based on biological big data to identify early warning signals (ie key time nodes or pre-disease states) of complex disease pathological processes. Identify key factors in disease development and extract key networks. This not only clarifies the molecular mechanism of the development of complex diseases, but also helps fight complex diseases and provides new methods and potential drug targets for the prevention, diagnosis and treatment of complex diseases.
  • Adipocyte differentiation is such a process.
  • a pluripotent stem cell maintains the potential to differentiate into a variety of cells before becoming a "pre-adipocyte". Once it becomes a pre-adipocyte, it undergoes rapid clonal expansion and subsequent terminal differentiation, thereby producing mature adipocytes.
  • the system gradually transforms from a normal state to a pre-disease state, and then the disease progresses further and rapidly develops into an early state or a disease state. In general, this dramatic change can be described as a bifurcation from a mathematical point of view. Therefore, how to detect key nodes and their key factors from small samples has very important scientific significance in the biological and medical fields.
  • Pre-disease state when the system is in a normal state, if it is continuously driven by external stimuli or some internal factors, then the system enters the pre-disease state, which is a critical stage before the mutation point of disease deterioration arrives (actually normal) a limit of state).
  • the system at this stage is very sensitive to external disturbances. Appropriate treatment can return the disease to a relatively normal period, but if it is not treated in time, the disease can easily cross the mutation point to reach the disease stage ( Figure 1, Figure 3).
  • the state of the disease which indicates that the condition has deteriorated into a critical illness, or chronic inflammation has been malignantly transformed into cancer.
  • the system is in a steady state again.
  • the treatment is very difficult, and it is difficult to return the disease to a relatively normal state (Fig. 1, Fig. 3).
  • Fig. 1 shows that the three stages of complex disease development experience normal state, pre-disease state and disease state, respectively, and (b) is normal state, which is the state in which the system is at a local minimum. During this period, the system is in a stable state and gradually or steadily changes. The system in this state has strong resistance to external interference, and (c) is the pre-disease state, which is a critical state and is relatively normal.
  • the limit that is, is a state before the upcoming fierce transition. This state is still reversible and can be returned to normal under appropriate system parameter disturbances.
  • the system in this state has higher potential energy, so the system is sensitive to external interference when the system is in this state, the external disturbance can drive the system to enter the disease state beyond the critical point, and (d) is the disease state, which is another stable state, the system (e) shows a network at a normal state, (e) shows a network at a normal state, where the color of the node represents the degree to which the gene expression deviates from the mean, the edge represents the correlation between the two genes, and (f) shows a network in a pre-disease state in which there is a set of genes (Z1, Z2, Z3) in the network.
  • the expression deviates from the mean value, and there is a strong correlation between the genes in this group, and the correlation with other genes becomes weak.
  • the first difficulty is that the pre-disease state corresponds to the state where the system parameters are close to the critical point. At this time, the system does not undergo a phase change, so the state of the system does not change significantly compared with the normal state. Therefore, it is a very difficult nonlinear problem to accurately predict the early stage of malignant transformation.
  • the second difficulty comes from the complex disease itself, because many complex diseases are the result of a combination of factors such as gene level, transcription level, and protein level. Therefore, although some research on these complex diseases has made some progress.
  • Non-Patent Document 6 A method of detecting a candidate marker of a biomarker for an early warning signal of a pre-existing state before a state transition. According to this method, early detection of disease can be achieved by detecting a dynamic network marker (DNB) that occurs immediately after a disease state is transferred.
  • DNB dynamic network marker
  • Non-Patent Document 6 can only perform early prediction of diseases for multi-sample data.
  • Early predictions of complex diseases often face difficulties in data collection, that is, the study of complex diseases, especially for the clinical application of most complex diseases, cannot be sampled for long periods of time, because people do not Go to the hospital frequently before the body feels really unwell. Therefore, the early prediction of the malignant transformation of complex diseases or the diagnosis of the "pre-disease state" is a complex nonlinear problem that can only be realized based on small sample data or even single-sample data. In this case, the above conventional DNB-based prediction method cannot be used because there is only one sample.
  • Non-Patent Document 1 Venegas, J.G., et al., "Self-organized plaque in asthma as a prelude to catastrophe", UK, Nature, Nature Publishing Group, 2005, Vol. 434, pp. 777-782
  • Non-Patent Document 2 McSharry, PE, Smith, LA, Tarassenko, L, "Predicting seizures: Is it appropriate to use a nonlinear method", UK, Nature Medicine, Nature Publishing Group, 2003, Vol. 9 , pp. 241-242
  • Non-Patent Document 3 Roberto, PB, Eliseo, G., Josef, C., "Conversion Model for Evaluating the Change Point of Logistic Regression", USA, Medical Statistics, Wiley Blackwell Publishers, 2003, Vol. 22, pp. 1141-1162
  • Non-Patent Document 4 Paek, S., et al., "Hearing preservation after gamma knife surgery on vestibular schwannomas", USA, Cancer, Wiley Blackwell, 2005, Vol. 1040 , pp. 580-590
  • Non-Patent Document 5 Liu, J.K., Rovit, R.L., Canwell, W.T., "Pituitary Stroke", USA, Proceedings of Neurosurgery, Thieme Press, 2001, Vol. 12, pp. 315-320
  • Non-Patent Document 6 Chen Luonan, Liu Rui, Liu Zhiping, Li Meiyi, and He Yuanyi, “Detecting Early Warning Signals for Sudden Deterioration of Complex Diseases by Dynamic Network Markers", Science Report, March 29, 2012 Day, Internet (website: http://www.natureasia.com/ja-jp/srep/abstracts/35129)
  • the present invention has been completed in view of the above problems, and based on the DNB theory, a detection device and a detection method for predicting a malignant mutation of a complex disease based on a single sample (high-throughput data) have been developed, and the DNB theory can be effectively applied to a single-sample case. It is impossible to do with traditional prediction methods.
  • a first aspect of the present invention provides a detecting apparatus for detecting a pre-disease state, comprising: a sample acquiring unit that acquires collation sample data and single-sample data of a detection object; and a DNB setting unit, The DNB setting unit sets a DNB member; an index calculating unit that obtains a composite index by calculating a distance between a probability distribution of the reference sample data and the single sample data; and a state discriminating unit that When the composite index exceeds the threshold, it is determined that the test subject is in the pre-disease state.
  • the index calculation unit includes: a first index calculation unit that uses a distance between the comparison sample data and a probability distribution of the DNB members in the one-sample data as the first index; 2 index calculation unit, the second index calculation unit compares the distance between the probability distributions of the DNB members and the non-DNB members in the sample data as the second index; the third index calculation unit, the third index calculation unit compares the sample data And a distance between the probability distributions of the non-DNB members in the single sample data as the third index; and a correction value setting unit that sets the correction value, the index calculation unit is based on the first index, the first The 2 index, the 3rd index, and the correction value are combined.
  • the index calculation unit uses a product of the first index, the second index, and the reciprocal of the sum of the third index and the correction value as a composite index.
  • the distance between the probability distributions is a KL distance.
  • the correction value is a positive number equal to or less than one.
  • the correction value is 0.01.
  • an output unit is further included, which outputs a probability distribution of the DNB members in an analog display manner.
  • a first aspect of the present invention provides a detection method for detecting a pre-disease state, comprising the steps of: a sample acquisition step of acquiring comparison sample data and single sample data of a detection object; a DNB setting step of setting a DNB a member; an index calculation step of obtaining a composite index by calculating a distance between a probability distribution of the reference sample data and the single sample data; and a state discriminating step of determining that the detected object is in a pre-disease state when the composite index exceeds the threshold.
  • the present invention provides a detection device and a detection method for predicting a malignant mutation of a complex disease based on a single sample (high-throughput data), and can effectively apply the DNB theory to a single-sample situation, which can be simple and accurate.
  • the pre-disease state was detected.
  • Figure 1 is a schematic diagram showing three stages of the development of a complex disease.
  • Fig. 2 is a block diagram showing the configuration of a detecting device of the present invention.
  • FIG. 3 is a schematic diagram showing mutations predicting complex diseases based on single sample data in accordance with an embodiment of the present invention.
  • FIG. 4 is a schematic diagram showing numerical simulation results for verifying the correctness and reliability of an embodiment of the present invention.
  • FIG. 5 is a schematic diagram showing an example of detecting according to an embodiment of the present invention.
  • FIG. 6 is a schematic diagram showing another example of detecting according to an embodiment of the present invention.
  • a first aspect of the invention is a detection device for detecting a pre-disease state.
  • Fig. 3 is a block diagram showing the configuration of a detecting device of the present invention. Unlike the multi-samples in the prior art, single-sample (high-throughput data) is utilized in the present embodiment to predict mutations in complex diseases.
  • detection The apparatus 1 includes: a sample acquisition unit 10 that acquires comparison sample data and single sample data of the detection object; a DNB setting unit 11 that sets a DNB member; and an index calculation unit that calculates a probability distribution between the comparison sample data and the single sample data The distance is obtained to obtain a composite index; and the state discriminating unit 12 determines that the detected object is in the pre-disease state when the composite index exceeds the threshold.
  • the control sample data and the single sample data refer to the two sets of control and case samples, in which the control group (control group) contains a large number of samples, and the case group contains only one sample (single sample).
  • the setting method of the DNB member is well known, and the setting method described in Non-Patent Document 6 can be used, and will not be described herein.
  • the index calculation unit includes: a first index calculation unit 13 that takes the distance between the comparison sample data and the probability distribution of the DNB members in the one-sample data as the first index; the second index calculation unit 14 compares the sample data The distance between the probability distributions of the middle DNB members and the non-DNB members is taken as the second index; the third index calculating unit 15 takes the distance between the comparison sample data and the probability distribution of the non-DNB members in the single sample data as the third index; The correction value setting unit 16 sets the correction value, and the index calculation unit obtains the composite index based on the first index, the second index, the third index, and the correction value. It should be understood that it is also possible to use a single indicator calculation unit to calculate the distance between the probability distribution of the control sample data and the single sample data, and subdividing into the above four units is only an alternative.
  • the composite index may be a product of the first index, the second index, and a reciprocal of the sum of the third index and the correction value.
  • FIG. 3 is a schematic diagram showing mutations predicting complex diseases based on single sample data in accordance with an embodiment of the present invention. Since it is necessary to estimate the degree of similarity of the distribution of the two sets of samples, that is, to calculate the probability distance of the distribution of the two sets of samples, in the present invention, no matter which method of calculating the probability distance is used, The present invention is implemented, such as histogram intersection distance, chi-square test, quadratic distance, matching distance, Kolmogorov-Smirnov test (KS distance), earth moving distance, KL distance, and the like.
  • KS distance Kolmogorov-Smirnov test
  • K-L distance is also called relative entropy, KL divergence, information divergence, information gain, and is two probability points.
  • the significance is: in the same event space, the event space of the probability distribution P(k), if encoded by the probability distribution Q(k), averages how many bits per basic event (symbol) code length is increased.
  • K-L distance The Kullback-Leibler distance (K-L distance) is defined as:
  • This composite index I is called DNB-S score, where the KL distance between the Case data and the DNB members in the control data (D KL (case DNB , control DNB )) corresponds to the above-mentioned first index, the DNB member in the Case data.
  • the KL distance between the non-DNB member and the non-DNB member corresponds to the KL distance between the non-DNB members in the second index
  • Case data, and control data corresponds to the above third index
  • is a small positive number, in order to avoid the occurrence of the zero denominator correction value, taken as 0 to 1, preferably 0.01.
  • the composite index is the product of the first index, the second index, and the reciprocal of the sum of the third index and the correction value, but this design is only an example, as long as it is based on the first index, the first The 2 index, and the 3rd index and the correction value can also be designed in other ways.
  • (a) shows that the development of a complex disease is divided into three states
  • (b) shows the basis for a one-sample prediction of the disease system, that is, based on a case sample, many control samples (control group Samples, such as samples of healthy people), existing DNB networks.
  • control group Samples such as samples of healthy people
  • These samples may be, for example, gene expressions, based on the DNB theory, whether the case sample is in the pre-disease state, and (c) showing the distribution obtained using the KL distance, which can be seen in the case data when the system is in the pre-disease state.
  • the DNB biomolecules have a bimodal distribution, while the DNB biomolecules in the control data have a unimodal distribution.
  • the non-DNB molecules in the case data have a unimodal distribution, which makes the single case sample taken from the pre-disease state.
  • the DNB-S score When the system is in a normal state or disease state, the DNB-S score has a lower value, and when the system is in the pre-disease state, the DNB-S score is at a higher value, which can provide a reliable signal for early warning of system mutation.
  • the DNB-S score has the following characteristics:
  • the KL distance (D KL (case DNB , control DNB )) between the DKB members in the Case data and the control data is increased because the distribution of biomolecules in the case data and the control data in the DNB is significantly different, that is, in the case data.
  • the DNB molecule exhibits a bimodal distribution, while the DNB molecules in the control data exhibit a unimodal distribution (see Figure 3).
  • the KL distance (D KL (case DNB , case non-DNB )) between the DNB member and the non-DNB member in the Case data increases, because even in the same case data, the biomolecule in the DNB exhibits a bimodal distribution in the case, and The non-DNB molecule exhibited a unimodal distribution (see Figure 3).
  • the third item in the DNB-S score has the following properties:
  • the first two items can be used to distinguish whether a single sample is in a normal state or a pre-disease state
  • the third item can be used to distinguish whether a single sample is in a pre-disease state or in a disease state. status.
  • the state discriminating unit 12 can issue an early warning signal.
  • the detecting means may further include an output unit (not shown in Fig. 2) for outputting the probability distribution of the DNB members in an analog display manner (refer to (c) in Fig. 3).
  • a second aspect of the invention is a detection method for detecting a pre-disease state, the detection method comprising the following steps.
  • the comparison sample data and the single sample data of the detection object are acquired.
  • the sample acquiring unit 10 performs this step.
  • the DNB setting unit 11 performs this step.
  • the composite index is obtained by calculating the distance between the probability distribution of the control sample data and the single sample data.
  • the index calculating unit performs the step.
  • the state discriminating unit 12 performs this step.
  • the detection means of the first aspect of the present invention may be similar to the composite index, and the indicator calculation step further includes: a first index calculation step, and the control sample data and the single sample data in the DNB The distance between the probability distributions of the members is taken as the first index; the second index calculation step is to use the distance between the probability distributions of the DNB members and the non-DNB members in the comparison sample data as the second index; the third index calculation step, which will be compared The distance between the sample data and the probability distribution of the non-DNB members in the single sample data is taken as the third index; and the correction value setting step is performed to set the correction value.
  • the composite index is obtained based on the first index, the second index, the third index, and the correction value.
  • the product of the first index, the second index, and the reciprocal of the sum of the third index and the correction value is used as a composite index.
  • the present invention can be implemented regardless of the method of calculating the probability distance, but it is preferable to perform calculation using the K-L distance.
  • the composite index I designed by the K-L distance is also the same as the detecting device in the first aspect of the present invention described above, and will not be described herein.
  • FIG. 4 is a schematic diagram showing numerical simulation results for verifying the correctness and reliability of an embodiment of the present invention.
  • the range of the parameter P is -0.1 to 0.32, and ⁇ i(t) is white noise.
  • We use the Euler method to simulate the process from the normal state (P>0) to the critical point (P 0) and into the disease state (P ⁇ 0), as shown in Figure 4. This numerical simulation demonstrates the correctness and reliability of the composite index I, that is, the DNB-S score set in the above embodiment of the present invention.
  • Figure 4(a) shows a 16-dimensional Mie network in which nodes 1-7 belong to DNB members, nodes 9-16 belong to non-DNB members, and (b) show predictions of 5 sets of single-sample data.
  • an accurate warning signal can be found, that is, the five groups of BS scores all rise significantly, exceeding the threshold, and (c) shows the analog display.
  • the DNB members (nodes 1-7) in the case data have significantly different distributions, ie, the DNBs in the case data exhibit a bimodal distribution (both sides), while the DNBs in the control data exhibit a unimodal distribution (central), (d ) shows that the simulation shows that even though they are in the case data, the DNB members (nodes 1-7) and the non-DNB members (nodes 8-16) have significantly different distributions, ie the DNB members exhibit a bimodal distribution (both sides), Non-DNB members exhibit a unimodal distribution (central).
  • the detection device and the detection method of the present invention that is, the validity and accuracy of the composite index I, that is, the DNB-S score, are tested using real clinical data or test data.
  • the specific use of the DNB-S score is based on published data: (1) clinical trial data on the incidence of influenza after injection of H3N2 virus in an individual (GSE30550); (2) acute injury to the lungs in mice exposed to toxic gases Gene data (GSE2565).
  • FIG. 5 is a schematic diagram showing an example of detecting according to an embodiment of the present invention.
  • DNB has been successfully applied in a single-sample early detection of a particular disease, a clinical trial of individual morbidity after injection of a cold virus (high-throughput test data GSE30550).
  • the data of 17 individuals were tested by applying the DNB-S score (each individual had only one sample at one sampling time point), and 9 of them developed symptomatic subjects after the injection of the cold virus. Eight individuals did not develop cold symptoms (asymptomatic subjects) after the injection of the cold virus.
  • in (a) 9 individuals with morbidity were examined and passed before each individual became ill.
  • the DNB-S score detected that it was in the pre-disease state (beyond the threshold) and signaled an early warning; in (b), eight individuals who did not develop the disease and found no significant change in the DNB-S score (not Exceeded the threshold). Therefore, it was demonstrated that the DNB-S score is valid and accurate for this clinical trial data. Specifically, the 9 affected individuals were tested before the 7th sampling time point (36 hours), and in the trial, all the affected individuals were at the 8th sampling time point (45 hours) or after. The symptoms were detected (see Figure 5). For the 8 non-infected individuals, the pre-disease status was not detected by the DNB-S score, so no warning signal was issued.
  • the virus did not develop lesions (asymptomatic), their data was insensitive to the DNB-S score (not exceeding the threshold) ((b)), and (c) was the dynamic network mapped to the first individual (1st subject) Development map, using the existing PPI network, to map the data to the network, wherein the color of the node represents the fluctuation of the expression of the biomolecule, and the molecules belonging to the DNB members are specially arranged in the lower left corner, and can be seen Under normal conditions (0-12 hours), there is no big change in the network structure of the system, but in the pre-disease state (36 hours), the network structure of the system (especially the DNB members in the lower left corner) changes greatly.
  • the signal (d) is a table of information on the onset and morbidity of 17 individuals based on clinical data, the time of onset, and the time point at which the disease was detected using the DNB-S score. It can be seen that the DNB-S score is indeed before the onset of the individual. An early warning signal is issued to accurately detect the pre-existing disease state, and there is no response to the individual who does not develop the disease.
  • FIG. 6 is a schematic diagram showing another example of detecting according to an embodiment of the present invention.
  • the DNB-S score was successfully applied in a single-sample early detection of another specific disease, phosgene inhalation acute lung injury (high-throughput test data GSE2565).
  • the data was taken from an acute lung injury test in which six individuals (mouse) were exposed to phosgene. There were a total of 9 sampling time points. At each sampling time point, each individual had a case sample and a Control (control) sample.
  • the data was initially detected for disease using the DNB-S score (Figure 6). As shown in Fig. 6, (a) shows a curve of six individuals using the DNB-S score for testing.
  • DNB-S score did not respond to these 6 test subjects not exposed to phosgene and did not detect any warning signals (not exceeding the threshold).
  • the successful application of DNB-S score in this disease also demonstrates the effectiveness and accuracy of embodiments of the present invention.
  • Fig. 6 the data of acute injury of lungs in 6 individuals were predicted by DNB-S score, and (a) the curves of 6 individuals using the DNB-S score were examined. It was observed that the six test subjects exposed to phosgene detected a pre-existing condition at the fourth time point (4 hours) and sent a clear signal (beyond the threshold), and according to experimental observations, Individuals were found to have symptoms of lung injury at the fifth and sixth time points (8 hours–12 hours). Therefore, the pre-disease status was accurately determined using the DNB-S score, and the system was successfully alerted to acute mutations. ) The curves of the individuals in the 6 control groups using the DNB-S score were shown.
  • the DNB-S score did not respond to the 6 test subjects not exposed to phosgene, and no warning was detected.
  • the signal (not exceeding the threshold), the successful application of the DNB-S score on the disease demonstrates the effectiveness and accuracy of embodiments of the present invention.
  • the present invention effectively applies the DNB theory to a single sample case, not only the structure and the square.
  • the method is simple and efficient and accurate.

Abstract

A detection device (1) and detection method for detecting a pre-disease state. The device (1) comprises a sample acquisition unit (10), wherein the sample acquisition unit (10) acquires comparison sample data and single sample data of a detection object; a DNB setting unit (11), wherein the DNB setting unit (11) sets DNB members; index calculation units (13, 14, 15, 16), wherein the index calculation units (13, 14, 15, 16) obtain composite indexes by calculating the distance between the probability distribution of the comparison sample data and the single sample data; and a state judgement unit (12), wherein the state judgement unit (12) judges that the detection object is in a pre-disease state when the composite indexes exceed a threshold value. Therefore, the DNB theory is effectively applied to single sample conditions, so that the pre-disease state can be simply and accurately detected.

Description

用于对前疾病状态进行检测的检测装置及检测方法Detection device and detection method for detecting pre-disease state 技术领域Technical field
本发明涉及一种用于对前疾病状态进行检测的检测装置及检测方法。The present invention relates to a detecting device and a detecting method for detecting a pre-disease state.
背景技术Background technique
现代社会随着环境污染、人口的增加及生活节奏的不断加快,人们的压力日渐增大,许多人在压力下形成了不良的饮食习惯及生活习惯。在这种情况下,患上各种复杂疾病(癌症、糖尿病、心脑血管疾病等)的人数增多,例如患肝部恶性肿瘤和患糖尿病等疾病的人数不断上升。在这些复杂疾病中,有一部分疾病的病情发展相对平缓,如慢性炎症,这类疾病通常可以通过药物干涉和保健手段得到一定地控制;但很多疾病却具有突然恶化的现象,例如肝癌,其病情恶化很快,发病之前一般没有什么不适,而一旦出现了症状去医院就诊,往往患者已属于中晚期,发病后生存时间也已不多。这一类具有病情突然恶化现象的疾病都有一个很相似的特点,即在病程变化中存在一个“临界点”(critical point)或关键节点。(参照非专利文献1-5)在该临界点到来之前,病情不是特别的明显,这往往使得患者忽视了病情,耽误了治疗的最佳时机;而在临界点之后,病情就不是平缓地发展,而是在很短的时间内从稳定期突然恶化而成为重病期。正是由于这个原因,对这类疾病的确诊常常不及时,使得在重病期的治疗难度大、疗效差,发病后生存时间短,因此具有很大的危害性。如何及时地在早期诊断这类复杂疾病,关键在于找到疾病突然恶化前的预警特征或信号,预测“临界点”和突然恶化现象发生的条件,这已经成为了生物理论和临床医学研究上的一个热点问题。With the environmental pollution, the increase of population and the accelerating pace of life, people's pressure is increasing day by day. Many people have formed bad eating habits and living habits under pressure. In this case, the number of people suffering from various complicated diseases (cancer, diabetes, cardiovascular and cerebrovascular diseases, etc.) has increased, such as the number of people suffering from liver malignant tumors and diseases such as diabetes. Among these complex diseases, some diseases are relatively flat, such as chronic inflammation, which can usually be controlled by drug intervention and health care; but many diseases have sudden deterioration, such as liver cancer, its condition The deterioration is very fast, there is generally no discomfort before the onset of the disease, and once the symptoms appear to go to the hospital for treatment, often the patient is in the middle and late stage, and the survival time after the onset is not much. This type of disease with a sudden deterioration of the disease has a very similar feature, that is, there is a "critical point" or a key node in the course of the disease. (Refer to Non-Patent Documents 1-5) Before the arrival of this critical point, the condition is not particularly obvious, which often causes the patient to ignore the condition and delay the optimal timing of treatment; and after the critical point, the condition does not develop smoothly. However, it suddenly deteriorates from the stable period in a short period of time and becomes a serious illness. It is for this reason that the diagnosis of such diseases is often not timely, which makes the treatment in the critical illness period difficult, the curative effect is poor, and the survival time after the onset is short, so it is very harmful. The key to timely diagnosis of such complex diseases in a timely manner is to find early warning characteristics or signals before the sudden deterioration of the disease, and to predict the conditions of "critical point" and sudden deterioration, which has become a study in biological theory and clinical medical research. Hot Issues.
如图3中的(a)所示,一般来说,“前疾病状态”(pre-disease state)是疾病恶化的“临界点”到达之前一个临界状态。在该阶段适当的治疗可以使疾病重新恢复到“正常状态”(normal state),故称为可逆阶段。但当疾病的进展一旦越过临界点迅速到达“疾病状态”(disease state)时,治疗的难度非常大,很难再使病情回到相对正常状态,故称为非可逆阶段。因此,前疾病状态 的期间是关键时间节点,驱动前疾病状态的分子是关键因子,它们的调控网络也是导致疾病快速恶化的关键网络。显然,在疾病发生发展中,前疾病状态的早期预测和诊断尤为重要,这是很多疾病患者病情得到有效控制的最后机会。然而,与疾病状态不同,正常状态与前疾病状态并无明显不同,所以,对很多复杂疾病来说,早期预测或诊断前疾病状态是一个非常困难的问题,现在还无有效的方法。但日趋成熟的高通量生物大数据为全面了解生物过程及其异常机制提供了一个宝贵的契机。我们可以更广泛地开展对复杂疾病的病理过程的研究,特别是通过开发基于生物大数据的新理论和新方法,识别复杂疾病病变过程的预警信号(即关键时间节点或前疾病状态),确定表征疾病发展的关键因子,提取关键网络。这不仅可以阐明复杂疾病发生发展的分子机理,还将有助于抗击复杂疾病,并为预防、诊断、治疗复杂疾病提供新方法和潜在药靶。As shown in (a) of FIG. 3, in general, the "pre-disease state" is a critical state before the "critical point" of disease progression reaches. Appropriate treatment at this stage can return the disease to the "normal state", so it is called the reversible phase. But when the disease progresses faster than the critical point and quickly reaches the "disease state", the treatment is very difficult, and it is difficult to return the disease to a relatively normal state, so it is called the irreversible phase. Pre-disease state The period is the key time node, the molecules that drive the disease state are the key factors, and their regulatory networks are also the key network leading to rapid disease deterioration. Obviously, early prediction and diagnosis of the pre-existing disease state is particularly important in the development of the disease, which is the last chance for patients with many diseases to be effectively controlled. However, unlike the disease state, the normal state is not significantly different from the pre-disease state. Therefore, for many complex diseases, early prediction or pre-diagnosis is a very difficult problem, and there is no effective method. But the increasingly mature high-throughput bio-big data provides a valuable opportunity to fully understand biological processes and their anomalous mechanisms. We can conduct more extensive research on the pathological processes of complex diseases, especially by developing new theories and new methods based on biological big data to identify early warning signals (ie key time nodes or pre-disease states) of complex disease pathological processes. Identify key factors in disease development and extract key networks. This not only clarifies the molecular mechanism of the development of complex diseases, but also helps fight complex diseases and provides new methods and potential drug targets for the prevention, diagnosis and treatment of complex diseases.
事实上,不仅仅是复杂疾病过程,在许多生物过程中,如细胞分化、细胞增殖和疾病的进展等过程都涉及“跳跃式的”状态转化,即系统状态的急剧改变或定性变化。脂肪细胞分化就是这样一个过程。一个多能干细胞在成为“前脂肪细胞”以前都保持着分化为多种细胞的潜力,一旦成为前脂肪细胞后就进行急剧的克隆扩增及随后的终端分化,从而产生成熟的脂肪细胞。疾病进展过程也是如此,系统逐渐从一个正常状态转化到前疾病状态,然后病情进一步恶化,急剧发展为疾病早期状态或疾病状态。一般来说,这种急剧的变化从数学的观点来看可以被描述为分叉现象。因此,如何由小样本检测到关键节点及其关键因子在生物和医学领域具有非常重要的科学意义。In fact, not only complex disease processes, but also in many biological processes, such as cell differentiation, cell proliferation, and disease progression, involve "jumping" state transitions, ie, dramatic changes in the state of the system or qualitative changes. Adipocyte differentiation is such a process. A pluripotent stem cell maintains the potential to differentiate into a variety of cells before becoming a "pre-adipocyte". Once it becomes a pre-adipocyte, it undergoes rapid clonal expansion and subsequent terminal differentiation, thereby producing mature adipocytes. The same is true of the disease progression process. The system gradually transforms from a normal state to a pre-disease state, and then the disease progresses further and rapidly develops into an early state or a disease state. In general, this dramatic change can be described as a bifurcation from a mathematical point of view. Therefore, how to detect key nodes and their key factors from small samples has very important scientific significance in the biological and medical fields.
现代医学和生物学的研究成果表明,在生物体的各个器官内,是各个功能模块或生物分子的动态协同作用共同决定了器官的功能和状态,因此,我们把复杂疾病的发展和恶性转化过程可看作是一个复杂动力系统的时间演化动态过程,把影响疾病的外在因素视为动力系统中的参数,把参与疾病演变的分子浓度当作系统中的状态变量,于是疾病的突然恶化现象就对应了系统的突变现象。病程中的关键节点对应着动力系统中参数的临界点,特别是疾病恶性转化的前 期可看作动力系统的临界状态。要获得恶性转化的早期预警信号就成为如何界定“临界点”、如何探测和识别恶性转化早期的生物信号、如何确定复杂动态动力系统是否处于临界状态的问题。The results of modern medical and biological research show that the dynamic synergy of various functional modules or biomolecules in various organs of an organism determines the function and state of organs. Therefore, we develop complex diseases and malignant transformation processes. It can be regarded as a dynamic process of time evolution of a complex dynamic system. The external factors affecting disease are regarded as parameters in the dynamic system, and the molecular concentration involved in disease evolution is regarded as a state variable in the system, and the sudden deterioration of the disease It corresponds to the sudden change of the system. The key nodes in the course of the disease correspond to the critical point of the parameters in the dynamic system, especially before the malignant transformation of the disease The period can be regarded as the critical state of the power system. An early warning signal to obtain a malignant transformation becomes a question of how to define a "tipping point", how to detect and identify biological signals in the early stages of malignant transformation, and how to determine whether a complex dynamic dynamic system is at a critical state.
如图1和图3所示,疾病的发展可划分为以下3个状态。As shown in Figures 1 and 3, the development of the disease can be divided into the following three states.
正常状态,该状态描述正常阶段或病情较疾病期轻微的缓慢变化阶段,包括疾病的潜伏阶段、癌变前的慢性炎症阶段或病情得到有效控制而处于相对健康的阶段,这是一个较为稳定的状态(图1,图3)。Normal state, which describes a normal phase or a slightly slower phase of the disease than the disease period, including the latent stage of the disease, the chronic inflammation stage before the cancer, or the disease is effectively controlled and in a relatively healthy stage, which is a relatively stable state. (Figure 1, Figure 3).
前疾病状态,当系统处于正常状态时,如果持续受到外界刺激或内部某些因素的驱动,那么系统就进入前疾病状态,该状态是疾病恶化的突变点到达之前一个临界阶段(实际上是正常状态的一个极限)。处于该阶段的系统对外界的扰动非常敏感,适当的治疗可以使疾病重新回到相对正常期,但如果没有及时的治疗,疾病就很容易越过突变点到达疾病阶段(图1,图3)。Pre-disease state, when the system is in a normal state, if it is continuously driven by external stimuli or some internal factors, then the system enters the pre-disease state, which is a critical stage before the mutation point of disease deterioration arrives (actually normal) a limit of state). The system at this stage is very sensitive to external disturbances. Appropriate treatment can return the disease to a relatively normal period, but if it is not treated in time, the disease can easily cross the mutation point to reach the disease stage (Figure 1, Figure 3).
疾病状态,该状态代表病情已经恶化成为重病期,或慢性炎症已经恶性转化成为癌症。系统再次处于一个稳定状态。一般来说,当疾病到达这一阶段时,治疗的难度非常大,很难再使病情回到相对正常状态(图1,图3)。The state of the disease, which indicates that the condition has deteriorated into a critical illness, or chronic inflammation has been malignantly transformed into cancer. The system is in a steady state again. In general, when the disease reaches this stage, the treatment is very difficult, and it is difficult to return the disease to a relatively normal state (Fig. 1, Fig. 3).
在图1中,(a)示出了复杂疾病发展的三个阶段分别经历了正常状态,前疾病状态和疾病状态,(b)是正常状态,这是系统处于一个势能局部最低的状态,在此期间,系统是在一个稳定的状态,并逐渐或平稳地改变,处于该状态的系统有较强抵抗外界干扰的能力,(c)是前疾病状态,这是一个临界状态,是相对正常状态的极限,即,是临近即将到来的激烈转变之前的一个状态。此状态仍是可逆的,在适当的系统参数扰动下可以转回正常状态。该状态下的系统具有较高的势能,因此系统处于该状态时对外界干扰很敏感,外界扰动可以驱使系统越过临界点进入疾病状态,(d)是疾病状态,这是另一个稳定状态,系统处于疾病状态时候具有较低的势能,(e)示出了正常状态时候的网络,其中节点的颜色代表基因表达偏离均值的程度,边代表两个基因之间的相关性,(f)示出了前疾病状态下的网络,该状态下的网络中有一组基因(Z1、Z2、Z3) 的表达偏离均值很大,并且这组基因之间有很强的相关性,同时与其他基因的相关性变得很弱,(g)示出了疾病状态下的网络,处于这个状态时,基因表达偏离均值程度又回落为较小,基因两两之间的相关性也变得和正常状态差不多,从(h)中可以看出,在前疾病状态,Z1,Z2,Z3表达震动很剧烈,但是相关性非常高。In Fig. 1, (a) shows that the three stages of complex disease development experience normal state, pre-disease state and disease state, respectively, and (b) is normal state, which is the state in which the system is at a local minimum. During this period, the system is in a stable state and gradually or steadily changes. The system in this state has strong resistance to external interference, and (c) is the pre-disease state, which is a critical state and is relatively normal. The limit, that is, is a state before the upcoming fierce transition. This state is still reversible and can be returned to normal under appropriate system parameter disturbances. The system in this state has higher potential energy, so the system is sensitive to external interference when the system is in this state, the external disturbance can drive the system to enter the disease state beyond the critical point, and (d) is the disease state, which is another stable state, the system (e) shows a network at a normal state, (e) shows a network at a normal state, where the color of the node represents the degree to which the gene expression deviates from the mean, the edge represents the correlation between the two genes, and (f) shows a network in a pre-disease state in which there is a set of genes (Z1, Z2, Z3) in the network. The expression deviates from the mean value, and there is a strong correlation between the genes in this group, and the correlation with other genes becomes weak. (g) shows the network under the disease state. In this state, the gene The degree of deviation from the mean value falls back to a smaller extent, and the correlation between the two genes becomes similar to the normal state. It can be seen from (h) that in the pre-disease state, the Z1, Z2, and Z3 expressions are very intense. But the correlation is very high.
因此,前疾病状态的早期预测和诊断尤为重要,这是很多患者病情得到有效控制的重要机会。然而,对前疾病状态的预测有很多的困难。第一方面的困难是,前疾病状态对应着系统参数接近而未到达临界点的状态,这个时候,系统并没有发生相变,因此与正常状态相比,系统的状态并没有明显的改变。所以,要准确预测恶性转化的前期是一个很困难的非线性问题。第二方面的困难来自于复杂疾病的本身,因为很多复杂疾病都是基因水平、转录水平、蛋白质水平等的众多因素综合作用的结果,因此,尽管人们对这些复杂疾病的研究已经取得了一些进展,但至今还没有对复杂疾病构建起准确可靠的动态模型来刻画和研究恶性转化的现象。第三方面的困难是来自数据的采集方面,对生态系统、金融系统等的研究可以长时间、高密度地采样,但是这种数据采集方式对研究复杂疾病是做不到的,因为人们不会在身体感到真正不适之前频繁地去医院检查。正是基于这几方面的问题,对复杂疾病恶性转化的早期预测或“前疾病状态”的诊断是一个只能基于小样本数据甚至单样本数据来实现的复杂非线性问题。这样的问题十分难以解决,因此以往的绝大部分理论和实验工作都集中在针对“疾病状态”或“疾病早期状态”的研究上。对疾病状态的诊断主要是基于分子生物标志物,例如基因、蛋白质和代谢分子等能够标识疾病表型的因子,并可以通过观测其基因表达或蛋白质表达等区分开正常状态和疾病状态。然而,基于分子生物标志物的预测和诊断方法在处理疾病恶化早期或前疾病状态时无能为力,这是由于前疾病状态仅仅是相对正常状态的一个极限阶段,在表达量等水平上都无法区分出前疾病状态和正常状态。Therefore, early prediction and diagnosis of pre-disease status is particularly important, which is an important opportunity for many patients to be effectively controlled. However, there are many difficulties in predicting the pre-disease state. The first difficulty is that the pre-disease state corresponds to the state where the system parameters are close to the critical point. At this time, the system does not undergo a phase change, so the state of the system does not change significantly compared with the normal state. Therefore, it is a very difficult nonlinear problem to accurately predict the early stage of malignant transformation. The second difficulty comes from the complex disease itself, because many complex diseases are the result of a combination of factors such as gene level, transcription level, and protein level. Therefore, although some research on these complex diseases has made some progress. However, to date, no accurate and reliable dynamic models have been constructed for complex diseases to characterize and study the phenomenon of malignant transformation. The third difficulty is from the collection of data. Research on ecosystems, financial systems, etc. can be sampled for a long time and at high density, but this method of data collection cannot be done for studying complex diseases because people will not Go to the hospital frequently before the body feels really unwell. It is based on these questions that the early prediction of the malignant transformation of complex diseases or the diagnosis of “pre-disease state” is a complex nonlinear problem that can only be realized based on small sample data or even single sample data. Such problems are very difficult to solve, so most of the theoretical and experimental work in the past has focused on research on "disease status" or "early disease status." The diagnosis of the disease state is mainly based on molecular biomarkers, such as genes, proteins and metabolic molecules, which can identify the phenotype of the disease, and can distinguish between normal state and disease state by observing its gene expression or protein expression. However, molecular biomarker-based prediction and diagnostic methods are powerless in dealing with early or pre-existing disease states, since pre-disease states are only a limiting stage of relative normal state, and cannot be distinguished at the level of expression and the like. Disease state and normal state.
对此,本发明的发明人们已经提出了一种对成为示出从正常状态向疾病状 态转移之前的前疾病状态的预警信号的生物标志物的候选标志物进行检测的方法(非专利文献6)。根据该方法,通过对马上就要转移到疾病状态时才会出现的动态网络标志物(DNB)进行检测,能实现疾病的早期预测。In this regard, the inventors of the present invention have proposed a pair to show from a normal state to a disease state A method of detecting a candidate marker of a biomarker for an early warning signal of a pre-existing state before a state transition (Non-Patent Document 6). According to this method, early detection of disease can be achieved by detecting a dynamic network marker (DNB) that occurs immediately after a disease state is transferred.
可是,非专利文献6中记载的这种检测方法只能够针对多样本数据来进行疾病的早期预测。对于复杂疾病的早期预测,常常面对来自数据的采集方面的困难,即,研究复杂疾病特别是对大多数复杂疾病的临床应用方面,做不到长时间、高密度地采样,因为人们不会在身体感到真正不适之前频繁地去医院检查。因此,对复杂疾病恶性转化的早期预测或“前疾病状态”的诊断是一个只能基于小样本数据甚至单样本数据来实现的复杂非线性问题。在这种情况下,由于只有一个样本,无法使用上述传统的基于DNB的预测方法。However, the detection method described in Non-Patent Document 6 can only perform early prediction of diseases for multi-sample data. Early predictions of complex diseases often face difficulties in data collection, that is, the study of complex diseases, especially for the clinical application of most complex diseases, cannot be sampled for long periods of time, because people do not Go to the hospital frequently before the body feels really unwell. Therefore, the early prediction of the malignant transformation of complex diseases or the diagnosis of the "pre-disease state" is a complex nonlinear problem that can only be realized based on small sample data or even single-sample data. In this case, the above conventional DNB-based prediction method cannot be used because there is only one sample.
因此,为了有效地将基于DNB进行检测的方法应用在单样本情形上,需要提供一种新的基于DNB对单样本进行测试的方法。Therefore, in order to effectively apply the method based on DNB detection in a single sample situation, it is necessary to provide a new method for testing a single sample based on DNB.
现有技术current technology
【非专利文献1】Venegas,J.G.等著,“如同灾变的前奏般的哮喘中的自组织斑块”,英国,《自然》,自然出版集团,2005年,第434卷,第777-782页[Non-Patent Document 1] Venegas, J.G., et al., "Self-organized plaque in asthma as a prelude to catastrophe", UK, Nature, Nature Publishing Group, 2005, Vol. 434, pp. 777-782
【非专利文献2】McSharry,P.E.、Smith,L.A.、Tarassenko,L著,“癫痫发作的预测:用非线性方法是否恰当”,英国,《自然医学》,自然出版集团,2003年,第9卷,第241-242页[Non-Patent Document 2] McSharry, PE, Smith, LA, Tarassenko, L, "Predicting seizures: Is it appropriate to use a nonlinear method", UK, Nature Medicine, Nature Publishing Group, 2003, Vol. 9 , pp. 241-242
【非专利文献3】Roberto,P.B.、Eliseo,G.、Josef,C.著,“用于对逻辑回归的变点进行评定的转换模型”,美国,《医学统计学》,威立布莱克威尔出版社,2003年,第22卷,第1141-1162页[Non-Patent Document 3] Roberto, PB, Eliseo, G., Josef, C., "Conversion Model for Evaluating the Change Point of Logistic Regression", USA, Medical Statistics, Wiley Blackwell Publishers, 2003, Vol. 22, pp. 1141-1162
【非专利文献4】Paek,S.等著,“对前庭神经鞘瘤进行伽玛刀手术后的听力保存”,美国,《癌症》,威立布莱克威尔出版社,2005年,第1040卷,第580-590页[Non-Patent Document 4] Paek, S., et al., "Hearing preservation after gamma knife surgery on vestibular schwannomas", USA, Cancer, Wiley Blackwell, 2005, Vol. 1040 , pp. 580-590
【非专利文献5】Liu,J.K.、Rovit,R.L.、Couldwell,W.T.著,“垂体卒中”,美国,《神经外科论文集》,Thieme出版社,2001年,第12卷,第315-320页 [Non-Patent Document 5] Liu, J.K., Rovit, R.L., Couldwell, W.T., "Pituitary Stroke", USA, Proceedings of Neurosurgery, Thieme Press, 2001, Vol. 12, pp. 315-320
【非专利文献6】陈洛南、刘锐、刘治平、李美仪、合原一幸著,“通过动态网络标志物对复杂疾病的突然恶化进行早期预警信号的检测”,《科学陈诉》,2012年3月29日,互联网(网址:http://www.natureasia.com/ja-jp/srep/abstracts/35129)[Non-Patent Document 6] Chen Luonan, Liu Rui, Liu Zhiping, Li Meiyi, and He Yuanyi, "Detecting Early Warning Signals for Sudden Deterioration of Complex Diseases by Dynamic Network Markers", Science Report, March 29, 2012 Day, Internet (website: http://www.natureasia.com/ja-jp/srep/abstracts/35129)
发明内容Summary of the invention
本发明鉴于上述难题而完成,基于DNB理论,开发出了基于单样本(高通量数据)预测复杂疾病恶性突变的检测装置和检测方法,可以有效地将DNB理论应用在单样本情形上,这是传统的预测方法所做不到的。The present invention has been completed in view of the above problems, and based on the DNB theory, a detection device and a detection method for predicting a malignant mutation of a complex disease based on a single sample (high-throughput data) have been developed, and the DNB theory can be effectively applied to a single-sample case. It is impossible to do with traditional prediction methods.
本发明的第一方面提供了一种用于对前疾病状态进行检测的检测装置,包括:样本获取单元,该样本获取单元获取检测对象的对照样本数据和单样本数据;DNB设定单元,该DNB设定单元设定DNB成员;指标计算单元,该指标计算单元通过计算对照样本数据和单样本数据的概率分布之间的距离来得到复合指标;以及状态判别单元,该状态判别单元当所述复合指标超过阈值时,判断检测对象处于前疾病状态。A first aspect of the present invention provides a detecting apparatus for detecting a pre-disease state, comprising: a sample acquiring unit that acquires collation sample data and single-sample data of a detection object; and a DNB setting unit, The DNB setting unit sets a DNB member; an index calculating unit that obtains a composite index by calculating a distance between a probability distribution of the reference sample data and the single sample data; and a state discriminating unit that When the composite index exceeds the threshold, it is determined that the test subject is in the pre-disease state.
在另一优选例中,所述指标计算单元包括:第1指数计算单元,该第1指数计算单元将对照样本数据和单样本数据中DNB成员的概率分布之间的距离作为第1指数;第2指数计算单元,该第2指数计算单元将对照样本数据中DNB成员和非DNB成员的概率分布之间的距离作为第2指数;第3指数计算单元,该第3指数计算单元将对照样本数据和单样本数据中非DNB成员的概率分布之间的距离作为第3指数;以及校正值设定单元,该校正值设定单元对校正值进行设定,该指标计算单元基于第1指数、第2指数、第3指数、及校正值得到复合指标。In another preferred example, the index calculation unit includes: a first index calculation unit that uses a distance between the comparison sample data and a probability distribution of the DNB members in the one-sample data as the first index; 2 index calculation unit, the second index calculation unit compares the distance between the probability distributions of the DNB members and the non-DNB members in the sample data as the second index; the third index calculation unit, the third index calculation unit compares the sample data And a distance between the probability distributions of the non-DNB members in the single sample data as the third index; and a correction value setting unit that sets the correction value, the index calculation unit is based on the first index, the first The 2 index, the 3rd index, and the correction value are combined.
在另一优选例中,所述指标计算单元将所述第1指数、所述第2指数、及所述第3指数与所述校正值之和的倒数的乘积作为复合指标。In another preferred example, the index calculation unit uses a product of the first index, the second index, and the reciprocal of the sum of the third index and the correction value as a composite index.
在另一优选例中,所述概率分布之间的距离是KL距离。In another preferred embodiment, the distance between the probability distributions is a KL distance.
在另一优选例中,所述校正值是小于等于1的正数。 In another preferred embodiment, the correction value is a positive number equal to or less than one.
在另一优选例中,所述校正值是0.01。In another preferred embodiment, the correction value is 0.01.
在另一优选例中,还包括输出单元,该输出单元以模拟显示的方式输出DNB成员的概率分布。In another preferred embodiment, an output unit is further included, which outputs a probability distribution of the DNB members in an analog display manner.
本发明的第一方面提供了一种用于对前疾病状态进行检测的检测方法,包括以下步骤:样本获取步骤,获取检测对象的对照样本数据和单样本数据;DNB设定步骤,设定DNB成员;指标计算步骤,通过计算对照样本数据和单样本数据的概率分布之间的距离来得到复合指标;以及状态判别步骤,当所述复合指标超过阈值时判断检测对象处于前疾病状态。A first aspect of the present invention provides a detection method for detecting a pre-disease state, comprising the steps of: a sample acquisition step of acquiring comparison sample data and single sample data of a detection object; a DNB setting step of setting a DNB a member; an index calculation step of obtaining a composite index by calculating a distance between a probability distribution of the reference sample data and the single sample data; and a state discriminating step of determining that the detected object is in a pre-disease state when the composite index exceeds the threshold.
本发明针对现有技术中的不足,提供了基于单样本(高通量数据)预测复杂疾病恶性突变的检测装置和检测方法,可以有效地将DNB理论应用在单样本情形上,能简单又准确地检测出前疾病状态。The present invention provides a detection device and a detection method for predicting a malignant mutation of a complex disease based on a single sample (high-throughput data), and can effectively apply the DNB theory to a single-sample situation, which can be simple and accurate. The pre-disease state was detected.
附图说明DRAWINGS
图1是示出复杂疾病发展的三个阶段的示意图。Figure 1 is a schematic diagram showing three stages of the development of a complex disease.
图2是示出本发明的检测装置的结构的框图。Fig. 2 is a block diagram showing the configuration of a detecting device of the present invention.
图3是示出根据本发明的实施方式基于单样本数据预测复杂疾病的突变的示意图。3 is a schematic diagram showing mutations predicting complex diseases based on single sample data in accordance with an embodiment of the present invention.
图4是示出对本发明的实施方式的正确性和可靠性进行验证的数值模拟结果的示意图。4 is a schematic diagram showing numerical simulation results for verifying the correctness and reliability of an embodiment of the present invention.
图5是示出根据本发明的实施方式进行检测的例子的示意图。FIG. 5 is a schematic diagram showing an example of detecting according to an embodiment of the present invention.
图6是示出根据本发明的实施方式进行检测的另一个例子的示意图。FIG. 6 is a schematic diagram showing another example of detecting according to an embodiment of the present invention.
具体实施方式detailed description
下面结合附图进一步阐明本发明的内容,但下述的例子并不是对本发明的保护范围的限制。The contents of the present invention are further clarified below with reference to the accompanying drawings, but the following examples are not intended to limit the scope of the invention.
本发明的第一方面是一种用于对前疾病状态进行检测的检测装置。图3是示出本发明的检测装置的结构的框图。不同于现有技术中的多样本,在本实施方式中利用单样本(高通量数据)来预测复杂疾病的突变。如图2所示,检测 装置1包括:样本获取单元10,获取检测对象的对照样本数据和单样本数据;DNB设定单元11,设定DNB成员;指标计算单元,通过计算对照样本数据和单样本数据的概率分布之间的距离来得到复合指标;以及状态判别单元12,当所述复合指标超过阈值时,判断检测对象处于前疾病状态。对照样本数据和单样本数据是指control和case这两组样本,其中control组(对照组)中含有大量样本,case组中仅含有一个样本(单样本)。DNB成员的设定方法是公知的,可以使用非专利文献6中记载的设定方法,在此不再赘述。A first aspect of the invention is a detection device for detecting a pre-disease state. Fig. 3 is a block diagram showing the configuration of a detecting device of the present invention. Unlike the multi-samples in the prior art, single-sample (high-throughput data) is utilized in the present embodiment to predict mutations in complex diseases. As shown in Figure 2, detection The apparatus 1 includes: a sample acquisition unit 10 that acquires comparison sample data and single sample data of the detection object; a DNB setting unit 11 that sets a DNB member; and an index calculation unit that calculates a probability distribution between the comparison sample data and the single sample data The distance is obtained to obtain a composite index; and the state discriminating unit 12 determines that the detected object is in the pre-disease state when the composite index exceeds the threshold. The control sample data and the single sample data refer to the two sets of control and case samples, in which the control group (control group) contains a large number of samples, and the case group contains only one sample (single sample). The setting method of the DNB member is well known, and the setting method described in Non-Patent Document 6 can be used, and will not be described herein.
更具体地,指标计算单元包括:第1指数计算单元13,将对照样本数据和单样本数据中DNB成员的概率分布之间的距离作为第1指数;第2指数计算单元14,将对照样本数据中DNB成员和非DNB成员的概率分布之间的距离作为第2指数;第3指数计算单元15,将对照样本数据和单样本数据中非DNB成员的概率分布之间的距离作为第3指数;以及校正值设定单元16,对校正值进行设定,指标计算单元基于第1指数、第2指数、第3指数、及校正值得到复合指标。应当理解的是,即使用一个单独的指标计算单元来计算对照样本数据和单样本数据的概率分布之间的距离也是可以的,细分为上述4个单元只是一种可选的方式。More specifically, the index calculation unit includes: a first index calculation unit 13 that takes the distance between the comparison sample data and the probability distribution of the DNB members in the one-sample data as the first index; the second index calculation unit 14 compares the sample data The distance between the probability distributions of the middle DNB members and the non-DNB members is taken as the second index; the third index calculating unit 15 takes the distance between the comparison sample data and the probability distribution of the non-DNB members in the single sample data as the third index; The correction value setting unit 16 sets the correction value, and the index calculation unit obtains the composite index based on the first index, the second index, the third index, and the correction value. It should be understood that it is also possible to use a single indicator calculation unit to calculate the distance between the probability distribution of the control sample data and the single sample data, and subdividing into the above four units is only an alternative.
在一个优选的方式中,复合指标可以是所述第1指数、所述第2指数、及所述第3指数与所述校正值之和的倒数的乘积。In a preferred embodiment, the composite index may be a product of the first index, the second index, and a reciprocal of the sum of the third index and the correction value.
以下参照图3对复合指标的计算进行详细说明。图3是示出根据本发明的实施方式基于单样本数据预测复杂疾病的突变的示意图。由于需要对这两组样本的分布的相似程度进行估测,也就是说,要计算这两组样本的分布的概率距离,因此在本发明中,不管用哪种计算概率距离的方法,都可以实现本发明,例如直方图相交距离、卡方检验、二次型距离、匹配距离、柯尔莫哥洛夫-斯米尔诺夫检验(K-S距离)、地球移动距离、K-L距离等等。优选为使用K-L距离来进行计算,因为使用K-L距离计算的话,计算量最小,应用最广泛,精度也最高。K-L距离又称为相对熵、KL散度、信息散度、信息增益,是两个概率分 布的差别的非对称性的度量。其意义是:在相同事件空间里,概率分布P(k)的事件空间,若用概率分布Q(k)编码时,平均每个基本事件(符号)编码长度增加了多少比特。The calculation of the composite index will be described in detail below with reference to FIG. 3 is a schematic diagram showing mutations predicting complex diseases based on single sample data in accordance with an embodiment of the present invention. Since it is necessary to estimate the degree of similarity of the distribution of the two sets of samples, that is, to calculate the probability distance of the distribution of the two sets of samples, in the present invention, no matter which method of calculating the probability distance is used, The present invention is implemented, such as histogram intersection distance, chi-square test, quadratic distance, matching distance, Kolmogorov-Smirnov test (KS distance), earth moving distance, KL distance, and the like. It is preferable to use the K-L distance for calculation, because the calculation using K-L distance is the smallest, the most widely used, and the highest precision. K-L distance is also called relative entropy, KL divergence, information divergence, information gain, and is two probability points. A measure of the asymmetry of the difference in cloth. The significance is: in the same event space, the event space of the probability distribution P(k), if encoded by the probability distribution Q(k), averages how many bits per basic event (symbol) code length is increased.
Kullback-Leibler距离(K-L距离)的定义为:The Kullback-Leibler distance (K-L distance) is defined as:
Figure PCTCN2015071237-appb-000001
Figure PCTCN2015071237-appb-000001
其中P和Q分别是两个离散的概率分布,P(k)=ProbP(x=xk),Q(k)=ProbQ(y=yk),并且Where P and Q are respectively two discrete probability distributions, P(k)=Prob P (x=x k ), Q(k)=Prob Q (y=y k ), and
Figure PCTCN2015071237-appb-000002
Figure PCTCN2015071237-appb-000002
Kullback-Leibler距离也可以写作Kullback-Leibler distance can also be written
DKL(P,Q)=H(P,Q)-H(P)D KL (P,Q)=H(P,Q)-H(P)
即K-L距离是一种条件熵,主要用于判断两个概率分布的距离,也被用于计算两个样本A和B之间的相似性(similarity)的大小。如果DKL(PA,PB)=0,那么样本A和B所含有的信息相同,两个样本的相似性达到最大。That is, the KL distance is a conditional entropy, which is mainly used to judge the distance between two probability distributions, and is also used to calculate the similarity between two samples A and B. If D KL (P A , P B ) = 0, then the samples A and B contain the same information, and the similarity between the two samples is maximized.
利用K-L距离设计如下的复合指标I:The following composite index I is designed using the K-L distance:
Figure PCTCN2015071237-appb-000003
Figure PCTCN2015071237-appb-000003
这个复合指标I被称为DNB-S score,其中,Case数据和control数据中DNB成员之间的K-L距离(DKL(caseDNB,controlDNB))对应于上述第1指数、Case数据中DNB成员和非DNB成员之间的K-L距离(DKL(caseDNB,casenon-DNB))对应于上述第2指数、Case数据和control数据中非DNB成员之间的K-L距离(DKL(casenon-DNB,controlnon-DNB))对应于上述第3指数,ε是一个小的正数,是为了避免出现零分母的校正值,取为0~1,优选为0.01。在上述设计的复合指标中,复合指标是第1指数、第2指数、及第3指数与校正值之和的倒数的乘积,但这种设计仅是一个示例,只要是基于第1指数、第2指数、及第3指数与校正值,也可以设计为其他方式。 This composite index I is called DNB-S score, where the KL distance between the Case data and the DNB members in the control data (D KL (case DNB , control DNB )) corresponds to the above-mentioned first index, the DNB member in the Case data. The KL distance between the non-DNB member and the non-DNB member (D KL (case DNB , case non-DNB )) corresponds to the KL distance between the non-DNB members in the second index, Case data, and control data (D KL (case non- DNB , control non-DNB )) corresponds to the above third index, ε is a small positive number, in order to avoid the occurrence of the zero denominator correction value, taken as 0 to 1, preferably 0.01. In the composite index of the above design, the composite index is the product of the first index, the second index, and the reciprocal of the sum of the third index and the correction value, but this design is only an example, as long as it is based on the first index, the first The 2 index, and the 3rd index and the correction value can also be designed in other ways.
如图3所示,(a)示出了复杂疾病的发展分为三个状态,(b)示出了对疾病系统的单样本预测的依据,即基于一个case样本,很多control样本(对照组样本,如健康人的样本),已有的DNB网络。这些样本例如可以是基因表达式,基于DNB理论可以判断该case样本是否处于前疾病状态,(c)示出了利用K-L距离得到的分布,可以看到当系统处于前疾病状态时候,case数据中的DNB生物分子具有双峰分布,而control数据中的DNB生物分子具有单峰分布,同时,case数据中的non-DNB分子具有单峰分布,这个性质使得当单case样本取自前疾病状态时候,DNB与non-DNB分布的区别、DNB分别在case数据与control数据中的分布可以提供显著的信号,(d)示出了利用新的复合指标I即DNB-S score进行预测,当复合指标超过阈值时,判断检测对象处于前疾病状态,该指标仅需要单case样本就可以计算。当系统处于正常状态或疾病状态时候,DNB-S score具有较低值,而当系统处于前疾病状态时候,DNB-S score处于较高值,从而可以为系统突变的早期预警提供可靠的信号。As shown in Figure 3, (a) shows that the development of a complex disease is divided into three states, and (b) shows the basis for a one-sample prediction of the disease system, that is, based on a case sample, many control samples (control group Samples, such as samples of healthy people), existing DNB networks. These samples may be, for example, gene expressions, based on the DNB theory, whether the case sample is in the pre-disease state, and (c) showing the distribution obtained using the KL distance, which can be seen in the case data when the system is in the pre-disease state. The DNB biomolecules have a bimodal distribution, while the DNB biomolecules in the control data have a unimodal distribution. Meanwhile, the non-DNB molecules in the case data have a unimodal distribution, which makes the single case sample taken from the pre-disease state. The difference between the distribution of DNB and non-DNB, the distribution of DNB in case data and control data respectively can provide a significant signal, and (d) shows the prediction using the new composite index I, DNB-S score, when the composite indicator When the threshold value is exceeded, it is judged that the detection object is in the pre-disease state, and the indicator can be calculated only by a single case sample. When the system is in a normal state or disease state, the DNB-S score has a lower value, and when the system is in the pre-disease state, the DNB-S score is at a higher value, which can provide a reliable signal for early warning of system mutation.
根据DNB的性质,当系统接近前疾病状态时候,DNB-S score有如下的特征::According to the nature of DNB, when the system is close to the pre-disease state, the DNB-S score has the following characteristics:
Case数据和control数据中DNB成员之间的K-L距离(DKL(caseDNB,controlDNB))增大,因为DNB中生物分子在case数据和control数据中的分布显著不同,即,case数据中的DNB分子呈现双峰分布,而control数据中的DNB分子呈现单峰分布(见图3)。The KL distance (D KL (case DNB , control DNB )) between the DKB members in the Case data and the control data is increased because the distribution of biomolecules in the case data and the control data in the DNB is significantly different, that is, in the case data. The DNB molecule exhibits a bimodal distribution, while the DNB molecules in the control data exhibit a unimodal distribution (see Figure 3).
Case数据中DNB成员和非DNB成员之间的K-L距离(DKL(caseDNB,casenon-DNB))增大,因为即使同在case数据中,DNB中生物分子在case呈现双峰分布,而non-DNB分子呈现单峰分布(见图3)。The KL distance (D KL (case DNB , case non-DNB )) between the DNB member and the non-DNB member in the Case data increases, because even in the same case data, the biomolecule in the DNB exhibits a bimodal distribution in the case, and The non-DNB molecule exhibited a unimodal distribution (see Figure 3).
这两条性质使得DNB-S score可以准确地指示前疾病状态。另一方面,当系统越过临界点,进入疾病状态以后,DNB-S score中的第三项有如下的性质:These two properties allow the DNB-S score to accurately indicate the pre-disease state. On the other hand, when the system crosses the critical point and enters the disease state, the third item in the DNB-S score has the following properties:
当系统处于正常状态或前疾病状态时候,Case数据和control数据中非DNB成员之间的K-L距离(DKL(casenon-DNB,controlnon-DNB))没有明显的变化;然而,当系统越过临界点,进入疾病状态以后,non-DNB中生物分子在case数据和 control数据中的分布显著不同,即non-DNB分子在疾病状态下(即在case数据中)的表达量具有更高(或低)的均值,因而分布的均值更大(或小)。When the system is in a normal state or a pre-disease state, there is no significant change in the KL distance ( DKL (case non-DNB , control non-DNB )) between the Case data and the control data; however, when the system crosses At the critical point, after entering the disease state, the distribution of biomolecules in non-DNB in case data and control data is significantly different, that is, the expression level of non-DNB molecules in disease state (ie, in case data) is higher (or The mean of the low, and thus the mean of the distribution is larger (or smaller).
根据上面的DNB-S score的特征,我们可以看到前两项可以用来区分单样本是处于正常状态还是处于前疾病状态,第三项可以用来区分单样本是处于前疾病状态还是处于疾病状态。According to the characteristics of the above DNB-S score, we can see that the first two items can be used to distinguish whether a single sample is in a normal state or a pre-disease state, and the third item can be used to distinguish whether a single sample is in a pre-disease state or in a disease state. status.
因此,当判断检测对象处于前疾病状态时,可以由状态判别单元12发出预警信号。Therefore, when it is judged that the detection subject is in the pre-disease state, the state discriminating unit 12 can issue an early warning signal.
另外,检测装置还可以包括一个输出单元(在图2中未示出),以模拟显示的方式输出DNB成员的概率分布(参照图3中的(c))。Further, the detecting means may further include an output unit (not shown in Fig. 2) for outputting the probability distribution of the DNB members in an analog display manner (refer to (c) in Fig. 3).
本发明的第二方面是一种用于对前疾病状态进行检测的检测方法,该检测方法包括以下步骤。A second aspect of the invention is a detection method for detecting a pre-disease state, the detection method comprising the following steps.
首先,获取检测对象的对照样本数据和单样本数据。在本发明第一方面的检测装置中,样本获取单元10执行该步骤。First, the comparison sample data and the single sample data of the detection object are acquired. In the detecting device of the first aspect of the invention, the sample acquiring unit 10 performs this step.
接着,设定DNB成员。在本发明第一方面的检测装置中,DNB设定单元11执行该步骤。Next, set the DNB member. In the detecting device of the first aspect of the invention, the DNB setting unit 11 performs this step.
然后,通过计算对照样本数据和单样本数据的概率分布之间的距离来得到复合指标。在本发明第一方面的检测装置中,指标计算单元执行该步骤。Then, the composite index is obtained by calculating the distance between the probability distribution of the control sample data and the single sample data. In the detecting device of the first aspect of the invention, the index calculating unit performs the step.
最后,当所述复合指标超过阈值时判断检测对象处于前疾病状态。在本发明第一方面的检测装置中,状态判别单元12执行该步骤。Finally, when the composite index exceeds the threshold, it is determined that the test subject is in the pre-disease state. In the detecting device of the first aspect of the invention, the state discriminating unit 12 performs this step.
其中,在执行指标计算步骤时,也可以和本发明第一方面的检测装置得到复合指标相类似,在指标计算步骤中还包括:第1指数计算步骤,将对照样本数据和单样本数据中DNB成员的概率分布之间的距离作为第1指数;第2指数计算步骤,将对照样本数据中DNB成员和非DNB成员的概率分布之间的距离作为第2指数;第3指数计算步骤,将对照样本数据和单样本数据中非DNB成员的概率分布之间的距离作为第3指数;以及校正值设定步骤,对校正值进行设定。 In the execution of the index calculation step, the detection means of the first aspect of the present invention may be similar to the composite index, and the indicator calculation step further includes: a first index calculation step, and the control sample data and the single sample data in the DNB The distance between the probability distributions of the members is taken as the first index; the second index calculation step is to use the distance between the probability distributions of the DNB members and the non-DNB members in the comparison sample data as the second index; the third index calculation step, which will be compared The distance between the sample data and the probability distribution of the non-DNB members in the single sample data is taken as the third index; and the correction value setting step is performed to set the correction value.
在进行了上述细分之后的指标计算步骤中,基于第1指数、第2指数、第3指数、及校正值得到复合指标。In the index calculation step after the above-described subdivision, the composite index is obtained based on the first index, the second index, the third index, and the correction value.
优选为在指标计算步骤中将所述第1指数、所述第2指数、及所述第3指数与所述校正值之和的倒数的乘积作为复合指标。Preferably, in the index calculation step, the product of the first index, the second index, and the reciprocal of the sum of the third index and the correction value is used as a composite index.
同样地,在复合指标的计算中,不管用哪种计算概率距离的方法,都可以实现本发明,但优选为使用K-L距离来进行计算。通过K-L距离来设计的复合指标I也与上述的本发明的第一方面中的检测装置一样,在此不再赘述。Similarly, in the calculation of the composite index, the present invention can be implemented regardless of the method of calculating the probability distance, but it is preferable to perform calculation using the K-L distance. The composite index I designed by the K-L distance is also the same as the detecting device in the first aspect of the present invention described above, and will not be described herein.
以下参照图4对本发明的实施方式的正确性和可靠性进行验证。图4是示出对本发明的实施方式的正确性和可靠性进行验证的数值模拟结果的示意图。The correctness and reliability of the embodiment of the present invention are verified below with reference to FIG. 4 is a schematic diagram showing numerical simulation results for verifying the correctness and reliability of an embodiment of the present invention.
利用一个16-维的网络进行数值模拟,该网络是如下的米氏(Michaelis-Menten)模型:Numerical simulations were performed using a 16-dimensional network, which is the following Michaelis-Menten model:
Figure PCTCN2015071237-appb-000004
Figure PCTCN2015071237-appb-000004
其中参数P的范围是-0.1到0.32,ζi(t)是白噪声。我们利用欧拉(Euler)方法模拟了系统从正常状态(P>0)到达临界点(P=0),并进入疾病状态(P<0)的过程,如图4所示。该数值模拟证明了在本发明的上述实施方式中设定的复合指标I即DNB-S score的正确性与可靠性。The range of the parameter P is -0.1 to 0.32, and ζi(t) is white noise. We use the Euler method to simulate the process from the normal state (P>0) to the critical point (P=0) and into the disease state (P<0), as shown in Figure 4. This numerical simulation demonstrates the correctness and reliability of the composite index I, that is, the DNB-S score set in the above embodiment of the present invention.
图4的(a)示出了16-维的米氏网络,该网络中节点1-7属于DNB成员,节点9-16属于非DNB成员,(b)示处了5组单样本数据的预测结果,当系统接近临界点时(参数P=0),利用DNB-S score,能发现准确的预警信号,即,五组B-S score均显著上升,超过了阈值,(c)示出了模拟显示case数据中的DNB成员(节点1-7)具有显著不同的分布,即,case数据中的DNB呈现双峰分布(两侧),而control数据中的DNB呈现单峰分布(中央),(d)示出了模拟显示即使同处于case数据中,DNB成员(节点1-7)与非DNB成员(节点8-16)具有显著不同的分布,即,DNB成员呈现双峰分布(两侧),而非DNB成员呈现单峰分布(中央)。Figure 4(a) shows a 16-dimensional Mie network in which nodes 1-7 belong to DNB members, nodes 9-16 belong to non-DNB members, and (b) show predictions of 5 sets of single-sample data. As a result, when the system approaches the critical point (parameter P=0), using the DNB-S score, an accurate warning signal can be found, that is, the five groups of BS scores all rise significantly, exceeding the threshold, and (c) shows the analog display. The DNB members (nodes 1-7) in the case data have significantly different distributions, ie, the DNBs in the case data exhibit a bimodal distribution (both sides), while the DNBs in the control data exhibit a unimodal distribution (central), (d ) shows that the simulation shows that even though they are in the case data, the DNB members (nodes 1-7) and the non-DNB members (nodes 8-16) have significantly different distributions, ie the DNB members exhibit a bimodal distribution (both sides), Non-DNB members exhibit a unimodal distribution (central).
以下利用真实的临床数据或试验数据对本发明的检测装置和检测方法,也就是对复合指标I即DNB-S score的有效性、准确性进行检验。对DNB-S score的具体运用基于公开数据:(1)个体注射H3N2病毒后感冒(influenza)是否发病的临床试验数据(GSE30550);(2)小鼠暴露在毒气中对肺部的急性伤害的基因数据(GSE2565)。The detection device and the detection method of the present invention, that is, the validity and accuracy of the composite index I, that is, the DNB-S score, are tested using real clinical data or test data. The specific use of the DNB-S score is based on published data: (1) clinical trial data on the incidence of influenza after injection of H3N2 virus in an individual (GSE30550); (2) acute injury to the lungs in mice exposed to toxic gases Gene data (GSE2565).
检测例1:感冒疾病(influenza)Test Example 1: Cold disease (influenza)
图5是示出根据本发明的实施方式进行检测的例子的示意图。在图5中,显示了DNB成功应用在一个特定的疾病的单样本早期检测中,即注射感冒病毒后个体发病的临床试验(高通量试验数据GSE30550)。通过应用DNB-S score,对17个个体的数据进行了检验(每个个体在一个采样时间点处只有一个样本),其中有9个个体在注射感冒病毒以后出现了感冒症状(symptomatic subjects),有8个个体在注射感冒病毒以后没有出现感冒症状(asymptomatic subjects)。如图5所示,在(a)中,检验了9个发病的个体,在每个个体发病之前就通过 DNB-S score探测到其处于前疾病状态(超过阈值),并发出了早期预警的信号;在(b)中,检验了8个不发病的个体,发现DNB-S score没有明显的改变(未超过阈值)。因此,证明了DNB-S score对该临床实验数据是有效的、准确的。具体来说,检验的9个发病的个体都在第7个采样时间点之前(36小时),而在试验中,所有发病的个体,都是在第8个采样时间点(45小时)或之后才检出症状的(见图5)。而对于8个不发病的个体,通过DNB-S score没有检测到处于前疾病状态,因此也没有发出预警信号。这进一步在临床试验数据上的应用证明了DNB-S score的有效性和准确性。另外,(c)是整个生物分子网络的动态变化图,这个动态变化图描绘了整个网络结构从0小时到45小时的变化。(d)详细记录了对每个个体的发病与否、发病时间,已经通过DNB-S score检测到前疾病状态的时间点,也可以看到对发病的个体,这种检测是及时的。更具体地,在图5中,利用DNB-S score对17个个体的感冒的数据进行了发病的预测,(a)和(b)显示了17个个体利用DNB-S score进行检验的曲线,其中9个个体在注射感冒病毒(H3N2virus)以后发生了病变(symptomatic),利用DNB-S score在其病变早期就能检测出前疾病状态(超过阈值)((a)),8个个体在注射感冒病毒(H3N2virus)以后没有发生病变(asymptomatic),他们的数据对DNB-S score不敏感(未超过阈值)((b)),(c)是对第一个个体(1st subject)绘制的动态网络发展图,利用已有的PPI网络,把数据映射到网络上,其中节点的颜色代表该生物分子表达量的波动程度(fluctuation),其中属于DNB成员的分子被特别安排在左下角,可以看到,在正常状态下(0-12小时),系统的网络结构没有大的变化,但是在病前状态下(36小时),系统的网络结构(特别是左下角的DNB成员)发生很大的变化,提供了清晰的疾病预警信号,(d)是基于临床数据绘制的17个个体的发病与否、发病时间及利用DNB-S score检出发病的时间点等信息表,可以看到,DNB-S score的确在个体发病之前就准确检测出前疾病状态而发出预警信号,而对不发病的个体没有反应。FIG. 5 is a schematic diagram showing an example of detecting according to an embodiment of the present invention. In Figure 5, it is shown that DNB has been successfully applied in a single-sample early detection of a particular disease, a clinical trial of individual morbidity after injection of a cold virus (high-throughput test data GSE30550). The data of 17 individuals were tested by applying the DNB-S score (each individual had only one sample at one sampling time point), and 9 of them developed symptomatic subjects after the injection of the cold virus. Eight individuals did not develop cold symptoms (asymptomatic subjects) after the injection of the cold virus. As shown in Figure 5, in (a), 9 individuals with morbidity were examined and passed before each individual became ill. The DNB-S score detected that it was in the pre-disease state (beyond the threshold) and signaled an early warning; in (b), eight individuals who did not develop the disease and found no significant change in the DNB-S score (not Exceeded the threshold). Therefore, it was demonstrated that the DNB-S score is valid and accurate for this clinical trial data. Specifically, the 9 affected individuals were tested before the 7th sampling time point (36 hours), and in the trial, all the affected individuals were at the 8th sampling time point (45 hours) or after. The symptoms were detected (see Figure 5). For the 8 non-infected individuals, the pre-disease status was not detected by the DNB-S score, so no warning signal was issued. This further application in clinical trial data demonstrates the validity and accuracy of the DNB-S score. In addition, (c) is a dynamic map of the entire biomolecular network depicting changes in the overall network structure from 0 hours to 45 hours. (d) Detailed records of the onset and morbidity of each individual, the time of onset, the time point at which the pre-existing disease state has been detected by the DNB-S score, and the individual on the onset of the disease can also be seen. This test is timely. More specifically, in Fig. 5, the data of the colds of 17 individuals were predicted using the DNB-S score, and (a) and (b) showed the curves of 17 individuals using the DNB-S score for testing. Nine individuals developed a symptomatic (symptomatic) after injection of the cold virus (H3N2virus), and the pre-existing disease state (beyond the threshold) was detected in the early stage of the lesion using the DNB-S score ((a)), and 8 individuals were injected with a cold. The virus (H3N2virus) did not develop lesions (asymptomatic), their data was insensitive to the DNB-S score (not exceeding the threshold) ((b)), and (c) was the dynamic network mapped to the first individual (1st subject) Development map, using the existing PPI network, to map the data to the network, wherein the color of the node represents the fluctuation of the expression of the biomolecule, and the molecules belonging to the DNB members are specially arranged in the lower left corner, and can be seen Under normal conditions (0-12 hours), there is no big change in the network structure of the system, but in the pre-disease state (36 hours), the network structure of the system (especially the DNB members in the lower left corner) changes greatly. Provides clear disease warning The signal, (d) is a table of information on the onset and morbidity of 17 individuals based on clinical data, the time of onset, and the time point at which the disease was detected using the DNB-S score. It can be seen that the DNB-S score is indeed before the onset of the individual. An early warning signal is issued to accurately detect the pre-existing disease state, and there is no response to the individual who does not develop the disease.
检测例2:肺部急性伤害疾病(acute lung injury) Test Example 2: acute lung injury
图6是示出根据本发明的实施方式进行检测的另一个例子的示意图。在图6中,显示了DNB-S score成功应用在另一个特定的疾病的单样本早期检测中,即光气吸入性急性肺损伤(高通量试验数据GSE2565)。该数据是采自对六个个体(鼠)进行了暴露光气之中的肺部急性伤害实验,一共有9个采样时间点,在每个采样时间点,每个个体有一个case样本和一个control(对照组)样本。利用DNB-S score对该数据进行了疾病的早期检测(图6)。如图6所示,(a)显示了6个个体利用DNB-S score进行检验的曲线,从该图可以看到,6个暴露在光气中的试验个体,都在第四个时间点(4小时)处检测到了明显的可以判断为处于前疾病状态的信号(超过阈值),而根据试验观测,这6个个体是在第五、六个时间点(8小时–12小时)时候发现肺部损伤的症状。因此,利用DNB-S score,准确判断了前疾病状态,成功预警了系统的急性突变,(b)显示了对6个对照组的个体利用DNB-S score进行检验的曲线,可以看到,DNB-S score对这6个没有暴露在光气中的试验个体没有反应,检测不到任何的预警信号(未超过阈值)。DNB-S score在该疾病上的成功应用也证明了本发明的实施方式的有效性和准确性。FIG. 6 is a schematic diagram showing another example of detecting according to an embodiment of the present invention. In Figure 6, it is shown that the DNB-S score was successfully applied in a single-sample early detection of another specific disease, phosgene inhalation acute lung injury (high-throughput test data GSE2565). The data was taken from an acute lung injury test in which six individuals (mouse) were exposed to phosgene. There were a total of 9 sampling time points. At each sampling time point, each individual had a case sample and a Control (control) sample. The data was initially detected for disease using the DNB-S score (Figure 6). As shown in Fig. 6, (a) shows a curve of six individuals using the DNB-S score for testing. From this figure, it can be seen that six test subjects exposed to phosgene are at the fourth time point ( At 4 hours), a significant signal (beyond the threshold) that can be judged to be in the pre-disease state was detected, and according to the experimental observation, the six individuals found the lung at the fifth and sixth time points (8 hours–12 hours). Symptoms of injury. Therefore, using the DNB-S score, the pre-disease state was accurately judged, and the acute mutation of the system was successfully alerted, and (b) the curve of the individual using the DNB-S score was examined for the individuals in the six control groups. It can be seen that DNB The -S score did not respond to these 6 test subjects not exposed to phosgene and did not detect any warning signals (not exceeding the threshold). The successful application of DNB-S score in this disease also demonstrates the effectiveness and accuracy of embodiments of the present invention.
更具体地,在图6中,利用DNB-S score对6个个体的肺部急性伤害的数据进行了发病的预测,(a)显示了6个个体利用DNB-S score进行检验的曲线,可以看到,6个暴露在光气中的试验个体,都在第四个时间点(4小时)处检测到了处于前疾病状态而发出了明显的信号(超过阈值),而根据试验观测,这6个个体是在第五、六个时间点(8小时–12小时)时候发现肺部损伤的症状,因此,利用DNB-S score准确判断了前疾病状态,成功预警了系统的急性突变,(b)显示了对6个对照组的个体利用DNB-S score进行检验的曲线,可以看到,DNB-S score对这6个没有暴露在光气中的试验个体没有反应,检测不到任何的预警信号(未超过阈值),DNB-S score在该疾病上的成功应用证明了本发明的实施方式的有效性和准确性。More specifically, in Fig. 6, the data of acute injury of lungs in 6 individuals were predicted by DNB-S score, and (a) the curves of 6 individuals using the DNB-S score were examined. It was observed that the six test subjects exposed to phosgene detected a pre-existing condition at the fourth time point (4 hours) and sent a clear signal (beyond the threshold), and according to experimental observations, Individuals were found to have symptoms of lung injury at the fifth and sixth time points (8 hours–12 hours). Therefore, the pre-disease status was accurately determined using the DNB-S score, and the system was successfully alerted to acute mutations. ) The curves of the individuals in the 6 control groups using the DNB-S score were shown. It can be seen that the DNB-S score did not respond to the 6 test subjects not exposed to phosgene, and no warning was detected. The signal (not exceeding the threshold), the successful application of the DNB-S score on the disease demonstrates the effectiveness and accuracy of embodiments of the present invention.
如上所述,本发明有效地将DNB理论应用在单样本情形上,不但结构和方 法简单而且既有效又准确。As described above, the present invention effectively applies the DNB theory to a single sample case, not only the structure and the square. The method is simple and efficient and accurate.
以上所说明的仅是本发明较佳可行的实施例而已,不能因此即局限本发明的权利范围,对熟悉本领域的普通技术人员来说,举凡运用本发明的技术方案和技术构思做出其它种相应的改变和变形,而所有这些改变和变形都应属在本发明权利要求的保护范围之内。The above description is only a preferred embodiment of the present invention, and thus the scope of the present invention is not limited thereto, and those skilled in the art can make other use of the technical solutions and technical concepts of the present invention. Corresponding changes and modifications are intended to be included within the scope of the appended claims.
附图标记说明Description of the reference numerals
1   检测装置1 detection device
10  样本获取单元10 sample acquisition unit
11  DNB设定单元11 DNB setting unit
12  状态判别单元12 state discriminating unit
13  第1指数计算单元13 1st index calculation unit
14  第2指数计算单元14 2nd index calculation unit
15  第3指数计算单元15 3rd index calculation unit
16  校正值设定单元 16 correction value setting unit

Claims (12)

  1. 一种用于对前疾病状态进行检测的检测装置,其特征在于,包括:A detecting device for detecting a pre-disease state, comprising:
    样本获取单元,该样本获取单元获取检测对象的对照样本数据和单样本数据;a sample acquisition unit that acquires comparison sample data and single sample data of the detection object;
    DNB设定单元,该DNB设定单元设定DNB成员;DNB setting unit, the DNB setting unit sets a DNB member;
    指标计算单元,该指标计算单元通过计算对照样本数据和单样本数据的概率分布之间的距离来得到复合指标;以及An index calculation unit that obtains a composite index by calculating a distance between a probability distribution of the comparison sample data and the single sample data;
    状态判别单元,该状态判别单元当所述复合指标超过阈值时,判断检测对象处于前疾病状态。The state discriminating unit determines that the detection target is in the pre-disease state when the composite index exceeds the threshold.
  2. 如权利要求1所述的检测装置,其特征在于,The detecting device according to claim 1, wherein
    所述指标计算单元包括:The indicator calculation unit includes:
    第1指数计算单元,该第1指数计算单元将对照样本数据和单样本数据中DNB成员的概率分布之间的距离作为第1指数;a first index calculation unit that uses a distance between the comparison sample data and a probability distribution of the DNB members in the one-sample data as the first index;
    第2指数计算单元,该第2指数计算单元将对照样本数据中DNB成员和非DNB成员的概率分布之间的距离作为第2指数;a second index calculation unit that compares a distance between a probability distribution of a DNB member and a non-DNB member in the sample data as a second index;
    第3指数计算单元,该第3指数计算单元将对照样本数据和单样本数据中非DNB成员的概率分布之间的距离作为第3指数;以及a third index calculation unit that uses, as a third index, a distance between the probability distribution of the non-DNB members in the comparison sample data and the one-sample data;
    校正值设定单元,该校正值设定单元对校正值进行设定,a correction value setting unit that sets the correction value,
    该指标计算单元基于第1指数、第2指数、第3指数、及校正值得到复合指标。The index calculation unit obtains a composite index based on the first index, the second index, the third index, and the correction value.
  3. 如权利要求2所述的检测装置,其特征在于,The detecting device according to claim 2, wherein
    所述指标计算单元将所述第1指数、所述第2指数、及所述第3指数与所述校正值之和的倒数的乘积作为复合指标。The index calculation unit uses a product of the first index, the second index, and the reciprocal of the sum of the third index and the correction value as a composite index.
  4. 如权利要求1~3中任意一项所述的检测装置,其特征在于,The detecting device according to any one of claims 1 to 3, characterized in that
    所述概率分布之间的距离是KL距离。 The distance between the probability distributions is the KL distance.
  5. 如权利要求2所述的检测装置,其特征在于,The detecting device according to claim 2, wherein
    所述校正值是小于等于1的正数。The correction value is a positive number equal to or less than one.
  6. 如权利要求5所述的检测装置,其特征在于,A detecting device according to claim 5, wherein
    所述校正值是0.01。The correction value is 0.01.
  7. 如权利要求1所述的检测装置,其特征在于,The detecting device according to claim 1, wherein
    当判断检测对象处于前疾病状态时,所述状态判别单元发出预警信号。The state discriminating unit issues an early warning signal when it is judged that the detection subject is in the pre-disease state.
  8. 如权利要求1所述的检测装置,其特征在于,The detecting device according to claim 1, wherein
    还包括输出单元,该输出单元以模拟显示的方式输出DNB成员的概率分布。Also included is an output unit that outputs a probability distribution of the DNB members in an analog display.
  9. 一种用于对前疾病状态进行检测的检测方法,其特征在于,包括以下步骤:A detection method for detecting a pre-disease state, characterized in that it comprises the following steps:
    样本获取步骤,获取检测对象的对照样本数据和单样本数据;a sample acquisition step of obtaining comparison sample data and single sample data of the detection object;
    DNB设定步骤,设定DNB成员;DNB setting steps to set the DNB member;
    指标计算步骤,通过计算对照样本数据和单样本数据的概率分布之间的距离来得到复合指标;以及An indicator calculation step of obtaining a composite index by calculating a distance between a probability distribution of the control sample data and the single sample data;
    状态判别步骤,当所述复合指标超过阈值时判断检测对象处于前疾病状态。The state discriminating step determines that the detected subject is in the pre-disease state when the composite index exceeds the threshold.
  10. 如权利要求9所述的检测方法,其特征在于,The detecting method according to claim 9, wherein
    所述指标计算步骤包括:The indicator calculation step includes:
    第1指数计算步骤,将对照样本数据和单样本数据中DNB成员的概率分布之间的距离作为第1指数;a first index calculation step of using a distance between the comparison sample data and a probability distribution of the DNB members in the single sample data as the first index;
    第2指数计算步骤,将对照样本数据中DNB成员和非DNB成员的概率分布之间的距离作为第2指数;a second index calculation step of using a distance between a probability distribution of a DNB member and a non-DNB member in the sample data as a second index;
    第3指数计算步骤,将对照样本数据和单样本数据中非DNB成员的概率分布之间的距离作为第3指数;以及a third index calculation step of using a distance between the comparison sample data and a probability distribution of non-DNB members in the one-sample data as the third index;
    校正值设定步骤,对校正值进行设定,Correction value setting step, setting the correction value,
    在该指标计算步骤中基于第1指数、第2指数、第3指数、及校正值得到复合指标。 In the index calculation step, a composite index is obtained based on the first index, the second index, the third index, and the correction value.
  11. 如权利要求9所述的检测方法,其特征在于,The detecting method according to claim 9, wherein
    在所述指标计算步骤中将所述第1指数、所述第2指数、及所述第3指数与所述校正值之和的倒数的乘积作为复合指标。In the index calculation step, a product of the first index, the second index, and the reciprocal of the sum of the third index and the correction value is used as a composite index.
  12. 如权利要求9~11中任意一项所述的检测方法,其特征在于,The detecting method according to any one of claims 9 to 11, wherein
    所述概率分布之间的距离是KL距离。 The distance between the probability distributions is the KL distance.
PCT/CN2015/071237 2014-01-21 2015-01-21 Detection device and detection method for detecting pre-disease state WO2015110018A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201410027769.2A CN104794321B (en) 2014-01-21 2014-01-21 Detection device for being detected to preceding morbid state
CN201410027769.2 2014-01-21

Publications (1)

Publication Number Publication Date
WO2015110018A1 true WO2015110018A1 (en) 2015-07-30

Family

ID=53559112

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2015/071237 WO2015110018A1 (en) 2014-01-21 2015-01-21 Detection device and detection method for detecting pre-disease state

Country Status (2)

Country Link
CN (1) CN104794321B (en)
WO (1) WO2015110018A1 (en)

Families Citing this family (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106909767B (en) * 2015-12-21 2021-11-05 北京旷博生物技术股份有限公司 System for classifying hepatitis B-related cirrhosis
CN106096276B (en) * 2016-06-13 2018-12-04 江南大学 A kind of swin flu epidemic situation early warning models based on dynamic network marker
CN109147939A (en) * 2018-09-21 2019-01-04 宜昌市疾病预防控制中心 A kind of sampling apparatus and the methods of sampling for disease control
CN109979591B (en) * 2019-03-12 2021-01-01 众安信息技术服务有限公司 Method and device for analyzing plaque progress factor based on graph neural network

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6025128A (en) * 1994-09-29 2000-02-15 The University Of Tulsa Prediction of prostate cancer progression by analysis of selected predictive parameters
CN1803085A (en) * 2005-01-14 2006-07-19 费兆馥 Human subhealth state determining method and its checking device
US20070038385A1 (en) * 2001-06-18 2007-02-15 Tatiana Nikolskaya Methods for identification of novel protein drug targets and biomarkers utilizing functional networks
CN103268431A (en) * 2013-05-21 2013-08-28 中山大学 Cancer hypotype biomarker detecting system based on student t distribution

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2009050643A1 (en) * 2007-10-16 2009-04-23 Koninklijke Philips Electronics N.V. Estimation of diagnostic markers
CN101921759B (en) * 2010-09-08 2013-05-29 南京医科大学 Serum/plasma miRNA serum marker related to cervical carcinoma and precancerous lesions thereof and application thereof

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6025128A (en) * 1994-09-29 2000-02-15 The University Of Tulsa Prediction of prostate cancer progression by analysis of selected predictive parameters
US20070038385A1 (en) * 2001-06-18 2007-02-15 Tatiana Nikolskaya Methods for identification of novel protein drug targets and biomarkers utilizing functional networks
CN1803085A (en) * 2005-01-14 2006-07-19 费兆馥 Human subhealth state determining method and its checking device
CN103268431A (en) * 2013-05-21 2013-08-28 中山大学 Cancer hypotype biomarker detecting system based on student t distribution

Non-Patent Citations (4)

* Cited by examiner, † Cited by third party
Title
CHEN, LUONAN ET AL.: "Detecting early-warning signals for sudden deterioration of complex diseases by dynamical network biomarkers", SCIENTIFIC REPORTS, vol. 2, no. 342, 29 March 2012 (2012-03-29), pages 1 - 8, XP055192452 *
LIU, RUI ET AL.: "Dynamical network biomarkers for identifying critical transitions and their driving networks of biologic processes", QUANTITATIVE BIOLOGY, vol. 1, no. 2, 5 June 2013 (2013-06-05), pages 105 - 114, XP055215527 *
LIU, RUI ET AL.: "Identifying critical transitions and their leading biomolecular networks in complex diseases", SCIENTIFIC REPORTS, vol. 2, no. 813, 10 December 2012 (2012-12-10), pages 1 - 9, XP055211831 *
ZENG, TAO ET AL.: "Network biomarkers reveal dysfunctional gene regulationsduring disease progression", FEBS JOURNAL, vol. 280, no. 22, 22 October 2013 (2013-10-22), pages 5682 - 5695, XP055215529 *

Also Published As

Publication number Publication date
CN104794321B (en) 2018-11-09
CN104794321A (en) 2015-07-22

Similar Documents

Publication Publication Date Title
Madsen et al. Diagnostic properties of metabolic perturbations in rheumatoid arthritis
Liquet et al. A novel approach for biomarker selection and the integration of repeated measures experiments from two assays
Chung et al. How variability in clinical phenotypes should guide research into disease mechanisms in asthma
Jarvis et al. Development and validation of a decision tree early warning score based on routine laboratory test results for the discrimination of hospital mortality in emergency medical admissions
WO2015110018A1 (en) Detection device and detection method for detecting pre-disease state
MX2008013978A (en) Methods and apparatus for identifying disease status using biomarkers.
Stiegel et al. Analysis of inflammatory cytokines in human blood, breath condensate, and urine using a multiplex immunoassay platform
WO2023087917A1 (en) Cognitive decision-making evaluation method and system based on multi-dimensional hierarchical drift diffusion model
Abdel-Aziz et al. eNose breath prints as a surrogate biomarker for classifying patients with asthma by atopy
Proboszcz et al. Relationship between blood and induced sputum eosinophils, bronchial hyperresponsiveness and reversibility of airway obstruction in mild-to-moderate chronic obstructive pulmonary disease
Ooka et al. Proteomics endotyping of infants with severe bronchiolitis and risk of childhood asthma
Huang et al. Application of artificial intelligence modeling technology based on multi-omics in noninvasive diagnosis of inflammatory bowel disease
Liu et al. Mapping the Chinese version of the EORTC QLQ-BR53 onto the EQ-5D-5L and SF-6D utility scores
Hu et al. Comparison of the clinical characteristics and comprehensive assessments of the 2011 and 2017 GOLD classifications for patients with COPD in China
Tan et al. Current applications of artificial intelligence combined with urine detection in disease diagnosis and treatment
Barbour et al. Molecular models of multiple sclerosis severity identify heterogeneity of pathogenic mechanisms
Klotsche et al. oA novel nonparametric approach for estimating cut-offs in continuous risk indicators with application to diabetes epidemiology
Wang et al. Statistical inference methods for recurrent event processes with shape and size parameters
Tallarita et al. Bayesian autoregressive frailty models for inference in recurrent events
Kim et al. Classification using longitudinal trajectory of biomarker in the presence of detection limits
San Lucas et al. Classifying mild traumatic brain injuries with functional network analysis
CN111582328A (en) COVID-19 classification identification method based on lung CT image
Neves et al. Artificial neural networks in diagnosis of liver diseases
TW202030741A (en) Liver fibrosis assessment model, liver fibrosis assessment system and liver fibrosis assessment method
Shen et al. The use of statistics in heart rhythm research: a review

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 15740334

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

32PN Ep: public notification in the ep bulletin as address of the adressee cannot be established

Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC (EPO FORM 1205A DATED 271016)

122 Ep: pct application non-entry in european phase

Ref document number: 15740334

Country of ref document: EP

Kind code of ref document: A1