CN111110269A - Infant pneumonia rapid intelligent diagnosis method based on mixed deep learning model - Google Patents
Infant pneumonia rapid intelligent diagnosis method based on mixed deep learning model Download PDFInfo
- Publication number
- CN111110269A CN111110269A CN202010013667.0A CN202010013667A CN111110269A CN 111110269 A CN111110269 A CN 111110269A CN 202010013667 A CN202010013667 A CN 202010013667A CN 111110269 A CN111110269 A CN 111110269A
- Authority
- CN
- China
- Prior art keywords
- data
- pneumonia
- output
- input
- neural network
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 206010035664 Pneumonia Diseases 0.000 title claims abstract description 73
- 238000000034 method Methods 0.000 title claims abstract description 52
- 238000003745 diagnosis Methods 0.000 title claims abstract description 34
- 238000013136 deep learning model Methods 0.000 title claims abstract description 14
- 238000013528 artificial neural network Methods 0.000 claims abstract description 48
- 238000012549 training Methods 0.000 claims abstract description 40
- 230000015654 memory Effects 0.000 claims abstract description 19
- 230000002159 abnormal effect Effects 0.000 claims abstract description 8
- 238000004140 cleaning Methods 0.000 claims abstract description 4
- 230000000241 respiratory effect Effects 0.000 claims description 44
- 230000036760 body temperature Effects 0.000 claims description 22
- 210000002569 neuron Anatomy 0.000 claims description 17
- 230000008569 process Effects 0.000 claims description 15
- 238000004364 calculation method Methods 0.000 claims description 13
- 230000006870 function Effects 0.000 claims description 13
- 230000004913 activation Effects 0.000 claims description 12
- 230000036387 respiratory rate Effects 0.000 claims description 12
- 238000012545 processing Methods 0.000 claims description 8
- 238000012360 testing method Methods 0.000 claims description 7
- 210000004027 cell Anatomy 0.000 claims description 6
- 238000005516 engineering process Methods 0.000 claims description 6
- 230000005236 sound signal Effects 0.000 claims description 6
- 230000036391 respiratory frequency Effects 0.000 claims description 5
- 230000029058 respiratory gaseous exchange Effects 0.000 claims description 5
- 238000001228 spectrum Methods 0.000 claims description 5
- 230000000644 propagated effect Effects 0.000 claims description 4
- 241001522296 Erithacus rubecula Species 0.000 claims description 3
- 230000009471 action Effects 0.000 claims description 3
- 230000035565 breathing frequency Effects 0.000 claims description 3
- 230000008859 change Effects 0.000 claims description 3
- 150000001875 compounds Chemical class 0.000 claims description 3
- 125000004122 cyclic group Chemical group 0.000 claims description 3
- 238000001514 detection method Methods 0.000 claims description 3
- 238000001914 filtration Methods 0.000 claims description 3
- 239000011159 matrix material Substances 0.000 claims description 3
- 230000001502 supplementing effect Effects 0.000 claims description 3
- 230000009466 transformation Effects 0.000 claims description 3
- 238000002372 labelling Methods 0.000 claims description 2
- 230000006378 damage Effects 0.000 abstract description 4
- 208000027418 Wounds and injury Diseases 0.000 abstract description 3
- 208000014674 injury Diseases 0.000 abstract description 3
- 230000008030 elimination Effects 0.000 abstract 1
- 238000003379 elimination reaction Methods 0.000 abstract 1
- 230000008518 non respiratory effect Effects 0.000 abstract 1
- 208000037656 Respiratory Sounds Diseases 0.000 description 8
- 206010037833 rales Diseases 0.000 description 8
- 210000004072 lung Anatomy 0.000 description 6
- 238000011161 development Methods 0.000 description 3
- 201000010099 disease Diseases 0.000 description 3
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 3
- 201000001178 Bacterial Pneumonia Diseases 0.000 description 2
- 201000004813 Bronchopneumonia Diseases 0.000 description 2
- 208000035473 Communicable disease Diseases 0.000 description 2
- 206010061218 Inflammation Diseases 0.000 description 2
- 208000005392 Spasm Diseases 0.000 description 2
- 239000003242 anti bacterial agent Substances 0.000 description 2
- 229940088710 antibiotic agent Drugs 0.000 description 2
- 238000013473 artificial intelligence Methods 0.000 description 2
- 230000008901 benefit Effects 0.000 description 2
- 210000004556 brain Anatomy 0.000 description 2
- 210000000621 bronchi Anatomy 0.000 description 2
- 239000003814 drug Substances 0.000 description 2
- 238000000605 extraction Methods 0.000 description 2
- 230000004054 inflammatory process Effects 0.000 description 2
- 230000003902 lesion Effects 0.000 description 2
- 230000007787 long-term memory Effects 0.000 description 2
- 230000004048 modification Effects 0.000 description 2
- 238000012986 modification Methods 0.000 description 2
- 230000002685 pulmonary effect Effects 0.000 description 2
- 230000000306 recurrent effect Effects 0.000 description 2
- 230000028327 secretion Effects 0.000 description 2
- 230000006403 short-term memory Effects 0.000 description 2
- 210000003437 trachea Anatomy 0.000 description 2
- 238000012795 verification Methods 0.000 description 2
- 206010001029 Acute pulmonary oedema Diseases 0.000 description 1
- 208000035143 Bacterial infection Diseases 0.000 description 1
- 208000004262 Food Hypersensitivity Diseases 0.000 description 1
- 206010016946 Food allergy Diseases 0.000 description 1
- 206010016952 Food poisoning Diseases 0.000 description 1
- 208000019331 Foodborne disease Diseases 0.000 description 1
- 206010019280 Heart failures Diseases 0.000 description 1
- 206010020751 Hypersensitivity Diseases 0.000 description 1
- 208000007101 Muscle Cramp Diseases 0.000 description 1
- 206010028980 Neoplasm Diseases 0.000 description 1
- 206010030113 Oedema Diseases 0.000 description 1
- 230000001154 acute effect Effects 0.000 description 1
- 208000026935 allergic disease Diseases 0.000 description 1
- 230000007815 allergy Effects 0.000 description 1
- 238000004458 analytical method Methods 0.000 description 1
- 238000002555 auscultation Methods 0.000 description 1
- 208000022362 bacterial infectious disease Diseases 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- 230000005540 biological transmission Effects 0.000 description 1
- 239000008280 blood Substances 0.000 description 1
- 210000004369 blood Anatomy 0.000 description 1
- 230000036772 blood pressure Effects 0.000 description 1
- 201000009267 bronchiectasis Diseases 0.000 description 1
- 210000003123 bronchiole Anatomy 0.000 description 1
- 238000013480 data collection Methods 0.000 description 1
- 230000008260 defense mechanism Effects 0.000 description 1
- 238000010586 diagram Methods 0.000 description 1
- 229940079593 drug Drugs 0.000 description 1
- 238000002474 experimental method Methods 0.000 description 1
- 235000020932 food allergy Nutrition 0.000 description 1
- 230000036541 health Effects 0.000 description 1
- 230000036737 immune function Effects 0.000 description 1
- 230000006872 improvement Effects 0.000 description 1
- 208000015181 infectious disease Diseases 0.000 description 1
- 210000002751 lymph Anatomy 0.000 description 1
- 238000010801 machine learning Methods 0.000 description 1
- 230000007246 mechanism Effects 0.000 description 1
- 210000001370 mediastinum Anatomy 0.000 description 1
- 230000004060 metabolic process Effects 0.000 description 1
- 244000005700 microbiome Species 0.000 description 1
- 238000003062 neural network model Methods 0.000 description 1
- 210000000056 organ Anatomy 0.000 description 1
- 230000001575 pathological effect Effects 0.000 description 1
- 231100000572 poisoning Toxicity 0.000 description 1
- 230000000607 poisoning effect Effects 0.000 description 1
- 208000008128 pulmonary tuberculosis Diseases 0.000 description 1
- 230000004044 response Effects 0.000 description 1
- 230000002441 reversible effect Effects 0.000 description 1
- 210000002460 smooth muscle Anatomy 0.000 description 1
- 230000003068 static effect Effects 0.000 description 1
- 230000008961 swelling Effects 0.000 description 1
- 208000024891 symptom Diseases 0.000 description 1
- 238000004148 unit process Methods 0.000 description 1
Images
Classifications
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61B—DIAGNOSIS; SURGERY; IDENTIFICATION
- A61B7/00—Instruments for auscultation
- A61B7/003—Detecting lung or respiration noise
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61B—DIAGNOSIS; SURGERY; IDENTIFICATION
- A61B5/00—Measuring for diagnostic purposes; Identification of persons
- A61B5/02—Detecting, measuring or recording pulse, heart rate, blood pressure or blood flow; Combined pulse/heart-rate/blood pressure determination; Evaluating a cardiovascular condition not otherwise provided for, e.g. using combinations of techniques provided for in this group with electrocardiography or electroauscultation; Heart catheters for measuring blood pressure
- A61B5/0205—Simultaneously evaluating both cardiovascular conditions and different types of body conditions, e.g. heart and respiratory condition
- A61B5/02055—Simultaneously evaluating both cardiovascular condition and temperature
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61B—DIAGNOSIS; SURGERY; IDENTIFICATION
- A61B5/00—Measuring for diagnostic purposes; Identification of persons
- A61B5/72—Signal processing specially adapted for physiological signals or for diagnostic purposes
- A61B5/7235—Details of waveform analysis
- A61B5/725—Details of waveform analysis using specific filters therefor, e.g. Kalman or adaptive filters
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61B—DIAGNOSIS; SURGERY; IDENTIFICATION
- A61B5/00—Measuring for diagnostic purposes; Identification of persons
- A61B5/72—Signal processing specially adapted for physiological signals or for diagnostic purposes
- A61B5/7235—Details of waveform analysis
- A61B5/7253—Details of waveform analysis characterised by using transforms
- A61B5/7257—Details of waveform analysis characterised by using transforms using Fourier transforms
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61B—DIAGNOSIS; SURGERY; IDENTIFICATION
- A61B5/00—Measuring for diagnostic purposes; Identification of persons
- A61B5/72—Signal processing specially adapted for physiological signals or for diagnostic purposes
- A61B5/7235—Details of waveform analysis
- A61B5/7264—Classification of physiological signals or data, e.g. using neural networks, statistical classifiers, expert systems or fuzzy systems
- A61B5/7267—Classification of physiological signals or data, e.g. using neural networks, statistical classifiers, expert systems or fuzzy systems involving training the classification device
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/044—Recurrent networks, e.g. Hopfield networks
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/08—Learning methods
- G06N3/084—Backpropagation, e.g. using gradient descent
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/03—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
- G10L25/24—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being the cepstrum
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/27—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the analysis technique
- G10L25/30—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the analysis technique using neural networks
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/48—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use
- G10L25/51—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use for comparison or discrimination
- G10L25/66—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use for comparison or discrimination for extracting parameters related to health condition
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61B—DIAGNOSIS; SURGERY; IDENTIFICATION
- A61B5/00—Measuring for diagnostic purposes; Identification of persons
- A61B5/02—Detecting, measuring or recording pulse, heart rate, blood pressure or blood flow; Combined pulse/heart-rate/blood pressure determination; Evaluating a cardiovascular condition not otherwise provided for, e.g. using combinations of techniques provided for in this group with electrocardiography or electroauscultation; Heart catheters for measuring blood pressure
- A61B5/024—Detecting, measuring or recording pulse rate or heart rate
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61B—DIAGNOSIS; SURGERY; IDENTIFICATION
- A61B5/00—Measuring for diagnostic purposes; Identification of persons
- A61B5/08—Detecting, measuring or recording devices for evaluating the respiratory organs
- A61B5/0816—Measuring devices for examining respiratory frequency
Landscapes
- Health & Medical Sciences (AREA)
- Engineering & Computer Science (AREA)
- Life Sciences & Earth Sciences (AREA)
- Physics & Mathematics (AREA)
- Artificial Intelligence (AREA)
- General Health & Medical Sciences (AREA)
- Biomedical Technology (AREA)
- Molecular Biology (AREA)
- Signal Processing (AREA)
- Biophysics (AREA)
- Public Health (AREA)
- Physiology (AREA)
- Computational Linguistics (AREA)
- Veterinary Medicine (AREA)
- Animal Behavior & Ethology (AREA)
- Surgery (AREA)
- Medical Informatics (AREA)
- Heart & Thoracic Surgery (AREA)
- Evolutionary Computation (AREA)
- Pathology (AREA)
- Theoretical Computer Science (AREA)
- Mathematical Physics (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Multimedia (AREA)
- Acoustics & Sound (AREA)
- Human Computer Interaction (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Cardiology (AREA)
- Psychiatry (AREA)
- Computing Systems (AREA)
- Data Mining & Analysis (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Software Systems (AREA)
- Pulmonology (AREA)
- Fuzzy Systems (AREA)
- Epidemiology (AREA)
- Image Analysis (AREA)
Abstract
The invention discloses a rapid and intelligent infant pneumonia diagnosis method based on a mixed deep learning model, which solves the problem that the body of an infant is damaged due to overlong infant pneumonia diagnosis time or misdiagnosis at present. The method comprises the following steps: the method comprises the following steps: measuring and collecting various physiological index data of the infants, and marking the data according to the fact whether the infants suffer from pneumonia or not; step two: performing data cleaning and abnormal data elimination on the non-respiratory audio data to construct a data set for training a pneumonia diagnosis model; step three: constructing a data set for training a spiral voice recognition model; step four: training a long-time memory neural network LSTM; step five: and (4) carrying out deep neural network DNN training for interpreting various physiological index data of the patient so as to judge whether the patient suffers from pneumonia. The invention can effectively improve the diagnosis speed and the correct rate of infant pneumonia, and avoid serious injury to the infant body caused by overlong diagnosis waiting time or misdiagnosis.
Description
Technical Field
The invention belongs to the field of intelligent medical treatment, and relates to a rapid intelligent diagnosis method for infant pneumonia based on a mixed deep learning model.
Background
Pneumonia refers to inflammation of the terminal airways, alveoli and lung interstitium, and may be caused by disease microorganisms, physicochemical factors, immune injury, allergy and drugs. Bacterial pneumonia is the most common pneumonia and is also one of the most common infectious diseases. Pneumonia in daily life is mainly pneumonia caused by bacterial infection. Before the application of antibiotics, bacterial pneumonia threatens human health greatly, and the occurrence and development of antibiotics obviously reduce the death rate of pneumonia. In daily life, pneumonia is a common disease of infants, and is high in the first place among hospitalized infants, and is also one of diseases causing death of infants. The symptoms of pneumonia in infants can easily be mixed with colds. Therefore, the rapid intelligent diagnosis technology has a vital role in finding the state of an illness and treating the illness according to the illness. Avoiding the serious damage to the body of the infant caused by the overlong time for waiting for diagnosis or misdiagnosis when going to a hospital for a doctor.
Disclosure of Invention
In addition, the infant is ill and cannot be expressed by the speech of the infant, the body is damaged by the pneumonia, misdiagnosis is easy to occur, and therefore rapid diagnosis is needed. The invention provides a rapid and intelligent infant pneumonia diagnosis method based on a mixed deep learning model.
In order to achieve the purpose, the technical scheme adopted by the invention is as follows:
a rapid intelligent diagnosis method for infant pneumonia based on a hybrid deep learning model comprises the following steps:
the method comprises the following steps: measuring and collecting various physiological indexes of the infant, including respiratory audio data, body temperature, heart rate and respiratory frequency;
step two: cleaning the obtained infant non-audio physiological data, removing abnormal data, marking the data, and constructing an infant pneumonia diagnosis physiological index data set, wherein the data set comprises a training set and a testing set and is used for training a Deep Neural Network (DNN);
step three: analyzing the respiratory audio data by utilizing Fourier transform, extracting corresponding characteristic information, and constructing a data set about the spiral sound identification, wherein the data set also comprises a training set and a testing set and is used for training a long-time memory network LSTM;
step four: training a long-time memory neural network LSTM by using the frequency domain characteristics of the respiratory audio and the labels thereof, and judging the spiroid of the respiratory audio data;
step five: and establishing and training a deep neural network DNN for identifying pneumonia by using a spiral voice identification result of the breathing audio data and physiological indexes of body temperature, heart rate and breathing frequency, and interpreting new input data so as to judge whether the patient has pneumonia.
Further, the first step specifically comprises: according to local conditions, various physiological indexes of infant pneumonia patients and normal infants are collected, and the method comprises the following steps: heart rate, body temperature, respiratory rate, respiratory audio data, and then labeling and storing the data by the category of whether or not pneumonia is present.
Further, the second step is specifically: processing data except respiratory audio by using filtering, linear transformation and standardization technologies, and eliminating data groups corresponding to abnormal data and invalid data for deep neural network DNN training;
assuming that there are N detection samples and M characteristic parameters, a matrix from which the original data can be obtained is:
after standardization treatment, each index is obtained:
in the formula (I), the compound is shown in the specification,and σjMean and standard deviation of the jth characteristic parameter:
further, analyzing the breathing audio data in the third step and extracting corresponding characteristic information specifically include:
for respiratory audio data, feature Frequency domain feature extraction (using Mel-Frequency Cepstrum coefficients) is performed using fourier transform, and in the field of sound processing, Mel-Frequency Cepstrum (Mel-Frequency Cepstrum) is a linear transform of a nonlinear Mel scale (Mel scale) logarithmic energy spectrum based on sound Frequency. The Mel-frequency cepstral Coefficient (MFCC) is a Coefficient constituting the Mel-frequency cepstral. For training of long and short term memory networks LSTM, the solution of MFCC typically comprises the following steps:
a. decomposing a voice signal into a plurality of frames;
b. pre-emphasizing the voice signal, and passing the voice signal through a high-pass filter;
c. performing fourier transform to transform the signal into the frequency domain, namely:
d. passing the obtained frequency spectrum of each frame through a Mel filter (triangular overlapping window) to obtain Mel scale;
e. extracting logarithmic energy on each mel scale;
f. the result obtained above is subjected to inverse discrete fourier transform and transformed into the cepstrum domain, that is:
MFCC is the magnitude (amplitudes) of this cepstral plot. Typically 12 coefficients are used, superimposed with the frame energy to obtain 13-dimensional coefficients.
Further, the fourth step is specifically:
training a long-time memory neural network LSTM by using the frequency domain characteristics of the respiratory audio and the labels thereof to obtain a first part of a model, namely a part for recognizing the spiral sound;
the LSTM has three gate structures, namely an input gate, an output gate and a forgetting gate;
(4) forget the door:
let the network "forget" the information that was not useful in the previous step, it will depend on the current input x(t)State of the last moment c(t-1)Output h of the previous moment(t-1)The input part of the forgetting gate mainly comprises a hidden state h at the previous moment(t-1)And input data x of the current time(t)It uses an activation function sigma, generally a sigmoid function, to ensure the output of the forgetting gate is [0,1 ] due to the characteristics of the sigmoid function]Output f(t)Comprises the following steps:
f(t)=σ(Wfh(t-1)+Ufx(t)+bf) (6)
wherein Wf,Uf,bfRespectively, the cyclic weight, the input weight and the offset of the forgetting gate;
(5) an input gate:
after the neural network 'forgets' the information of the previous state, supplementing new memory; according toCurrent input x(t)State of the last moment c(t-1)Output h of the previous moment(t-1)Jointly determining useful information into a new state c(t),And stored in memory cells; the output of which comprises two parts, i(t)And a(t):
i(t)=σ(Wih(t-1)+Uix(t)+bi) (7)
a(t)=tanh(Wah(t-1)+Uax(t)+ba) (8)
Wherein, Wi,WaFor different connection weights, U, of the previous statei,UaFor different connection weights of the inputs, ib,abRespectively, a respective bias portion.
(6) An output gate:
using status cells c(t)Determining how much information is output to hidden layer state unit h(t)When the user passes through the forgetting gate and the input gate, the results of the two gates act on the state unit c(t),c(t)The mathematical expression implemented is:
c(t)=i(t)Θa(t)+ftΘc(t)(9)
wherein i(t)Θa(t)For remaining in state unit c after passing through input gate(t)Information of (1), ftΘc(t-1)To remain at c after passing through a forgetting door(t)The information in (1). At the same time, the layer state h is hidden(t)Updated mainly by two parts, x(t)、h(t-1)Result of the combined action o(t)And status unit c(t)To determine o(t)And h(t)The formula implemented is:
o(t)=σ(Woh(t-1)+Uox(t)+bo) (10)
h(t)=o(t)Θtanh(c(t)) (11)
wherein U iso,Wo,boRespectively inputting x for the current time(t)Output gate weight, last hidden state unit h(t-1)Round robin weight and output gate offset;
and (3) processing the respiratory audio signal according to the flow, inputting the respiratory audio signal into an LSTM network, if the output result is 1, judging that the respiratory audio contains spiral sound, and if the output result is 0, judging that the respiratory audio does not contain spiral sound.
Further, the fifth step is specifically:
inputting data including body temperature, heart rate, respiratory rate and whether the respiratory audio data contain spiral sound (1 or 0) into a deep neural network DNN for training to obtain a second part of the model, namely a pneumonia diagnosis part;
(1) initialization
The convergence of the network is greatly influenced by the setting of the initial value of the connection weight (including a threshold value); the net input value of each neuron is small due to the expected initial value so as to ensure that the neuron works in an activation region, namely a region with sensitive change, and the initial value of each connection weight is a random number;
(2) forward pass process computation
(a) Input learning sample pair (X)q,Tq) Wherein Q is 1,2, … Q;
(b) for each input sample, sequentially calculating the output value of each neuron according to the following formula;
in the formula:output of each hidden layer of DNN, wji,wkj,wlkThe weight coefficients of each hidden layer are respectively, and because the signals in the neural network are transmitted layer by layer, the output of the previous hidden layer in the neural network is used as the input of the next hidden layer, so thatIs the output of the first hidden layer, is also the input of the second hidden layer,is the output of the second hidden layer and is also the input of the third hidden layer;
(3) error back propagation
Calculating equivalent errors of neurons of each layer from the output layer to the input layer, then returning to the step (2), and performing forward propagation calculation and error backward propagation on other learning sample pairs until all α learning sample pairs are similarly calculated;
(4) adjusting connection weight between layers
And modifying the connection weight of each layer according to the formula (13), namely:
in the formula: w is aik(n+1),wkj(n+1),wji(n +1) are the weight coefficients of the hidden layers after the back propagation update by the gradient method, wik(n),wkj(n),wji(n) the current weight coefficients of the hidden layers, η represents the step size of the gradient method,respectively representing gradient information of each hidden layer after being propagated through the neural network;
(5) returning to the step (2), performing forward calculation according to the new connection weight, and if each learning sample pair meets the precision requirement, that is:
wherein, p is 1,2, … …, α, l is 1,2, … …, m, (α, m respectively represents the number of the activation units of each hidden layer and the number of the hidden layers of the neuron), epsilon is a certain value given according to the precision requirement, when the above formula is satisfied, the learning of the network is completed, otherwise, the calculation is repeated until the requirement is satisfied.
The calculation process of the Deep Neural Network (DNN) is actually an updating process of the weight and the threshold of the network, and the weight and the threshold which enable the overall error to be minimum are found;
the final diagnostic model (classification recognition model) is:
in the formula: f. of1,f2,f3Respectively representing the activation functions of all the hidden layers, and other symbols are described in the description;
inputting the output result (1 or 0) of the LSTM network and physiological indexes of body temperature, heart rate and respiratory rate into the deep neural network, if the output is 1, judging that the patient has pneumonia, and if the output is 0, judging that the patient does not have pneumonia.
The working principle of the invention is as follows:
pulmonary rale (whorl) refers to auscultation of breath sounds, when air passes through the trachea containing secretions, or the bronchi narrowed due to spasm or swelling, an additional breath mure, i.e., rale, is heard on the basis of the breath sounds. According to the nature of the pitch, the pitch can be classified into dry pitch and wet pitch. Dry rale is due to the narrowing or incomplete blockage of the trachea, bronchi or bronchioles, and turbulence in the airflow during inhalation or exhalation. The pathological basis is inflammation-induced mucosal congestion edema, secretion increase and bronchial smooth muscle spasm; intraluminal foreign matter, tumor obstruction, and swollen lymph or mediastinum compress the airways. Localized moist rale in the lung only suggests local lesions in the lung, such as pneumonia, pulmonary tuberculosis or bronchiectasis; moist and crackle at both sides are mostly seen in blood stasis of lung and bronchopneumonia caused by heart failure; for example, the wet-swollen and rale-swollen lung fields are often seen in acute pulmonary edema or severe bronchopneumonia. Therefore, pulmonary rale is an important index in the process of pneumonia diagnosis.
In order to integrate the typical indexes of rale and rale, a plurality of physiological indexes of body temperature, heart rate, respiratory rate and the like when the infant suffers from pneumonia, a classification recognition model which is established and trained by the body temperature, heart rate and respiratory rate physiological indexes including whether the infant contains spiral sound or not is established by adopting a deep neural network, data is interpreted, and whether the patient suffers from pneumonia or not is judged.
The deep neural network is a machine learning technology used for establishing and simulating a neural network for analyzing and learning of the human brain and simulating the mechanism of the human brain to explain data. Is part of the Artificial Intelligence (AI) framework. By means of the strong calculation and analysis capability of the deep neural network DNN, the spiral sound can be rapidly and effectively intelligently identified.
DNN implements the concept of multi-layer network learning. When an input mode of the network is given, the input mode is transmitted to the hidden layer unit from the input layer unit, processed layer by the hidden layer unit and then transmitted to the output layer unit, and an output mode is generated after the output layer unit processes the hidden layer unit and is a layer-by-layer state updating process called forward propagation. If the output response has errors with the expected output mode and does not meet the requirements, the error is transferred to the error back propagation, the error values are transmitted layer by layer along the connecting passage, and the connecting weight values of each layer are corrected. For a given set of training patterns, the network is continuously trained by using the training patterns, the forward propagation and error back propagation processes are repeated, and when the requirements of all the training patterns are met, the DNN network is considered to be trained. There are actually two types of signals flowing in a BP network:
(1) the working signal propagates in the forward direction. The input signal passes from the input layer through the hidden layer to the output layer, producing an output signal at the output. The weight value of the network is fixed and invariable in the forward transmission process of the signal, and the state of each layer of neuron only affects the state of the next layer of neuron. If the desired output is not available at the output layer, the input error signal back-propagation process is entered.
(2) The error signal propagates in reverse. The difference between the actual output and the expected output of the network is an error signal, and the error signal is propagated from the output end layer by layer. In the process of error signal back propagation, the weight of the network is adjusted by error feedback. The actual output of the network is closer to the expected output through continuous modification of the weight values.
The invention has the beneficial effects that:
the invention combines a long-time memory network LSTM and a deep neural network DNN (hybrid deep learning model) to carry out intelligent diagnosis on infant pneumonia. According to the existing medical experience, the existence of spiral sound in the breath sound is an important basis for judging that the infant has pneumonia, and the spiral sound is an abnormal sound caused by lung lesion contained in the breath sound and is generally judged by an experienced doctor with a stethoscope. The automatic identification of the spiral sound is similar to voice identification, so the method adopts the most common long-time memory network LSTM in the voice identification to identify the spiral sound of the breathing voice frequency. Meanwhile, the diagnosis of pneumonia is based on several other physiological indicators, including: in order to realize intelligent diagnosis of pneumonia, the body temperature, the heart rate and the respiratory rate need to judge other physiological indexes besides the existence of the spiral sound, therefore, the invention adopts a deep neural network to establish a classification recognition model for the spiral sound recognition result (the existence is 1, and the nonexistence is 0) and other physiological indexes, thereby finally realizing intelligent diagnosis of pneumonia. In summary, the combination of LSTM and DNN for intelligent pneumonia diagnosis comes from understanding the actual pneumonia diagnosis process, and makes use of the advantages of LSTM (LSTM excels in classifying time series signals, such as respiratory audio) and DNN (DNN excels in classifying static data, such as digital identification). Therefore, the invention is an innovation combining actual diagnosis and treatment knowledge and artificial intelligence technology. The invention can effectively improve the diagnosis speed and the correct rate of infant pneumonia, and avoid serious injury to the infant body caused by overlong diagnosis waiting time or misdiagnosis.
Drawings
FIG. 1 is a flow chart of the operation of the present invention;
FIG. 2 is a flow chart of a hybrid deep learning model pneumonia diagnosis of the present invention;
fig. 3 is a schematic diagram of a Deep Neural Network (DNN) architecture.
Detailed Description
In order to make the objects, technical solutions and advantages of the present invention more apparent, the present invention is described in further detail below with reference to the accompanying drawings and embodiments. It should be understood that the specific embodiments described herein are merely illustrative of the invention and are not intended to limit the invention. In addition, the ordinal terms such as step one, step two, etc. in this document are established based on the step relationships shown in the flowcharts.
In the present invention, the terms such as "fourier transform", "long-short term memory network", etc. conform to the meanings defined in the relevant technical standards of the terms, and it is obvious to those skilled in the art that the specific meanings of the above terms in the present invention can be understood according to specific situations.
Example 1
Referring to fig. 1-2, a method for rapidly and intelligently diagnosing infant pneumonia based on a hybrid deep learning model includes the following steps:
infants refer to children born to the age of 1 year old, and young children refer to children 1-3 years old. Infants have the following physiological characteristics:
firstly, the growth and development period is rapid, the metabolism is vigorous, but the development of each organ is incomplete, and the function is immature.
Secondly, the immune function is low, the defense mechanism is poor, so the medicine is easy to be infected by acute infectious diseases and easy to cause food allergy, poisoning and infection.
The normal physiological index of human body is an important standard for judging whether the human body is healthy or not. The contents include body temperature, heart rate, blood pressure, etc. For example, the normal body temperature is 36-37 deg.C, the normal heart rate is 110-.
The physiological indexes of the infant mainly refer to respiratory audio data, body temperature, heart rate and respiratory frequency. The special data acquisition instrument mainly comprises a respiratory audio detector, a body temperature measurer, a heart rate detector and a respiratory frequency sensor.
The method comprises the following steps: according to local conditions, 300 cases of physiological indexes of infant pneumonia patients and normal infants are collected by visiting local large hospitals (data collection of 2 months in local hospitals) and the method comprises the following steps: the method comprises the steps of obtaining heart rate, body temperature, respiratory rate and respiratory audio data, marking the data according to the type of pneumonia, and dividing the data into a training set (comprising physiological index data of 260 pneumonia patients and physiological index data of 260 normal infants) and a testing set (comprising physiological index data of 40 pneumonia patients and physiological index data of 40 normal infants); because of the enormous amount of data, some experimental data are provided as shown in table 1:
table 1: part of the collected data for experiments
Heart rate | Body temperature (°) | Breathing frequency (times/minute) | Whether there is any spiral sound | Whether or not to suffer from pneumonia |
135 | 36.5 | 55 | 0 | 0 |
137 | 36.6 | 56 | 0 | 0 |
140 | 36.4 | 50 | 0 | 0 |
100 | 37.8 | 65 | 1 | 1 |
150 | 38.5 | 70 | 1 | 1 |
155 | 38.6 | 72 | 1 | 1 |
… | … | … | … | … |
Step two: cleaning the obtained infant non-audio physiological data, removing abnormal data, marking the data, and constructing an infant pneumonia physiological index data set which comprises a training set and a testing set and is used for training a deep neural network DNN;
processing data except respiratory audio by using filtering, linear transformation and standardization technologies, and eliminating data groups corresponding to abnormal data and invalid data for deep neural network DNN training; the method specifically comprises the following steps:
assuming that there are N detection samples and M characteristic parameters, a matrix from which the original data can be obtained is:
after standardization treatment, each index is obtained:
in the formula (I), the compound is shown in the specification,and σjMean and standard deviation of the jth characteristic parameter:
step three: analyzing the breathing audio data in the data by utilizing Fourier transform, extracting corresponding characteristic information, and constructing a data set about the spiral sound, wherein the data set also comprises a training set and a test set;
the method comprises the following steps of analyzing the respiratory audio data and extracting corresponding characteristic information:
for respiratory audio data, feature Frequency domain feature extraction (using Mel-Frequency cepstral coefficients) is performed using fourier transform, and in the field of sound processing, Mel-Frequency Cepstrum (Mel-Frequency Cepstrum) is a linear transform of the log energy spectrum based on the nonlinear Mel scale (Mel scale) of sound frequencies. The Mel-Frequency Cepstral Coefficient (MFCC) is a Coefficient constituting the Mel-Frequency Cepstral. For training of long and short term memory networks LSTM, the solution of MFCC typically comprises the following steps:
a. decomposing a voice signal into a plurality of frames;
b. pre-emphasizing the voice signal, and passing the voice signal through a high-pass filter;
c. performing fourier transform to transform the signal into the frequency domain, namely:
d. passing the obtained frequency spectrum of each frame through a Mel filter (triangular overlapping window) to obtain Mel scale;
e. extracting logarithmic energy on each mel scale;
f. the result obtained above is subjected to inverse discrete fourier transform and transformed into the cepstrum domain, that is:
MFCC is the magnitude (amplitudes) of this cepstral plot. Typically 12 coefficients are used, superimposed with the frame energy to obtain 13-dimensional coefficients.
Step four: training a long-time memory neural network LSTM by using the frequency domain characteristics of the respiratory audio and the labels thereof, and carrying out classification and identification on the respiratory audio data; the method specifically comprises the following steps:
training a long-time memory neural network LSTM by using the frequency domain characteristics of the respiratory audio and the labels thereof to obtain a first part of a model and a spiral sound identification part;
the LSTM has three "gate" structures, namely an input gate, an output gate, and a forgetting gate.
(1) Forget the door:
let the network "forget" the information that was not useful in the previous step, it will depend on the current input x(t)State of the last moment c(t-1)Output h of the previous moment(t-1)The input part of the forgetting gate mainly comprises a hidden state h at the previous moment(t-1)And input data x of the current time(t)It passes through an activation function sigma, generally a sigmoid function, which, due to its characteristics, ensures that the output of the forgetting gate is at[0,1]Output f(t)Comprises the following steps:
f(t)=σ(Wfh(t-1)+Ufx(t)+bf) (6)
wherein Wf,Uf,bfRespectively, the cyclic weight, the input weight and the offset of the forgetting gate;
(2) an input gate:
after the neural network 'forgets' the information of the previous state, supplementing new memory; according to the current input x(t)State of the last moment c(t-1)Output h of the previous moment(t-1)Jointly determining useful information into a new state c(t)And stored in memory cells; the output of which comprises two parts, i(t)And a(t):
i(t)=σ(Wih(t-1)+Uix(t)+bi) (7)
a(t)=tanh(Wah(t-1)+Uax(t)+ba) (8)
Wherein, Wi,WaFor different connection weights, U, of the previous statei,UaFor different connection weights of the inputs, ib,abRespectively, a respective bias portion.
(3) An output gate:
using status cells c(t)Determining how much information is output to hidden layer state unit h(t)When the user passes through the forgetting gate and the input gate, the results of the two gates act on the state unit c(t),c(t)The mathematical expression implemented is:
c(t)=i(t)Θa(t)+ftΘc(t)(9)
wherein i(t)Θa(t)For remaining in state unit c after passing through input gate(t)Information of (1), ftΘc(t-1)To remain at c after passing through a forgetting door(t)The information in (1). At the same time, the layer state h is hidden(t)Updated mainly by two parts, x(t)、h(t-1)Result of the combined action o(t)And status unit c(t)To determine o(t)And h(t)The formula implemented is:
o(t)=σ(Woh(t-1)+Uox(t)+bo) (10)
h(t)=o(t)Θtanh(c(t)) (11)
wherein U iso,Wo,boRespectively inputting x for the current time(t)Output gate weight, last hidden state unit h(t-1)Round robin weight and output gate offset;
and (3) processing the respiratory audio signal according to the flow, inputting the respiratory audio signal into an LSTM network, if the output result is 1, judging that the respiratory audio contains spiral sound, and if the output result is 0, judging that the respiratory audio does not contain spiral sound.
The LSTM back propagation algorithm is slightly different from the traditional neural network back propagation algorithm, for the recurrent neural network, because 'cycles' exist in the network, some weight matrixes are calculated and shared in multiple layers of the network, and each time node depends on previous node information, namely historical information, so that the common BP algorithm cannot be directly used in a recurrent neural network model training stage. The back propagation algorithm utilized by the LSTM is bptt (backpropagation threouthtime), which is to calculate partial derivatives of parameters on a time sequence and calculate gradients of loss functions layer by layer so as to update the parameters.
The collected respiratory audio data of 260 infants with pneumonia and 260 normal infants are used for carrying out LSTM training to obtain a recognition model for spiral sound, and the respiratory audio data of the rest 40 groups of infants with pneumonia and 40 normal infants are used for carrying out model verification, so that the accuracy rate of spiral sound recognition of the model is 85%.
Step five: the method comprises the following steps of establishing and training a deep neural network DNN classification recognition model for pneumonia diagnosis by utilizing a spirophone recognition result, body temperature, heart rate and respiratory frequency physiological indexes, judging and reading various winning data of a patient, and further judging whether the patient suffers from pneumonia, wherein the method specifically comprises the following steps:
inputting data including body temperature, heart rate, respiratory rate and whether the respiratory audio data contains spiral sound (1 or 0) into a deep neural network DNN for training (the structure of the deep neural network DNN refers to FIG. 3), and obtaining a second part of the model, namely a pneumonia diagnosis part;
(1) initialization
The convergence of the network is greatly influenced by the setting of the initial value of the connection weight (including a threshold value); the net input value of each neuron is small due to the expected initial value so as to ensure that the neuron works in an activation region, namely a region with sensitive change, and the initial value of each connection weight is a random number;
(2) forward pass process computation
(a) Input learning sample pair (X)q,Tq) Wherein Q is 1,2, … Q;
(b) for each input sample, sequentially calculating the output value of each neuron according to the following formula;
in the formula:output of each hidden layer of DNN, wji,wkj,wlkThe weight coefficients of each hidden layer are respectively, and because the signals in the neural network are transmitted layer by layer, the output of the previous hidden layer in the neural network is used as the input of the next hidden layer, so thatIs the output of the first hidden layer, is also the input of the second hidden layer,is the output of the second hidden layer and is also the input of the third hidden layer;
(3) error back propagation
Calculating equivalent errors of neurons of each layer from the output layer to the input layer, then returning to the step (2), and performing forward propagation calculation and error backward propagation on other learning sample pairs until all α learning sample pairs are similarly calculated;
(4) adjusting connection weight between layers
And modifying the connection weight of each layer according to the following formula:
in the formula: w is aik(n+1),wkj(n+1),wji(n +1) are the weight coefficients of the hidden layers after the back propagation update by the gradient method, wik(n),wkj(n),wji(n) the current weight coefficients of the hidden layers, η represents the step size of the gradient method,respectively representing gradient information of each hidden layer after being propagated through the neural network;
(5) returning to the step (2), performing forward calculation according to the new connection weight, and if each learning sample pair meets the precision requirement, that is:
wherein, p is 1,2, … …, α, l is 1,2, … …, m, (α, m respectively represents the number of the activation units of each hidden layer and the number of the hidden layers of the neuron), epsilon is a certain value given according to the precision requirement, when the above formula is satisfied, the learning of the network is completed, otherwise, the calculation is repeated until the requirement is satisfied;
the calculation process of the Deep Neural Network (DNN) is actually an updating process of the weight and the threshold of the network, and the weight and the threshold which enable the overall error to be minimum are found;
the final diagnostic model obtained was:
in the formula: f. of1,f2,f3Respectively representing the activation functions of all the hidden layers, and other symbols are described in the description;
inputting the output result (1 or 0) of the LSTM network and physiological indexes of body temperature, heart rate and respiratory rate into the deep neural network, if the output is 1, judging that the patient has pneumonia, and if the output is 0, judging that the patient does not have pneumonia.
The method comprises the steps of carrying out DNN training by using collected physiological data of 260 pneumonia-affected infants and 260 normal infants to obtain a classification recognition model for pneumonia diagnosis, and carrying out model verification by using physiological data of the rest 40 groups of pneumonia-affected infants and 40 normal infants to obtain the accuracy rate of pneumonia diagnosis of the model, wherein the accuracy rate of the pneumonia diagnosis of the model is 80%.
The above description is only for the purpose of illustrating the preferred embodiments of the present invention and is not to be construed as limiting the invention, and any modifications, equivalents and improvements made within the spirit and principle of the present invention are intended to be included within the scope of the present invention.
Claims (6)
1. A rapid intelligent diagnosis method for infant pneumonia based on a hybrid deep learning model comprises the following steps:
the method comprises the following steps: measuring and collecting various physiological indexes of the infant, including respiratory audio data, body temperature, heart rate and respiratory frequency;
step two: cleaning the obtained infant non-audio physiological data, removing abnormal data, marking the data, and constructing an infant pneumonia diagnosis physiological index data set, wherein the data set comprises a training set and a testing set and is used for training a Deep Neural Network (DNN);
step three: analyzing the respiratory audio data by utilizing Fourier transform, extracting corresponding characteristic information, and constructing a data set about the spiral sound identification, wherein the data set also comprises a training set and a testing set and is used for training a long-time memory network LSTM;
step four: training a long-time memory neural network LSTM by using the frequency domain characteristics of the respiratory audio and the labels thereof, and judging the spiroid of the respiratory audio data;
step five: and establishing and training a deep neural network DNN for identifying pneumonia by using a spiral voice identification result of the breathing audio data and physiological indexes of body temperature, heart rate and breathing frequency, and interpreting new input data so as to judge whether the patient has pneumonia.
2. The method for rapidly and intelligently diagnosing infant pneumonia based on a hybrid deep learning model as claimed in claim 1, wherein the first step is specifically as follows: according to local conditions, various physiological indexes of infant pneumonia patients and normal infants are collected, and the method comprises the following steps: heart rate, body temperature, respiratory rate, respiratory audio data, and then labeling and storing the data by the category of whether or not pneumonia is present.
3. The method for rapidly and intelligently diagnosing infant pneumonia based on a hybrid deep learning model according to claim 1, wherein the second step is specifically as follows: processing data except respiratory audio by using filtering, linear transformation and standardization technologies, and eliminating data groups corresponding to abnormal data and invalid data for deep neural network DNN training;
assuming that there are N detection samples and M characteristic parameters, a matrix from which the original data can be obtained is:
after standardization treatment, each index is obtained:
in the formula (I), the compound is shown in the specification,and σjMean and standard deviation of the jth characteristic parameter:
4. the method for rapidly and intelligently diagnosing infant pneumonia according to claim 1, wherein the method for analyzing respiratory audio data in the third step and extracting corresponding characteristic information specifically comprises:
for the respiratory audio data, Fourier transform is utilized to extract characteristic frequency domain characteristics, the method is used for training a long-time memory network (LSTM), and the MFCC solving comprises the following steps:
a. decomposing a voice signal into a plurality of frames;
b. pre-emphasizing the voice signal, and passing the voice signal through a high-pass filter;
c. performing fourier transform to transform the signal into the frequency domain, namely:
d. passing the frequency spectrum obtained by each frame through a Mel filter to obtain Mel scales;
e. extracting logarithmic energy on each mel scale;
f. the result obtained above is subjected to inverse discrete fourier transform and transformed into the cepstrum domain, that is:
MFCC is the magnitude of this cepstrum; typically 12 coefficients are used, superimposed with the frame energy to obtain 13-dimensional coefficients.
5. The method for rapidly and intelligently diagnosing infant pneumonia based on a hybrid deep learning model according to claim 1, wherein the fourth step is specifically as follows:
training a long-time memory neural network LSTM by using the frequency domain characteristics of the respiratory audio and the labels thereof to obtain a first part of a model, namely a part for recognizing the spiral sound;
the LSTM has three gate structures, namely an input gate, an output gate and a forgetting gate;
(1) forget the door:
let the network "forget" the information that was not useful in the previous step, it will depend on the current input x(t)State of the last moment c(t-1)Output h of the previous moment(t-1)The input part of the forgetting gate mainly comprises a hidden state h at the previous moment(t-1)And input data x of the current time(t)It uses an activation function sigma, generally a sigmoid function, to ensure the output of the forgetting gate is [0,1 ] due to the characteristics of the sigmoid function]Output f(t)Comprises the following steps:
f(t)=σ(Wfh(t-1)+Ufx(t)+bf) (6)
wherein Wf,Uf,bfRespectively, the cyclic weight, the input weight and the offset of the forgetting gate;
(2) an input gate:
after the neural network 'forgets' the information of the previous state, supplementing new memory; according to the current input x(t)State of the last moment c(t-1)Output h of the previous moment(t-1)Jointly determining useful information into a new state c(t)And stored in memory cells; the output of which comprises two parts, i(t)And a(t):
i(t)=σ(Wih(t-1)+Uix(t)+bi) (7)
a(t)=tanh(Wah(t-1)+Uax(t)+ba) (8)
Wherein, Wi,WaFor different connection weights, U, of the previous statei,UaFor different connection weights of the inputs, ib,abRespectively, a respective bias portion;
(3) an output gate:
using status cells c(t)Determining how much information is output to hidden layer state unit h(t)When the user passes through the forgetting gate and the input gate, the results of the two gates act on the state unit c(t),c(t)The mathematical expression implemented is:
c(t)=i(t)Θa(t)+ftΘc(t)(9)
wherein i(t)Θa(t)For remaining in state unit c after passing through input gate(t)Information of (1), ftΘc(t-1)To remain at c after passing through a forgetting door(t)The information in (1). At the same time, the layer state h is hidden(t)Updated mainly by two parts, x(t)、h(t-1)Result of the combined action o(t)And status unit c(t)To determine o(t)And h(t)The formula implemented is:
o(t)=σ(Woh(t-1)+Uox(t)+bo) (10)
h(t)=o(t)Θtanh(c(t)) (11)
wherein U iso,Wo,boRespectively inputting x for the current time(t)Output gate weight, last hidden state unit h(t-1)Round robin weight and output gate offset;
and (3) processing the respiratory audio signal according to the flow, inputting the respiratory audio signal into an LSTM network, if the output result is 1, judging that the respiratory audio contains spiral sound, and if the output result is 0, judging that the respiratory audio does not contain spiral sound.
6. The method for rapidly and intelligently diagnosing infant pneumonia based on a hybrid deep learning model according to claim 1, wherein the fifth step is specifically as follows:
inputting data including body temperature, heart rate, respiratory rate and whether the respiratory audio data contains spiral sound into a deep neural network DNN for training to obtain a second part of the model, namely a pneumonia diagnosis part;
(1) initialization
The convergence of the network is greatly influenced by the setting of the initial value of the connection weight comprising the threshold; the net input value of each neuron is small due to the expected initial value so as to ensure that the neuron works in an activation region, namely a region with sensitive change, and the initial value of each connection weight is a random number;
(2) forward pass process computation
(a) Input learning sample pair (X)q,Tq) Wherein Q is 1,2, … Q;
(b) for each input sample, sequentially calculating the output value of each neuron according to the following formula;
in the formula:output of each hidden layer of DNN, wji,wkj,wlkThe weight coefficients of each hidden layer are respectively, and because the signals in the neural network are transmitted layer by layer, the output of the previous hidden layer in the neural network is used as the input of the next hidden layer, so thatIs the output of the first hidden layer, is also the input of the second hidden layer,is the output of the second hidden layer and is also the input of the third hidden layer;
(3) error back propagation
Calculating equivalent errors of neurons of each layer from the output layer to the input layer, then returning to the step (2), and performing forward propagation calculation and error backward propagation on other learning sample pairs until all α learning sample pairs are similarly calculated;
(4) adjusting connection weight between layers
And modifying the connection weight of each layer according to the formula (13), namely:
in the formula: w is aik(n+1),wkj(n+1),wji(n +1) are the weight coefficients of the hidden layers after the back propagation update by the gradient method, wik(n),wkj(n),wji(n) the current weight coefficients of the hidden layers, η represents the step size of the gradient method,respectively representing gradient information of each hidden layer after being propagated through the neural network;
(5) returning to the step (2), performing forward calculation according to the new connection weight, and if each learning sample pair meets the precision requirement, that is:
wherein, p is 1,2, … …, α, l is 1,2, … …, m, (α, m respectively represents the number of the activation units of each hidden layer and the number of the hidden layers of the neuron), epsilon is a certain value given according to the precision requirement, when the above formula is satisfied, the learning of the network is completed, otherwise, the calculation is repeated until the requirement is satisfied;
the calculation process of the deep neural network DNN is actually an updating process of the weight and the threshold of the network, and the weight and the threshold which enable the overall error to be minimum are found;
the final diagnostic model (classification recognition model) is:
in the formula: f. of1,f2,f3Respectively representing the activation functions of all the hidden layers, and other symbols are described in the description;
inputting the output result (1 or 0) of the LSTM network and physiological indexes of body temperature, heart rate and respiratory rate into the deep neural network, if the output is 1, judging that the patient has pneumonia, and if the output is 0, judging that the patient does not have pneumonia.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202010013667.0A CN111110269A (en) | 2020-01-07 | 2020-01-07 | Infant pneumonia rapid intelligent diagnosis method based on mixed deep learning model |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202010013667.0A CN111110269A (en) | 2020-01-07 | 2020-01-07 | Infant pneumonia rapid intelligent diagnosis method based on mixed deep learning model |
Publications (1)
Publication Number | Publication Date |
---|---|
CN111110269A true CN111110269A (en) | 2020-05-08 |
Family
ID=70488411
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202010013667.0A Pending CN111110269A (en) | 2020-01-07 | 2020-01-07 | Infant pneumonia rapid intelligent diagnosis method based on mixed deep learning model |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN111110269A (en) |
Cited By (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN111803032A (en) * | 2020-07-03 | 2020-10-23 | 赵永翔 | Large-area observation method and system for suspected infection of new coronary pneumonia |
CN111951246A (en) * | 2020-08-11 | 2020-11-17 | 湖南大学 | Multidirectional X-ray chest radiography pneumonia diagnosis method based on deep learning |
CN112863667A (en) * | 2021-01-22 | 2021-05-28 | 杭州电子科技大学 | Lung sound diagnosis device based on deep learning |
CN113081025A (en) * | 2021-03-10 | 2021-07-09 | 华中科技大学 | New crown pneumonia intelligent diagnosis system and device based on lung sounds |
-
2020
- 2020-01-07 CN CN202010013667.0A patent/CN111110269A/en active Pending
Cited By (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN111803032A (en) * | 2020-07-03 | 2020-10-23 | 赵永翔 | Large-area observation method and system for suspected infection of new coronary pneumonia |
CN111803032B (en) * | 2020-07-03 | 2022-08-19 | 赵永翔 | Large-area observation method and system for suspected infection of Xinguan pneumonia |
CN111951246A (en) * | 2020-08-11 | 2020-11-17 | 湖南大学 | Multidirectional X-ray chest radiography pneumonia diagnosis method based on deep learning |
CN112863667A (en) * | 2021-01-22 | 2021-05-28 | 杭州电子科技大学 | Lung sound diagnosis device based on deep learning |
CN112863667B (en) * | 2021-01-22 | 2023-10-24 | 杭州电子科技大学 | Lung sound diagnostic device based on deep learning |
CN113081025A (en) * | 2021-03-10 | 2021-07-09 | 华中科技大学 | New crown pneumonia intelligent diagnosis system and device based on lung sounds |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN111110269A (en) | Infant pneumonia rapid intelligent diagnosis method based on mixed deep learning model | |
Ma et al. | Lungbrn: A smart digital stethoscope for detecting respiratory disease using bi-resnet deep learning algorithm | |
JP2023164839A (en) | Method for analysis of cough sound using disease signature to diagnose respiratory disease | |
Emanet et al. | A comparative analysis of machine learning methods for classification type decision problems in healthcare | |
CN110801221B (en) | Sleep apnea fragment detection equipment based on unsupervised feature learning | |
EP3100675A1 (en) | Method and system for recognizing physiological sound | |
Deperlioglu | Heart sound classification with signal instant energy and stacked autoencoder network | |
JP2015514456A (en) | Method and apparatus for processing patient sounds | |
Cinyol et al. | Incorporating support vector machine to the classification of respiratory sounds by Convolutional Neural Network | |
Singh et al. | Short unsegmented PCG classification based on ensemble classifier | |
Dokur | Respiratory sound classification by using an incremental supervised neural network | |
CN113080993B (en) | Automatic arrhythmia analysis method based on improved multi-base classifier integration | |
CN113436726A (en) | Automatic lung pathological sound analysis method based on multi-task classification | |
Yildirim | Automatic classification and diagnosis of heart valve diseases using heart sounds with MFCC and proposed deep model | |
Jiang et al. | Continuous blood pressure estimation based on multi-scale feature extraction by the neural network with multi-task learning | |
Palaniappan et al. | Classification of pulmonary pathology from breath sounds using the wavelet packet transform and an extreme learning machine | |
CN112668556B (en) | Breathing sound identification method and system | |
CN113974607A (en) | Sleep snore detecting system based on impulse neural network | |
JP2023531464A (en) | A method and system for screening for obstructive sleep apnea during wakefulness using anthropometric information and tracheal breath sounds | |
Faustino | Crackle and wheeze detection in lung sound signals using convolutional neural networks | |
Ren et al. | 1D-CNNs model for classification of sputum deposition degree in mechanical ventilated patients based on airflow signals | |
Kaushal et al. | Deep autoencoder neural networks for heart sound classification | |
Shi et al. | Obstructive sleep apnea detection using difference in feature and modified minimum distance classifier | |
TÜRKER et al. | Smart Stethoscope | |
Roy et al. | VGAResNet: A Unified Visibility Graph Adjacency Matrix based Residual Network for Chronic Obstructive Pulmonary Disease Detection Using Lung Sounds |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
RJ01 | Rejection of invention patent application after publication |
Application publication date: 20200508 |
|
RJ01 | Rejection of invention patent application after publication |