CN116720545B - Information flow control method, device, equipment and medium of neural network - Google Patents
- Publication number
- CN116720545B (application CN202310999951.3A)
- Authority
- CN
- China
- Prior art keywords
- data
- zero setting
- neural network
- hidden layer
- input
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/042—Knowledge-based neural networks; Logical representations of neural networks
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61B—DIAGNOSIS; SURGERY; IDENTIFICATION
- A61B5/00—Measuring for diagnostic purposes; Identification of persons
- A61B5/24—Detecting, measuring or recording bioelectric or biomagnetic signals of the body or parts thereof
- A61B5/316—Modalities, i.e. specific diagnostic methods
- A61B5/318—Heart-related electrical modalities, e.g. electrocardiography [ECG]
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61B—DIAGNOSIS; SURGERY; IDENTIFICATION
- A61B5/00—Measuring for diagnostic purposes; Identification of persons
- A61B5/24—Detecting, measuring or recording bioelectric or biomagnetic signals of the body or parts thereof
- A61B5/316—Modalities, i.e. specific diagnostic methods
- A61B5/369—Electroencephalography [EEG]
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61B—DIAGNOSIS; SURGERY; IDENTIFICATION
- A61B5/00—Measuring for diagnostic purposes; Identification of persons
- A61B5/24—Detecting, measuring or recording bioelectric or biomagnetic signals of the body or parts thereof
- A61B5/316—Modalities, i.e. specific diagnostic methods
- A61B5/389—Electromyography [EMG]
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61B—DIAGNOSIS; SURGERY; IDENTIFICATION
- A61B5/00—Measuring for diagnostic purposes; Identification of persons
- A61B5/24—Detecting, measuring or recording bioelectric or biomagnetic signals of the body or parts thereof
- A61B5/316—Modalities, i.e. specific diagnostic methods
- A61B5/398—Electrooculography [EOG], e.g. detecting nystagmus; Electroretinography [ERG]
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61B—DIAGNOSIS; SURGERY; IDENTIFICATION
- A61B5/00—Measuring for diagnostic purposes; Identification of persons
- A61B5/48—Other medical applications
- A61B5/4806—Sleep evaluation
- A61B5/4812—Detecting sleep stages or cycles
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61B—DIAGNOSIS; SURGERY; IDENTIFICATION
- A61B5/00—Measuring for diagnostic purposes; Identification of persons
- A61B5/48—Other medical applications
- A61B5/4806—Sleep evaluation
- A61B5/4815—Sleep quality
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61B—DIAGNOSIS; SURGERY; IDENTIFICATION
- A61B5/00—Measuring for diagnostic purposes; Identification of persons
- A61B5/72—Signal processing specially adapted for physiological signals or for diagnostic purposes
- A61B5/7235—Details of waveform analysis
- A61B5/7264—Classification of physiological signals or data, e.g. using neural networks, statistical classifiers, expert systems or fuzzy systems
- A61B5/7267—Classification of physiological signals or data, e.g. using neural networks, statistical classifiers, expert systems or fuzzy systems involving training the classification device
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/21—Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
- G06F18/213—Feature extraction, e.g. by transforming the feature space; Summarisation; Mappings, e.g. subspace methods
- G06F18/2131—Feature extraction, e.g. by transforming the feature space; Summarisation; Mappings, e.g. subspace methods based on a transform domain processing, e.g. wavelet transform
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/24—Classification techniques
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/0464—Convolutional networks [CNN, ConvNet]
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/08—Learning methods
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N5/00—Computing arrangements using knowledge-based models
- G06N5/04—Inference or reasoning models
- G06N5/045—Explanation of inference; Explainable artificial intelligence [XAI]; Interpretable artificial intelligence
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V10/00—Arrangements for image or video recognition or understanding
- G06V10/70—Arrangements for image or video recognition or understanding using pattern recognition or machine learning
- G06V10/764—Arrangements for image or video recognition or understanding using pattern recognition or machine learning using classification, e.g. of video objects
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V10/00—Arrangements for image or video recognition or understanding
- G06V10/70—Arrangements for image or video recognition or understanding using pattern recognition or machine learning
- G06V10/77—Processing image or video features in feature spaces; using data integration or data reduction, e.g. principal component analysis [PCA] or independent component analysis [ICA] or self-organising maps [SOM]; Blind source separation
- G06V10/7715—Feature extraction, e.g. by transforming the feature space, e.g. multi-dimensional scaling [MDS]; Mappings, e.g. subspace methods
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V10/00—Arrangements for image or video recognition or understanding
- G06V10/70—Arrangements for image or video recognition or understanding using pattern recognition or machine learning
- G06V10/82—Arrangements for image or video recognition or understanding using pattern recognition or machine learning using neural networks
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F2123/00—Data types
- G06F2123/02—Data types in the time domain, e.g. time-series data
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F2218/00—Aspects of pattern recognition specially adapted for signal processing
- G06F2218/08—Feature extraction
- G06F2218/10—Feature extraction by analysing the shape of a waveform, e.g. extracting parameters relating to peaks
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F2218/00—Aspects of pattern recognition specially adapted for signal processing
- G06F2218/12—Classification; Matching
Abstract
The present invention relates to the field of neural networks, and in particular to an information flow control method, apparatus, device, and medium for a neural network. The method comprises the following steps: inputting sample body surface physiological data into a target neural network; performing zeroing processing on at least part of the input features of at least one hidden layer; each time zeroing processing is performed on a hidden layer, determining the change in model performance of the target neural network before and after the zeroing processing; and, based on the change in model performance corresponding to each zeroing processing, determining the zeroing position of the input features entering each hidden layer, so as to realize information flow control of the body surface physiological data to be measured in the target neural network. The sample body surface physiological data and the body surface physiological data to be measured are of the same data type, and both are time-series data. The technical scheme can enhance the interpretability of the neural network.
Description
Technical Field
The present invention relates to the field of neural networks, and in particular, to a method, an apparatus, a device, and a medium for controlling information flow of a neural network.
Background
Neural networks, also known as Artificial Neural Networks (ANNs) or Simulated Neural Networks (SNNs), are a subset of machine learning and the core of deep learning algorithms. Their name and structure are inspired by the human brain, and they imitate the way biological neurons signal one another. A neural network is composed of layers of nodes: an input layer, a plurality of hidden layers, and an output layer. Each node, also called an artificial neuron, is connected to other nodes and has an associated weight and threshold. If the output of an individual node is above its specified threshold, that node is activated and sends data to the next layer of the network; otherwise, no data is passed on to the next layer.
Neural networks rely on training data to learn and to improve their accuracy over time. Once the learning algorithms have been tuned for accuracy, they become powerful tools in computer science and artificial intelligence, allowing data to be classified and clustered rapidly. However, neural networks behave as black boxes: they involve a large number of network parameters, and these abstract parameters generally bear no relation to the physical nature of the problem being solved. Researchers therefore cannot directly interpret the network parameters of neurons as understandable knowledge, which makes the neural network opaque and unexplainable, and makes interpretability particularly important.
One research approach to enhancing the interpretability of a neural network is to introduce human prior knowledge, so that the model learns human discrimination criteria as far as possible. For example, the field of sleep staging has a large body of prior knowledge, which gives it a unique advantage over the fields of computer vision and natural language. However, the rules for dividing sleep stages are somewhat ambiguous, and even human experts may disagree to some extent.
Disclosure of Invention
The invention describes an information flow control method, device, equipment and medium of a neural network, which can enhance the interpretability of the neural network.
According to a first aspect, the present invention provides an information flow control method of a neural network, including:
inputting sample body surface physiological data into a target neural network; the target neural network comprises an input layer, a plurality of hidden layers and an output layer;
performing zeroing processing on at least part of the input features of at least one hidden layer;
each time zeroing processing is performed on a hidden layer, determining the change in model performance of the target neural network before and after the zeroing processing;
based on the change in model performance corresponding to each zeroing processing, determining the zeroing position of the input features entering each hidden layer, so as to realize information flow control of the body surface physiological data to be measured in the target neural network; the sample body surface physiological data and the body surface physiological data to be measured are of the same data type, and both are time-series data.
According to one embodiment, the body surface physiological data includes at least one of: respiratory pressure data, electroencephalogram (EEG) data, electrooculogram (EOG) data, electromyogram (EMG) data, electrocardiogram (ECG) data, chest belt data, abdominal belt data, pulse wave data, leg movement data, snore data, pulse rate data, and blood oxygen saturation data.
According to one embodiment, the hidden layer comprises at least one of: a convolution layer, an activation layer, a pooling layer and a fully connected layer.
According to one embodiment, there are at least two convolution layers, and the hidden layers subjected to zeroing processing are the first two convolution layers.
According to one embodiment, the model performance includes at least one of: accuracy, kappa coefficient, and F1 score.
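As an illustration of the three performance measures named above, the following is a minimal pure-Python sketch. The function names are hypothetical, and two choices are assumptions rather than anything stated in the text: the kappa coefficient is taken to be Cohen's kappa, and the F1 score is macro-averaged over classes.

```python
from collections import Counter


def accuracy(y_true, y_pred):
    """Fraction of samples whose predicted label equals the true label."""
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)


def cohen_kappa(y_true, y_pred):
    """Cohen's kappa: agreement corrected for chance (assumed variant)."""
    n = len(y_true)
    p_observed = accuracy(y_true, y_pred)
    true_counts, pred_counts = Counter(y_true), Counter(y_pred)
    p_chance = sum(true_counts[c] * pred_counts[c] for c in true_counts) / (n * n)
    if p_chance == 1.0:  # degenerate case: a single class everywhere
        return 0.0
    return (p_observed - p_chance) / (1.0 - p_chance)


def macro_f1(y_true, y_pred):
    """Unweighted mean of per-class F1 scores (macro averaging is assumed)."""
    labels = sorted(set(y_true) | set(y_pred))
    scores = []
    for c in labels:
        tp = sum(t == c and p == c for t, p in zip(y_true, y_pred))
        fp = sum(t != c and p == c for t, p in zip(y_true, y_pred))
        fn = sum(t == c and p != c for t, p in zip(y_true, y_pred))
        denom = 2 * tp + fp + fn
        scores.append(2 * tp / denom if denom else 0.0)
    return sum(scores) / len(scores)


# Usage with made-up sleep-stage labels (illustrative data only).
y_true = ["W", "N1", "N2", "N2", "REM", "W"]
y_pred = ["W", "N2", "N2", "N2", "REM", "N1"]
acc = accuracy(y_true, y_pred)
kappa = cohen_kappa(y_true, y_pred)
f1 = macro_f1(y_true, y_pred)
```

Any one of these values, computed before and after a zeroing step, can serve as the model performance whose change is compared.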
According to one embodiment, performing zeroing processing on at least part of the input features of the hidden layer includes:
performing frequency domain transformation on the input features of the hidden layer to obtain a first spectral feature;
zeroing the positions of the first spectral feature whose frequency is lower than a preset frequency, to obtain a second spectral feature;
and performing inverse frequency domain transformation on the second spectral feature to obtain the target input features, that is, the input features of the hidden layer after zeroing processing has been applied to at least part of them.
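The three steps of this embodiment can be sketched with NumPy as follows. This is only an illustrative sketch: the text does not name a specific transform, so a real-input FFT along the time axis is assumed, and the function name and the `sample_rate_hz` and `cutoff_hz` parameters are hypothetical.

```python
import numpy as np


def zero_low_frequencies(features, sample_rate_hz, cutoff_hz):
    """Zero all spectral components below cutoff_hz along the last (time) axis.

    features: array of shape (..., n_samples), e.g. (channels, n_samples).
    """
    features = np.asarray(features, dtype=float)
    n_samples = features.shape[-1]
    # Step 1: frequency domain transformation -> first spectral feature.
    spectrum = np.fft.rfft(features, axis=-1)
    freqs = np.fft.rfftfreq(n_samples, d=1.0 / sample_rate_hz)
    # Step 2: zero the positions below the preset frequency -> second spectral feature.
    zeroed = spectrum.copy()
    zeroed[..., freqs < cutoff_hz] = 0.0
    # Step 3: inverse transformation -> target input features.
    return np.fft.irfft(zeroed, n=n_samples, axis=-1)


# Usage: a 1 Hz component plus a 20 Hz component, sampled at 100 Hz for 2 s.
t = np.arange(200) / 100.0
slow = np.sin(2 * np.pi * 1.0 * t)
fast = np.sin(2 * np.pi * 20.0 * t)
filtered = zero_low_frequencies(slow + fast, sample_rate_hz=100.0, cutoff_hz=5.0)
```

In this toy signal both components fall exactly on FFT bins, so zeroing everything below 5 Hz leaves only the 20 Hz component. In a network, the same function would be applied to the feature map entering the chosen hidden layer.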
According to one embodiment, determining the zeroing position of the input features entering each hidden layer based on the change in model performance corresponding to each zeroing processing includes:
for each zeroing processing, judging whether the model performance of the target neural network has improved after the zeroing processing compared with before it;
if the model performance has improved, judging whether the change in model performance of the target neural network before and after the zeroing processing is greater than a preset change;
and if the change is greater than the preset change, taking the zeroing position of the current zeroing processing as the zeroing position of the input features entering the hidden layer.
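The two-stage decision above can be written as a short sketch. All names here (`accept_zeroing`, `select_zeroing_positions`, `min_gain`, and the position labels) are hypothetical, and the scores are made-up numbers used only to show the rule.

```python
def accept_zeroing(baseline_score, zeroed_score, min_gain):
    """Return True if a zeroing position should be kept."""
    gain = zeroed_score - baseline_score
    if gain <= 0:
        # First check: performance did not improve, so the position is rejected.
        return False
    # Second check: the improvement must exceed the preset change.
    return gain > min_gain


def select_zeroing_positions(baseline_score, scores_by_position, min_gain):
    """Keep every candidate position whose zeroing passes both checks."""
    return [
        position
        for position, score in scores_by_position.items()
        if accept_zeroing(baseline_score, score, min_gain)
    ]


# Usage with illustrative scores (e.g. accuracy) for three candidate positions.
baseline = 0.80
scores = {"position_a": 0.83, "position_b": 0.805, "position_c": 0.78}
kept = select_zeroing_positions(baseline, scores, min_gain=0.01)
```

Here only `position_a` is kept: `position_b` improves performance but by less than the preset change, and `position_c` makes performance worse.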
According to a second aspect, the present invention provides an information flow control apparatus of a neural network, comprising:
an input unit, configured to input sample body surface physiological data into a target neural network; the target neural network comprises an input layer, a plurality of hidden layers and an output layer;
a zeroing unit, configured to perform zeroing processing on at least part of the input features of at least one hidden layer;
a first determining unit, configured to determine, each time zeroing processing is performed on a hidden layer, the change in model performance of the target neural network before and after the zeroing processing;
a second determining unit, configured to determine the zeroing position of the input features entering each hidden layer based on the change in model performance corresponding to each zeroing processing, so as to realize information flow control of the body surface physiological data to be measured in the target neural network; the sample body surface physiological data and the body surface physiological data to be measured are of the same data type, and both are time-series data.
According to a third aspect, the present invention provides an electronic device comprising a memory and a processor, the memory having stored therein a computer program, the processor implementing the method of the first aspect when executing the computer program.
According to a fourth aspect, the present invention provides a computer readable storage medium having stored thereon a computer program which, when executed in a computer, causes the computer to perform the method of the first aspect.
According to the information flow control method, apparatus, device and medium of the neural network, zeroing processing is performed on at least part of the input features of at least one hidden layer, and the zeroing position of the input features entering each hidden layer is determined based on the change in model performance corresponding to each zeroing processing. In this way, effective information that the target neural network cannot utilize is actively discarded, the complexity of the feature distribution is reduced, and model performance is improved. In other words, the technical scheme allows the model to be explained from its output results and enhances the interpretability of the neural network: the information flow of the neural network becomes controllable layer by layer, which realizes information flow control of the body surface physiological data to be measured in the target neural network.
Drawings
In order to more clearly illustrate the embodiments of the present invention or the technical solutions of the prior art, the drawings used in the description of the embodiments or the prior art will be briefly described, and it is obvious that the drawings described below are some embodiments of the present invention, and other drawings may be obtained according to these drawings without inventive effort for a person skilled in the art.
FIG. 1 shows a flow diagram of an information flow control method of a neural network according to one embodiment;
FIG. 2 shows a schematic block diagram of an information flow control apparatus of a neural network according to one embodiment.
Detailed Description
The scheme provided by the invention is described below with reference to the accompanying drawings.
Fig. 1 shows a flow diagram of an information flow control method of a neural network according to one embodiment. It is understood that the method may be performed by any apparatus, device, platform, cluster of devices having computing, processing capabilities. As shown in fig. 1, the method includes:
Step 100, inputting sample body surface physiological data into a target neural network; the target neural network comprises an input layer, a plurality of hidden layers and an output layer;
Step 102, for at least one hidden layer, performing zeroing processing on at least part of the input features of the hidden layer;
Step 104, each time zeroing processing is performed on a hidden layer, determining the change in model performance of the target neural network before and after the zeroing processing;
Step 106, based on the change in model performance corresponding to each zeroing processing, determining the zeroing position of the input features entering each hidden layer, so as to realize information flow control of the body surface physiological data to be measured in the target neural network; the sample body surface physiological data and the body surface physiological data to be measured are of the same data type, and both are time-series data.
In this embodiment, zeroing processing is performed on at least part of the input features of at least one hidden layer, and the zeroing position of the input features entering each hidden layer is determined based on the change in model performance corresponding to each zeroing processing. In this way, effective information that the target neural network cannot utilize is actively discarded, the complexity of the feature distribution is reduced, and model performance is improved. In other words, the technical scheme allows the model to be explained from its output results and enhances the interpretability of the neural network: the information flow of the neural network becomes controllable layer by layer, which realizes information flow control of the body surface physiological data to be measured in the target neural network.
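Steps 100 to 106 can be illustrated end to end on a toy model. The sketch below is an assumption-laden illustration, not the patented implementation: the network is a hand-built three-layer NumPy model with fixed weights, a "position" is taken to be a single feature index, accuracy is used as the model performance, and every name (`forward`, `scan_zeroing`, `min_gain`, and so on) is hypothetical.

```python
import numpy as np


def forward(layers, x, zero_layer=None, zero_idx=None):
    """Run x through the layers, optionally zeroing one input feature of one layer."""
    h = x
    for i, layer in enumerate(layers):
        if i == zero_layer:
            h = h.copy()
            h[:, zero_idx] = 0.0  # Step 102: zero part of this layer's input features.
        h = layer(h)
    return h


def accuracy(layers, x, y, **zero):
    logits = forward(layers, x, **zero)
    return float(np.mean(np.argmax(logits, axis=1) == y))


def scan_zeroing(layers, x, y, candidate_layers, min_gain):
    """Steps 104 and 106: measure the gain of each zeroing and keep the useful ones."""
    baseline = accuracy(layers, x, y)
    chosen = {}
    for li in candidate_layers:
        n_features = forward(layers[:li], x).shape[1]  # width of the input to layer li
        for idx in range(n_features):
            gain = accuracy(layers, x, y, zero_layer=li, zero_idx=idx) - baseline
            if gain > 0 and gain > min_gain:
                chosen.setdefault(li, []).append(idx)
    return baseline, chosen


# Toy network with fixed weights; input feature 2 acts as misleading "noise".
W1 = np.array([[1.0, 0.0], [0.0, 1.0], [1.0, 0.0]])
layers = [
    lambda h: np.maximum(h @ W1, 0.0),         # hidden layer 0
    lambda h: np.maximum(h @ np.eye(2), 0.0),  # hidden layer 1
    lambda h: h + np.array([0.0, 0.5]),        # output layer
]
# Step 100: sample data (four samples, three features) and their labels.
x = np.array([[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 1.0, 2.0], [1.0, 0.0, 2.0]])
y = np.array([0, 1, 1, 0])
baseline, chosen = scan_zeroing(layers, x, y, candidate_layers=[0, 1], min_gain=0.1)
```

On this toy data the baseline accuracy is 0.75, and the only zeroing that helps is feature 2 at the input of hidden layer 0, which raises accuracy to 1.0. The resulting map from layer index to zeroed positions is what would be applied when data to be measured is later passed through the network.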
In the field of data mining, time-series classification is an important research direction. Analyzing and mining time-series physiological data can assist the diagnosis and prediction of diseases and thereby promote the development of intelligent healthcare.
In one embodiment of the invention, the body surface physiological data includes at least one of: respiratory pressure data, electroencephalogram (EEG) data, electrooculogram (EOG) data, electromyogram (EMG) data, electrocardiogram (ECG) data, chest belt data, abdominal belt data, pulse wave data, leg movement data, snore data, pulse rate data, and blood oxygen saturation data.
The sample body surface physiological data and the body surface physiological data to be measured are of the same data type. For example, both may be respiratory pressure data, EEG data, EOG data, EMG data, ECG data, chest belt data, abdominal belt data, pulse wave data, leg movement data, snore data, pulse rate data, or blood oxygen saturation data. The specific data type of the sample body surface physiological data and the body surface physiological data to be measured is not limited here.
The following description takes sleep staging, or sleep data (a subset of body surface physiological data), as an example.
Sleep staging, a typical physiological time-series classification task, is a fundamental research topic in the field of sleep monitoring and is attracting increasing attention. It is an important means of assessing sleep quality and sleep disorders. Sleep specialists usually determine sleep stages from polysomnography (PSG), which comprises the electroencephalogram (EEG), electrooculogram (EOG), electromyogram (EMG), and electrocardiogram (ECG) and can be used to diagnose sleep disorders and other common diseases. Among these signals, the EEG records not only large changes in PSG activity but also drug effects during the different sleep stages and the awake state. In addition, sleep can be staged according to PSG waveforms, which allows sleep phases to be delimited more accurately, and the PSG chart is an objective, accurate, rapid, and widely used method of evaluating drug efficacy in sleep pharmacology research. Sleep PSG analysis also uses linear and nonlinear methods to explore the distribution and activity of the PSG in each frequency band, as well as the regularity, fine-grained changes, and predictability of the brain as a nonlinear system. As a research parameter for evaluating sleep quality and changes in PSG rhythm, PSG analysis is broadly applicable, internationally recognized, and well supported by the literature.
Current sleep staging models lack interpretability: clinical specialists in the sleep field cannot understand the logic and reasons behind a model's decisions. During diagnosis they can therefore only choose to trust the results of automatic sleep disorder diagnosis completely or not at all, which is one of the main barriers to applying deep-learning-based automatic sleep staging models in clinical settings. The success of deep learning is due in part to the ability of neural networks to expose relevant, useful information progressively. The interpretability of automatic sleep staging concerns, among other things, which features the model learns from the input signal and whether those features are related to, and can reasonably explain, the sleep stages. Interpretability is particularly important because the rules for dividing sleep stages are somewhat ambiguous, and even human experts may disagree to some extent.
One research approach to enhancing the interpretability of a model is to introduce human prior knowledge, so that the model learns human discrimination criteria as far as possible. The field of sleep staging has a large body of prior knowledge, which gives it a unique advantage over the fields of computer vision and natural language. Because modern deep learning algorithms are highly flexible, a deep learning model fits not only the common knowledge in the data but also the idiosyncratic knowledge of particular data samples, or the "noise" in the data, which leads to poor generalization performance.
It should be noted that "noise" here is not noise in the conventional sense (for example, noise removed by a band-pass filter), but data points that may impair model performance, or data points (nominally effective information) that the neural network cannot use effectively. As those skilled in the art will appreciate, if the effective information is truly positive or truly negative feedback, the generalization ability of the model improves as the number of training iterations increases. If, however, the effective information is mixed with "noise" that cannot be identified in advance, so that misjudgments occur, then using the "noise" data as effective information during conventional training reduces the generalization ability of the model. Therefore, extensive experimental comparison of model performance is used to screen out which information in the original data is this so-called noise, thereby improving model performance and generalization ability. In addition, the body surface physiological data mentioned in the embodiments of the invention may be raw data, that is, data that has not been passed through a band-pass filter, which preserves the integrity of the raw data and allows the real "noise" in it to be fully mined.
To improve the generalization performance of the model, the "noise" of certain specific data must be removed. On this basis, the inventors found during development that zeroing can be performed on at least part of the input features of at least one hidden layer (that is, the feature at each position is in turn treated as "noise" and set to zero, thereby deleting the "noise"). Whether the deleted "noise" really is "noise" is decided by determining, each time zeroing is performed on a hidden layer, the variation in the model performance of the target neural network before and after the zeroing. The zeroing positions of the input features entering each hidden layer can then be determined from the variation in model performance corresponding to each zeroing process, which realizes information flow control of the body surface physiological data to be measured in the target neural network. The scheme can therefore delete unnecessary "noise" components from the body surface physiological data to improve the generalization of the model, and it can also break the black-box character of the neural network (that is, the positions of the zeroed features and any commonality among them are known and can be used to explain the model).
In one embodiment of the invention, the hidden layer comprises at least one of: a convolutional layer, an activation layer, a pooling layer, and a fully connected layer.
In one embodiment of the present invention, there are at least two convolutional layers, and the hidden layers subjected to zeroing are the first two convolutional layers.
In this embodiment, the inventors found through a large number of experiments that the hidden layers affecting model performance are mainly the first two convolutional layers, and that the influence of the other convolutional layers, the activation layers, the pooling layers, and the fully connected layers is negligible by comparison. Therefore, to improve model interpretability while reducing computational cost, the hidden layers subjected to zeroing can be set to the first two convolutional layers.
In one embodiment of the invention, the model performance includes at least one of: accuracy, kappa coefficient, and F1 score.
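The three performance measures named above can be computed from true and predicted stage labels. The following is a minimal sketch, assuming Cohen's kappa and a macro-averaged F1 score (the source specifies neither variant); the function names and the sample labels are hypothetical.

```python
from collections import Counter


def accuracy(y_true, y_pred):
    """Fraction of samples whose predicted label equals the true label."""
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)


def cohen_kappa(y_true, y_pred):
    """Cohen's kappa: observed agreement corrected for chance agreement."""
    n = len(y_true)
    p_observed = accuracy(y_true, y_pred)
    true_counts = Counter(y_true)
    pred_counts = Counter(y_pred)
    # Chance agreement from the marginal label frequencies.
    p_chance = sum(true_counts[c] * pred_counts[c] for c in true_counts) / (n * n)
    if p_chance == 1.0:
        return 1.0  # degenerate case: only one label on both sides
    return (p_observed - p_chance) / (1.0 - p_chance)


def macro_f1(y_true, y_pred):
    """Unweighted mean of the per-class F1 scores (assumed averaging mode)."""
    labels = sorted(set(y_true) | set(y_pred))
    scores = []
    for c in labels:
        tp = sum(t == c and p == c for t, p in zip(y_true, y_pred))
        fp = sum(t != c and p == c for t, p in zip(y_true, y_pred))
        fn = sum(t == c and p != c for t, p in zip(y_true, y_pred))
        denom = 2 * tp + fp + fn
        scores.append(2 * tp / denom if denom else 0.0)
    return sum(scores) / len(scores)


# Hypothetical labels for six epochs.
y_true = ["W", "N1", "N2", "N2", "N3", "REM"]
y_pred = ["W", "N2", "N2", "N2", "N3", "REM"]
print(accuracy(y_true, y_pred), cohen_kappa(y_true, y_pred), macro_f1(y_true, y_pred))
```

Any one of these values, or a combination of them, could serve as the "model performance" that is compared before and after a zeroing process.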
In one embodiment of the present invention, the step of "zeroing at least part of the input features of the hidden layer" may specifically include:
performing frequency-domain transformation on the input features of the hidden layer to obtain a first spectral feature;
setting to zero the positions in the first spectral feature whose frequency is lower than a preset frequency, to obtain a second spectral feature;
and performing inverse frequency-domain transformation on the second spectral feature to obtain the target input features, i.e., the input features of the hidden layer after zeroing has been applied to at least part of them.
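The transform, zero, and inverse-transform steps above can be sketched as follows. This is only an illustration under stated assumptions: the source does not name the transform, so a plain discrete Fourier transform is assumed; the input feature is treated as a one-dimensional real sequence; and `sample_rate_hz`, `cutoff_hz`, and all function names are hypothetical.

```python
import cmath
import math


def dft(x):
    """Naive discrete Fourier transform (assumed transform; O(n^2))."""
    n = len(x)
    return [
        sum(x[t] * cmath.exp(-2j * math.pi * k * t / n) for t in range(n))
        for k in range(n)
    ]


def idft(spectrum):
    """Inverse DFT, returning the real part of each reconstructed sample."""
    n = len(spectrum)
    return [
        sum(spectrum[k] * cmath.exp(2j * math.pi * k * t / n) for k in range(n)).real / n
        for t in range(n)
    ]


def zero_below(x, sample_rate_hz, cutoff_hz):
    """Zero every spectral position whose frequency is below cutoff_hz."""
    n = len(x)
    spectrum = dft(x)  # first spectrum: the transformed input feature
    for k in range(n):
        # Bin k and its mirror bin n - k carry the same physical frequency.
        freq = min(k, n - k) * sample_rate_hz / n
        if freq < cutoff_hz:
            spectrum[k] = 0  # second spectrum: low-frequency positions zeroed
    return idft(spectrum)  # target input feature after zeroing


# Hypothetical feature: a 2 Hz component plus a weaker 10 Hz component.
fs = 64
n = 64
low = [math.sin(2 * math.pi * 2 * t / fs) for t in range(n)]
high = [0.5 * math.sin(2 * math.pi * 10 * t / fs) for t in range(n)]
mixed = [a + b for a, b in zip(low, high)]
filtered = zero_below(mixed, sample_rate_hz=fs, cutoff_hz=5.0)
```

With the assumed 5 Hz cutoff, only the 10 Hz component survives in `filtered`. In practice a fast Fourier transform from a numerical library would replace the naive loops.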
In this embodiment, since the body surface physiological data is usually image data, the inventors conceived that, in order to find the commonality of its "noise" more quickly, the image data can be subjected to frequency-domain transformation to obtain one-dimensional spectral features. The frequency-domain transformation is optional, however: the zeroing experiment can instead be carried out position by position directly on the matrix form of the image data until the zeroing position corresponding to the best model performance is found.
The sleep staging field is again taken as an example below.
The sleep science field has a wealth of prior knowledge, which gives it a unique advantage over the computer vision and natural language fields. According to the sleep event scoring rules and sleep staging criteria issued by the American Academy of Sleep Medicine (AASM), a whole-night sleep recording is divided into 30-second windows called "epochs", each epoch representing one sleep stage. Because each sleep stage occupies a different frequency range, the inventors found that frequency information is effective information for a deep network used for sleep staging. Accordingly, by converting the image data into spectral features, the commonality of its "noise" can be found more quickly (that is, positions in the spectral features whose frequency is lower than the preset frequency are "noise", and the features at those positions are set to zero).
EEG experts divide the EEG into five basic rhythms according to its frequency characteristics: delta (δ), theta (θ), alpha (α), sigma (σ), and beta (β) waves. The α rhythm occurs mainly in the awake, eyes-closed state and in the REM stage, and its occurrence rate in the N1 stage is less than 50%; the β rhythm occurs mainly in the awake, eyes-open state and becomes more frequent after hypnotic drugs are taken; the θ rhythm occurs mainly in the late part of the N1 stage, with an amplitude typically greater than 50 μV; the δ rhythm occurs mainly in the N3 stage, with a high amplitude (at least 75 μV), and its proportion in the N2 stage is less than 20%; the σ rhythm, also called the spindle wave, is the characteristic brain wave of the N2 stage, with an amplitude typically less than 50 μV. Besides the five main rhythms, there are several non-basic waveforms with distinctive features: the K-complex, which occurs mainly in the N2 stage, typically appears as a steep negative (upward) wave followed by a positive (downward) wave; the sawtooth wave, which is steep or triangular in shape, is the main waveform of the REM stage.
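The division of a whole-night recording into 30-second epochs mentioned above can be sketched as follows. The sampling rate, the function name, and the choice to discard a trailing partial epoch are assumptions made for illustration and are not taken from the source.

```python
def split_into_epochs(samples, sample_rate_hz, epoch_seconds=30):
    """Split a 1-D recording into consecutive fixed-length epochs.

    A trailing segment shorter than one epoch is dropped (an assumption).
    """
    per_epoch = int(sample_rate_hz * epoch_seconds)
    if per_epoch <= 0:
        raise ValueError("sample_rate_hz * epoch_seconds must be positive")
    n_epochs = len(samples) // per_epoch
    return [samples[i * per_epoch:(i + 1) * per_epoch] for i in range(n_epochs)]


# Hypothetical 95-second recording sampled at 100 Hz.
recording = list(range(9500))
epochs = split_into_epochs(recording, sample_rate_hz=100)
```

Each element of `epochs` would then be labeled with a single sleep stage.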
In one embodiment of the present invention, the step of determining the zeroing position of the input feature entering each hidden layer based on the variation of the model performance corresponding to each zeroing process may specifically include:
for each zeroing process, judging whether the model performance of the target neural network is improved after the zeroing process compared with before;
if the model performance is improved, judging whether the variation of the model performance of the target neural network before and after the zeroing process is greater than a preset variation;
and if the variation is greater than the preset variation, taking the zeroing position of the current zeroing process as a zeroing position of the input features entering the hidden layer.
In this embodiment, by sequentially determining whether the model performance of the target neural network before and after the zero setting process is improved and determining whether the variation of the model performance of the target neural network before and after the zero setting process is greater than a preset variation, the zero setting position of the input features entering each hidden layer can be determined, thereby realizing the deletion of "noise" data.
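The two-stage judgment described above can be sketched as a simple filter over trial results. This is a hedged illustration: the names `baseline`, `trials`, and `min_gain`, and the representation of each trial as a mapping from zeroing position to the performance measured after zeroing that position, are hypothetical.

```python
def select_zeroing_positions(baseline, trials, min_gain):
    """Keep the positions whose zeroing improved performance by more than min_gain.

    baseline: model performance before any zeroing (e.g. accuracy).
    trials:   mapping of zeroing position -> performance after zeroing it.
    min_gain: the preset variation that an improvement must exceed.
    """
    selected = []
    for position, score in trials.items():
        gain = score - baseline
        if gain <= 0:
            continue  # first judgment: performance did not improve
        if gain > min_gain:
            selected.append(position)  # second judgment: gain exceeds the preset variation
    return selected


# Hypothetical results of four single-position zeroing trials.
trials = {0: 0.79, 1: 0.805, 2: 0.83, 3: 0.80}
chosen = select_zeroing_positions(baseline=0.80, trials=trials, min_gain=0.01)
```

Here only position 2 passes both judgments: position 1 improves the score, but by less than the assumed preset variation.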
In addition, when classifying time-series data, existing classification models generally require that the data input to the model be arranged strictly in chronological order, for example 1000 consecutive frames of polysomnography. The above technical solution, by contrast, actively controls the effective information flow in the deep network through the zeroing operation, which can correct the inductive bias of existing neural networks (that is, data with different temporal arrangements can be freely input into the target neural network, for example only the first 500 frames) and thereby improve the network's performance on a particular sleep stage. This is difficult for existing classification models, which extract features indiscriminately and do not perform preferential feature extraction (that is, zeroing of specific positions) for specific information.
The foregoing describes certain embodiments of the present invention. Other embodiments are within the scope of the following claims. In some cases, the actions or steps recited in the claims can be performed in a different order than the embodiments and still achieve desirable results. In addition, the processes depicted in the accompanying figures do not necessarily require the particular order shown, or sequential order, to achieve desirable results. In some embodiments, multitasking and parallel processing are also possible or may be advantageous.
According to an embodiment of another aspect, the present invention provides an information flow control apparatus of a neural network. Fig. 2 shows a schematic block diagram of an information flow control apparatus of a neural network according to one embodiment. It will be appreciated that the apparatus may be implemented by any means, device, platform or cluster of devices having computing, processing capabilities. As shown in fig. 2, the apparatus includes: an input unit 200, a zeroing unit 202, a first determining unit 204 and a second determining unit 206. Wherein the main functions of each constituent unit are as follows:
an input unit 200 for inputting sample body surface physiological data into a target neural network; the target neural network comprises an input layer, a plurality of hidden layers and an output layer;
a zeroing unit 202, configured to, for at least one of the hidden layers, perform zeroing processing on at least part of input features of the hidden layer;
a first determining unit 204, configured to determine, each time a zeroing process is performed on a hidden layer, the variation of the model performance of the target neural network before and after the zeroing process;
a second determining unit 206, configured to determine, based on the variation of the model performance corresponding to each zeroing process, the zeroing position of the input features entering each hidden layer, so as to realize information flow control of the body surface physiological data to be measured in the target neural network; the sample body surface physiological data and the body surface physiological data to be measured are of the same data type, and both are time-series data.
In one embodiment of the invention, the body surface physiological data includes at least one of: respiratory pressure data, brain electrical data, eye electrical data, myoelectrical data, electrocardiographic data, chest strap data, abdominal strap data, pulse wave data, leg movement data, snore data, pulse rate data, and blood oxygen saturation data.
In one embodiment of the present invention, the hidden layer includes at least one of: a convolutional layer, an activation layer, a pooling layer, and a fully connected layer.
In one embodiment of the present invention, there are at least two convolutional layers, and the hidden layers subjected to zeroing are the first two convolutional layers.
In one embodiment of the invention, the model performance includes at least one of: accuracy, kappa coefficient, and F1 score.
In one embodiment of the present invention, the zeroing unit is configured to, when executing the zeroing processing on at least part of the input features of the hidden layer, perform the following operations:
performing frequency-domain transformation on the input features of the hidden layer to obtain a first spectral feature;
setting to zero the positions in the first spectral feature whose frequency is lower than a preset frequency, to obtain a second spectral feature;
and performing inverse frequency-domain transformation on the second spectral feature to obtain the target input features, i.e., the input features of the hidden layer after zeroing has been applied to at least part of them.
In one embodiment of the present invention, when determining the zeroing position of the input features entering each hidden layer based on the variation of the model performance corresponding to each zeroing process, the second determining unit is configured to perform the following operations:
for each zeroing process, judging whether the model performance of the target neural network is improved after the zeroing process compared with before;
if the model performance is improved, judging whether the variation of the model performance of the target neural network before and after the zeroing process is greater than a preset variation;
and if the variation is greater than the preset variation, taking the zeroing position of the current zeroing process as a zeroing position of the input features entering the hidden layer.
According to an embodiment of another aspect, there is also provided a computer-readable storage medium having stored thereon a computer program which, when executed in a computer, causes the computer to perform the method described in connection with fig. 1.
According to an embodiment of yet another aspect, there is also provided an electronic device including a memory having executable code stored therein and a processor that, when executing the executable code, implements the method described in connection with fig. 1.
The embodiments of the present invention are described in a progressive manner; for identical or similar parts, the embodiments may be referred to one another, and each embodiment focuses on its differences from the others. In particular, since the device embodiments are substantially similar to the method embodiments, they are described relatively briefly, and the relevant parts of the description of the method embodiments apply.
Those skilled in the art will appreciate that in one or more of the examples described above, the functions described in the present invention may be implemented in hardware, software, firmware, or any combination thereof. When implemented in software, these functions may be stored on or transmitted over as one or more instructions or code on a computer-readable medium.
The foregoing embodiments illustrate the general principles of the present invention in further detail and are not to be construed as limiting its scope; any modifications, equivalents, improvements, etc. made on the basis of the teachings of the invention are intended to fall within the scope of the invention.
Claims (6)
1. An information flow control method of a neural network, comprising:
inputting sample body surface physiological data into a target neural network; the target neural network comprises an input layer, a plurality of hidden layers and an output layer;
performing zeroing processing on at least part of the input features of at least one hidden layer;
each time zeroing processing is performed on a hidden layer, determining the variation of the model performance of the target neural network before and after the zeroing processing;
based on the variation of the model performance corresponding to each zeroing process, determining the zeroing position of the input features entering each hidden layer, so as to realize information flow control of the body surface physiological data to be measured in the target neural network; the sample body surface physiological data and the body surface physiological data to be measured are of the same data type, and both are time-series data;
the body surface physiological data includes at least one of: respiratory pressure data, brain electrical data, eye electrical data, myoelectrical data, electrocardiographic data, chest strap data, abdominal strap data, pulse wave data, leg movement data, snore data, pulse rate data and blood oxygen saturation data;
the model performance includes at least one of: accuracy, kappa coefficient and F1 score;
the zeroing processing is performed on at least part of the input features of the hidden layer, and the zeroing processing comprises the following steps:
carrying out frequency domain transformation processing on the input features of the hidden layer to obtain first frequency spectrum features;
setting zero at a position of the first frequency spectrum characteristic, the frequency of which is lower than a preset frequency, so as to obtain a second frequency spectrum characteristic;
performing frequency domain inverse transformation processing on the second frequency spectrum characteristic to obtain a target input characteristic after performing zero setting processing on at least part of input characteristics of the hidden layer;
the determining, based on the variation of the model performance corresponding to each zero setting process, the zero setting position of the input feature entering each hidden layer includes:
for each zeroing process, judging whether the model performance of the target neural network is improved after the zeroing process compared with before;
if the model performance is improved, judging whether the variation of the model performance of the target neural network before and after the zeroing process is greater than a preset variation;
and if the variation is greater than the preset variation, taking the zeroing position of the current zeroing process as a zeroing position of the input features entering the hidden layer.
2. The method of claim 1, wherein the hidden layer comprises at least one of: convolution layer, activation layer, pooling layer and full connection layer.
3. The method according to claim 2, wherein the number of the convolution layers is at least two, and the hidden layer subjected to zero setting is the first two convolution layers.
4. An information flow control apparatus of a neural network, comprising:
an input unit, configured to input sample body surface physiological data into a target neural network; the target neural network comprises an input layer, a plurality of hidden layers and an output layer;
a zeroing unit, configured to perform zeroing processing on at least part of the input features of at least one hidden layer;
a first determining unit, configured to determine, each time zeroing processing is performed on a hidden layer, the variation of the model performance of the target neural network before and after the zeroing processing;
a second determining unit, configured to determine, based on the variation of the model performance corresponding to each zeroing process, the zeroing position of the input features entering each hidden layer, so as to realize information flow control of the body surface physiological data to be measured in the target neural network; the sample body surface physiological data and the body surface physiological data to be measured are of the same data type, and both are time-series data;
the body surface physiological data includes at least one of: respiratory pressure data, brain electrical data, eye electrical data, myoelectrical data, electrocardiographic data, chest strap data, abdominal strap data, pulse wave data, leg movement data, snore data, pulse rate data and blood oxygen saturation data;
the model performance includes at least one of: accuracy, kappa coefficient and F1 score;
the zeroing unit is used for executing the following operations when executing the zeroing processing on at least part of the input features of the hidden layer:
carrying out frequency domain transformation processing on the input features of the hidden layer to obtain first frequency spectrum features;
setting zero at a position of the first frequency spectrum characteristic, the frequency of which is lower than a preset frequency, so as to obtain a second frequency spectrum characteristic;
performing frequency domain inverse transformation processing on the second frequency spectrum characteristic to obtain a target input characteristic after performing zero setting processing on at least part of input characteristics of the hidden layer;
when determining the zeroing position of the input features entering each hidden layer based on the variation of the model performance corresponding to each zeroing process, the second determining unit is configured to perform the following operations:
for each zeroing process, judging whether the model performance of the target neural network is improved after the zeroing process compared with before;
if the model performance is improved, judging whether the variation of the model performance of the target neural network before and after the zeroing process is greater than a preset variation;
and if the variation is greater than the preset variation, taking the zeroing position of the current zeroing process as a zeroing position of the input features entering the hidden layer.
5. An electronic device comprising a memory and a processor, the memory having stored therein a computer program, the processor implementing the method of any of claims 1-3 when the computer program is executed.
6. A computer readable storage medium, having stored thereon a computer program which, when executed in a computer, causes the computer to perform the method of any of claims 1-3.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202310999951.3A CN116720545B (en) | 2023-08-10 | 2023-08-10 | Information flow control method, device, equipment and medium of neural network |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202310999951.3A CN116720545B (en) | 2023-08-10 | 2023-08-10 | Information flow control method, device, equipment and medium of neural network |
Publications (2)
Publication Number | Publication Date |
---|---|
CN116720545A | 2023-09-08 |
CN116720545B | 2023-10-27 |
Family
ID=87866496
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202310999951.3A Active CN116720545B (en) | 2023-08-10 | 2023-08-10 | Information flow control method, device, equipment and medium of neural network |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN116720545B (en) |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN110300077A (en) * | 2019-04-15 | 2019-10-01 | 南京邮电大学 | The blind modulation recognition algorithm of spacing related MIMO system based on ExtremeLearningMachine |
CN115969329A (en) * | 2023-02-08 | 2023-04-18 | 长春理工大学 | Sleep staging method, system, device and medium |
CN116430164A (en) * | 2023-03-03 | 2023-07-14 | 中铁武汉勘察设计院有限公司 | Cable online monitoring method based on distributed temperature measurement and fault waveform analysis |
Family Cites Families (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20190005384A1 (en) * | 2017-06-29 | 2019-01-03 | General Electric Company | Topology aware graph neural nets |
Non-Patent Citations (1)
Title |
---|
由育阳 ; 由书凯 ; 高健凯 ; 杨志宏.《基于正态逆高斯和特征贡献度的睡眠分期实验研究》.《北京理工大学学报》.2019,第1-6页. * |
Also Published As
Publication number | Publication date |
---|---|
CN116720545A (en) | 2023-09-08 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Liu et al. | Electroencephalogram emotion recognition based on empirical mode decomposition and optimal feature selection | |
Güler et al. | Classification of EMG signals using PCA and FFT | |
Kamble et al. | A comprehensive survey on emotion recognition based on electroencephalograph (EEG) signals | |
Pan et al. | A transition-constrained discrete hidden Markov model for automatic sleep staging | |
Kalaivani et al. | Prediction of biomedical signals using deep learning techniques | |
Soni et al. | Graphical representation learning-based approach for automatic classification of electroencephalogram signals in depression | |
Xu et al. | Research on EEG channel selection method for emotion recognition | |
Djamal et al. | Significant variables extraction of post-stroke EEG signal using wavelet and SOM kohonen | |
Jameil et al. | Efficient CNN architecture on FPGA using high level module for healthcare devices | |
Modak et al. | Focal epileptic area recognition employing cross eeg rhythm spectrum images and convolutional neural network | |
Prakash et al. | A system for automatic cardiac arrhythmia recognition using electrocardiogram signal | |
Khosropanah et al. | A hybrid unsupervised approach toward EEG epileptic spikes detection | |
Saini et al. | Discriminatory features based on wavelet energy for effective analysis of electroencephalogram during mental tasks | |
Hassan et al. | Review of EEG Signals Classification Using Machine Learning and Deep-Learning Techniques | |
Almutairi et al. | Classification of sleep stages from EEG, EOG and EMG signals by SSNet | |
CN116720545B (en) | Information flow control method, device, equipment and medium of neural network | |
Rivera | Monitoring of micro-sleep and sleepiness for the drivers using EEG signal | |
Paithane et al. | Electroencephalogram signal analysis using wavelet transform and support vector machine for human stress recognition | |
Sanamdikar et al. | Classification of ECG Signal for Cardiac Arrhythmia Detection Using GAN Method | |
Boudaya et al. | A convolutional neural network for artifacts detection in EEG data | |
Gopan et al. | Adaptive neuro-fuzzy classifier for ‘Petit Mal’epilepsy detection using Mean Teager Energy | |
Lu | Human emotion recognition based on multi-channel EEG signals using LSTM neural network | |
Garg et al. | Exploring wrist pulse signals using empirical mode decomposition: emotions | |
Vedavathi et al. | Wavelet transform based neural network model to detect and characterise ECG and EEG signals simultaneously | |
Zhao et al. | GTSception: a deep learning eeg emotion recognition model based on fusion of global, time domain and frequency domain feature extraction |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||