CN116720545A - Information flow control method, device, equipment and medium of neural network - Google Patents

Information flow control method, device, equipment and medium of neural network Download PDF

Info

Publication number
CN116720545A
CN116720545A CN202310999951.3A CN202310999951A CN116720545A CN 116720545 A CN116720545 A CN 116720545A CN 202310999951 A CN202310999951 A CN 202310999951A CN 116720545 A CN116720545 A CN 116720545A
Authority
CN
China
Prior art keywords
data
zero setting
neural network
body surface
hidden layer
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202310999951.3A
Other languages
Chinese (zh)
Other versions
CN116720545B (en
Inventor
杨志宏
由育阳
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Institute of Technology BIT
Institute of Medicinal Plant Development of CAMS and PUMC
Original Assignee
Beijing Institute of Technology BIT
Institute of Medicinal Plant Development of CAMS and PUMC
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Institute of Technology BIT, Institute of Medicinal Plant Development of CAMS and PUMC filed Critical Beijing Institute of Technology BIT
Priority to CN202310999951.3A priority Critical patent/CN116720545B/en
Publication of CN116720545A publication Critical patent/CN116720545A/en
Application granted granted Critical
Publication of CN116720545B publication Critical patent/CN116720545B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/042Knowledge-based neural networks; Logical representations of neural networks
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61BDIAGNOSIS; SURGERY; IDENTIFICATION
    • A61B5/00Measuring for diagnostic purposes; Identification of persons
    • A61B5/24Detecting, measuring or recording bioelectric or biomagnetic signals of the body or parts thereof
    • A61B5/316Modalities, i.e. specific diagnostic methods
    • A61B5/318Heart-related electrical modalities, e.g. electrocardiography [ECG]
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61BDIAGNOSIS; SURGERY; IDENTIFICATION
    • A61B5/00Measuring for diagnostic purposes; Identification of persons
    • A61B5/24Detecting, measuring or recording bioelectric or biomagnetic signals of the body or parts thereof
    • A61B5/316Modalities, i.e. specific diagnostic methods
    • A61B5/369Electroencephalography [EEG]
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61BDIAGNOSIS; SURGERY; IDENTIFICATION
    • A61B5/00Measuring for diagnostic purposes; Identification of persons
    • A61B5/24Detecting, measuring or recording bioelectric or biomagnetic signals of the body or parts thereof
    • A61B5/316Modalities, i.e. specific diagnostic methods
    • A61B5/389Electromyography [EMG]
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61BDIAGNOSIS; SURGERY; IDENTIFICATION
    • A61B5/00Measuring for diagnostic purposes; Identification of persons
    • A61B5/24Detecting, measuring or recording bioelectric or biomagnetic signals of the body or parts thereof
    • A61B5/316Modalities, i.e. specific diagnostic methods
    • A61B5/398Electrooculography [EOG], e.g. detecting nystagmus; Electroretinography [ERG]
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61BDIAGNOSIS; SURGERY; IDENTIFICATION
    • A61B5/00Measuring for diagnostic purposes; Identification of persons
    • A61B5/48Other medical applications
    • A61B5/4806Sleep evaluation
    • A61B5/4812Detecting sleep stages or cycles
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61BDIAGNOSIS; SURGERY; IDENTIFICATION
    • A61B5/00Measuring for diagnostic purposes; Identification of persons
    • A61B5/48Other medical applications
    • A61B5/4806Sleep evaluation
    • A61B5/4815Sleep quality
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61BDIAGNOSIS; SURGERY; IDENTIFICATION
    • A61B5/00Measuring for diagnostic purposes; Identification of persons
    • A61B5/72Signal processing specially adapted for physiological signals or for diagnostic purposes
    • A61B5/7235Details of waveform analysis
    • A61B5/7264Classification of physiological signals or data, e.g. using neural networks, statistical classifiers, expert systems or fuzzy systems
    • A61B5/7267Classification of physiological signals or data, e.g. using neural networks, statistical classifiers, expert systems or fuzzy systems involving training the classification device
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/21Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
    • G06F18/213Feature extraction, e.g. by transforming the feature space; Summarisation; Mappings, e.g. subspace methods
    • G06F18/2131Feature extraction, e.g. by transforming the feature space; Summarisation; Mappings, e.g. subspace methods based on a transform domain processing, e.g. wavelet transform
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/24Classification techniques
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/0464Convolutional networks [CNN, ConvNet]
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N5/00Computing arrangements using knowledge-based models
    • G06N5/04Inference or reasoning models
    • G06N5/045Explanation of inference; Explainable artificial intelligence [XAI]; Interpretable artificial intelligence
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/70Arrangements for image or video recognition or understanding using pattern recognition or machine learning
    • G06V10/764Arrangements for image or video recognition or understanding using pattern recognition or machine learning using classification, e.g. of video objects
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/70Arrangements for image or video recognition or understanding using pattern recognition or machine learning
    • G06V10/77Processing image or video features in feature spaces; using data integration or data reduction, e.g. principal component analysis [PCA] or independent component analysis [ICA] or self-organising maps [SOM]; Blind source separation
    • G06V10/7715Feature extraction, e.g. by transforming the feature space, e.g. multi-dimensional scaling [MDS]; Mappings, e.g. subspace methods
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/70Arrangements for image or video recognition or understanding using pattern recognition or machine learning
    • G06V10/82Arrangements for image or video recognition or understanding using pattern recognition or machine learning using neural networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F2123/00Data types
    • G06F2123/02Data types in the time domain, e.g. time-series data
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F2218/00Aspects of pattern recognition specially adapted for signal processing
    • G06F2218/08Feature extraction
    • G06F2218/10Feature extraction by analysing the shape of a waveform, e.g. extracting parameters relating to peaks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F2218/00Aspects of pattern recognition specially adapted for signal processing
    • G06F2218/12Classification; Matching

Landscapes

  • Engineering & Computer Science (AREA)
  • Health & Medical Sciences (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • General Health & Medical Sciences (AREA)
  • Artificial Intelligence (AREA)
  • Medical Informatics (AREA)
  • Evolutionary Computation (AREA)
  • Molecular Biology (AREA)
  • Biomedical Technology (AREA)
  • Biophysics (AREA)
  • General Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Software Systems (AREA)
  • Pathology (AREA)
  • Computing Systems (AREA)
  • Veterinary Medicine (AREA)
  • Public Health (AREA)
  • Animal Behavior & Ethology (AREA)
  • Surgery (AREA)
  • Heart & Thoracic Surgery (AREA)
  • General Engineering & Computer Science (AREA)
  • Mathematical Physics (AREA)
  • Computational Linguistics (AREA)
  • Databases & Information Systems (AREA)
  • Multimedia (AREA)
  • Psychiatry (AREA)
  • Evolutionary Biology (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Fuzzy Systems (AREA)
  • Physiology (AREA)
  • Signal Processing (AREA)
  • Psychology (AREA)
  • Ophthalmology & Optometry (AREA)
  • Cardiology (AREA)
  • Measuring And Recording Apparatus For Diagnosis (AREA)

Abstract

The present invention relates to the field of neural networks, and in particular, to a method, an apparatus, a device, and a medium for controlling information flow of a neural network. The method comprises the following steps: inputting physiological data of a sample body surface into a target neural network; performing zero setting processing on at least part of input features of at least one hidden layer; when each pair of hidden layers carries out zero setting treatment, determining the variation of model performance of the target neural network before and after the zero setting treatment; based on the variation of model performance corresponding to each zero setting process, determining the zero setting position of the input features entering each hidden layer so as to realize the information flow control of the body surface physiological data to be measured in the target neural network; the sample body surface physiological data and the body surface physiological data to be measured are the same in data type, and are time sequence data. The technical scheme can enhance the interpretability of the neural network.

Description

Information flow control method, device, equipment and medium of neural network
Technical Field
The present invention relates to the field of neural networks, and in particular, to a method, an apparatus, a device, and a medium for controlling information flow of a neural network.
Background
Neural networks, also known as Artificial Neural Networks (ANNs) or Simulated Neural Networks (SNNs), are a subset of machine learning and are also the core of deep learning algorithms. The name and structure are inspired by the brain of a person, and can imitate the mutual transmission mode of biological neurons. The neural network is composed of node layers and comprises an input layer, a plurality of hidden layers and an output layer. Each node, also called an artificial neuron, is connected to another node with an associated weight and threshold. If the output of any individual node is above a specified threshold, that node is activated and data is sent to the next layer of the network; otherwise, the data is not passed on to the next layer of the network.
Neural networks rely on training data to learn and improve their accuracy over time. Once the learning algorithms are optimized, the accuracy is improved, and the learning algorithms become a powerful tool in the fields of computer science and artificial intelligence, so that the data can be classified and clustered rapidly. However, because neural networks have black box characteristics, and neural networks involve a large number of network parameters, these abstract network parameters are generally independent of the physical nature of the problem to be solved. Therefore, researchers cannot directly interpret the network parameters of neurons as an understandable knowledge, which results in opacity and unexplainability of the neural network, and thus interpretability is particularly important.
One of the research ideas for enhancing the interpretability of the neural network is to introduce priori knowledge of human beings so that the model learns the discriminant criteria of the human beings as much as possible. For example, the sleep staging area has a great deal of rich a priori knowledge, which has unique advantages over the computer vision and natural language areas. However, the division rules of the sleep stages have a certain ambiguity, and even different human experts may diverge to some extent.
Disclosure of Invention
The invention describes an information flow control method, device, equipment and medium of a neural network, which can enhance the interpretability of the neural network.
According to a first aspect, the present invention provides an information flow control method of a neural network, including:
inputting physiological data of a sample body surface into a target neural network; the target neural network comprises an input layer, a plurality of hidden layers and an output layer;
zero setting is carried out on at least part of input features of at least one hidden layer;
when each pair of hidden layers carries out zero setting treatment, determining the variation of model performance of the target neural network before and after the zero setting treatment;
based on the variation of model performance corresponding to each zero setting process, determining the zero setting position of the input features entering each hidden layer so as to realize the information flow control of the body surface physiological data to be measured in the target neural network; the sample body surface physiological data and the body surface physiological data to be detected are the same in data type, and are time sequence data.
According to one embodiment, the body surface physiological data includes at least one of: respiratory pressure data, brain electrical data, eye electrical data, myoelectrical data, electrocardiographic data, chest strap data, abdominal strap data, pulse wave data, leg movement data, snore data, pulse rate data, and blood oxygen saturation data.
According to one embodiment, the hidden layer comprises at least one of: convolution layer, activation layer, pooling layer and full connection layer.
According to one embodiment, the number of the convolution layers is at least two, and the hidden layer subjected to zero setting processing is the first two convolution layers.
According to one embodiment, the model performance includes at least one of: accuracy, kapa coefficient, and F1 fraction.
According to one embodiment, the zeroing processing is performed on at least part of the input features of the hidden layer, including:
carrying out frequency domain transformation processing on the input features of the hidden layer to obtain first frequency spectrum features;
setting zero at a position of the first frequency spectrum characteristic, the frequency of which is lower than a preset frequency, so as to obtain a second frequency spectrum characteristic;
and carrying out frequency domain inverse transformation processing on the second frequency spectrum characteristic to obtain a target input characteristic after carrying out zero setting processing on at least part of input characteristics of the hidden layer.
According to one embodiment, the determining the zeroing position of the input feature entering each hidden layer based on the variation of the model performance corresponding to each zeroing process includes:
aiming at each zero setting process, judging whether the model performance of the target neural network is improved before and after the zero setting process;
if the model performance is improved, judging whether the model performance variation of the target neural network before and after the zero setting treatment is greater than a preset variation;
and if the zero setting position is larger than the preset variation, taking the zero setting position of the current zero setting process as the zero setting position of the input feature entering the hidden layer.
According to a second aspect, the present invention provides an information flow control apparatus of a neural network, comprising:
the input unit is used for inputting the physiological data of the sample body surface into the target neural network; the target neural network comprises an input layer, a plurality of hidden layers and an output layer;
the zero setting unit is used for carrying out zero setting processing on at least part of input features of at least one hidden layer;
the first determining unit is used for determining the variation of the model performance of the target neural network before and after the zero setting processing when the zero setting processing is carried out on each hidden layer;
the second determining unit is used for determining the zero setting position of the input characteristic of each hidden layer based on the variation of the model performance corresponding to each zero setting process so as to realize the information flow control of the physiological data of the body surface to be detected in the target neural network; the sample body surface physiological data and the body surface physiological data to be detected are the same in data type, and are time sequence data.
According to a third aspect, the present invention provides an electronic device comprising a memory and a processor, the memory having stored therein a computer program, the processor implementing the method of the first aspect when executing the computer program.
According to a fourth aspect, the present invention provides a computer readable storage medium having stored thereon a computer program which, when executed in a computer, causes the computer to perform the method of the first aspect.
According to the information flow control method, the device, the equipment and the medium of the neural network, the zero setting processing is carried out on at least part of the input features of at least one hidden layer, and the zero setting position of the input features entering each hidden layer is determined based on the variation of the model performance corresponding to each zero setting processing, so that the effective information which cannot be utilized by the target neural network can be actively discarded, the complexity of feature distribution is reduced, and the model performance is improved. That is, the above technical solution can explain the model according to the output result, and enhance the interpretability of the neural network, that is, realize the layer-by-layer controllability of the information flow of the neural network, so as to realize the information flow control of the physiological data of the body surface to be measured in the target neural network.
Drawings
In order to more clearly illustrate the embodiments of the present invention or the technical solutions of the prior art, the drawings used in the description of the embodiments or the prior art will be briefly described, and it is obvious that the drawings described below are some embodiments of the present invention, and other drawings may be obtained according to these drawings without inventive effort for a person skilled in the art.
FIG. 1 illustrates a flow diagram of a method of information flow control of a neural network, according to one embodiment;
fig. 2 shows a schematic block diagram of an information flow control apparatus of a neural network according to one embodiment.
Detailed Description
The scheme provided by the invention is described below with reference to the accompanying drawings.
Fig. 1 shows a flow diagram of an information flow control method of a neural network according to one embodiment. It is understood that the method may be performed by any apparatus, device, platform, cluster of devices having computing, processing capabilities. As shown in fig. 1, the method includes:
step 100, inputting physiological data of a sample body surface into a target neural network; the target neural network comprises an input layer, a plurality of hidden layers and an output layer;
102, aiming at least one hidden layer, carrying out zero setting treatment on at least part of input features of the hidden layer;
104, determining the variation of model performance of the target neural network before and after the zero setting treatment when the zero setting treatment is carried out on each hidden layer;
step 106, determining the zero setting position of the input features entering each hidden layer based on the variation of the model performance corresponding to the zero setting processing performed each time so as to realize the information flow control of the body surface physiological data to be measured in the target neural network; the sample body surface physiological data and the body surface physiological data to be measured are the same in data type, and are time sequence data.
In this embodiment, by performing zero-setting processing on at least part of the input features of at least one hidden layer, and determining the zero-setting position of the input features entering each hidden layer based on the variation of the model performance corresponding to each zero-setting processing, effective information that cannot be utilized by the target neural network can be actively discarded, complexity of feature distribution is reduced, and model performance is improved. That is, the above technical solution can explain the model according to the output result, and enhance the interpretability of the neural network, that is, realize the layer-by-layer controllability of the information flow of the neural network, so as to realize the information flow control of the physiological data of the body surface to be measured in the target neural network.
In the field of data mining, time sequence classification task is an important research direction, and diagnosis and prediction of diseases can be facilitated by analyzing and mining time sequence physiological data, so that development of intelligent medical treatment is promoted.
In one embodiment of the invention, the body surface physiological data includes at least one of: respiratory pressure data, brain electrical data, eye electrical data, myoelectrical data, electrocardiographic data, chest strap data, abdominal strap data, pulse wave data, leg movement data, snore data, pulse rate data, and blood oxygen saturation data.
The sample body surface physiological data and the body surface physiological data to be measured are the same in data type, for example, the type of the sample body surface physiological data and the body surface physiological data to be measured are respiratory pressure data, brain electrical data, eye electrical data, myoelectrical data, electrocardiographic data, chest belt data, abdominal belt data, pulse wave data, leg movement data, snore data, pulse rate data or blood oxygen saturation data, and specific data types of the sample body surface physiological data and the body surface physiological data to be measured are not limited.
The following is presented in terms of sleep stage or sleep data (belonging to a subset of body surface physiological data).
Sleep staging is taken as a typical physiological time sequence classification task, is a basic research in the field of sleep monitoring, and is more and more widely focused by people. Sleep staging is an important means of assessing sleep quality and sleep disorders, and sleep professionals often determine sleep stages through Polysomnography (PSG), which consists of electroencephalogram (EEG), electrooculogram (EOG), electromyogram (EMG), and Electrocardiography (ECG), which can be used to diagnose sleep disorders and other common diseases. Among other things, EEG can record not only large PSG activity changes, but also drug effects during different sleep stages and awake states. In addition, sleep can be staged according to PSG waves, the control of a sleep phase is more accurate, and a PSG chart is a more objective, accurate, rapid and widely applied drug effect evaluation method in sleep drug effect research. On the other hand, the sleep PSG explores the distribution and activity of the PSG in various frequency bands and the regularity, detail change and predictability of a nonlinear system in the brain in a linear analysis and nonlinear analysis mode. The PSG analysis method is used as a research parameter for evaluating sleep quality and PSG rhythmicity change, and has certain universality, international acceptance and larger document support and reference values.
The current sleep stage model lacks the interpretability, namely, clinical specialists in the sleep field cannot understand the logic and reasons for judging the model, which means that the clinical specialists can only choose to completely trust or not trust the results of automatic sleep disorder diagnosis in the diagnosis process, which is one of the main barriers of the application of the deep learning-based automatic sleep stage model in the clinical environment. The success of deep learning is due in part to the ability of neural networks to progressively expose relevant useful information. The interpretability of an automatic sleep stage reasonably relates to, but is not limited to, which features the model learns from the input signal, whether the features relate to and reasonably explain the sleep stage. Therefore, interpretability is particularly important, since there is some ambiguity in the division rules of sleep stages, and even different human experts may diverge to some extent.
One of the research ideas for enhancing the interpretability of the model is to introduce priori knowledge of human beings so that the model learns the discriminant criteria of the human beings as much as possible. The sleep staging area has a great deal of rich a priori knowledge, which has unique advantages over the computer vision and natural language areas. Due to the high flexibility of the modern deep learning algorithm, the deep learning model is fit with not only common knowledge in data, but also individual knowledge of some specific data samples or 'noise' in data, so that generalization performance is poor.
It should be noted that "noise" herein is not noise in a conventional sense (e.g., noise filtered with a band-pass filter), but data points that may affect the performance of the model or data points (or effective information) that cannot be effectively utilized by the neural network. It is known to those skilled in the art that the generalization ability of a model can be enhanced with an increase in the number of training times if the effective information is true positive feedback information or true negative feedback information, and that the generalization ability of a model can be reduced if the effective information is doped with "noise", i.e., if the effective information is not known to those skilled in the art, i.e., there is a misjudgment, and if the "noise" data is utilized as effective information in a conventional training process, this is disadvantageous. Therefore, a large amount of experimental comparison of the model performance is used for screening out which information in the original data is the so-called noise, so that the model performance and generalization capability are improved. In addition, the body surface physiological data mentioned in the embodiment of the invention can be original data, namely the data which is filtered by a band-pass filter is not needed, so that the integrity of the original data can be ensured, and the real noise in the original data can be fully mined.
To improve the generalization performance of the model, it is necessary to remove the "noise" of some specific data. Based on this, the inventors creatively found during the development process that: the zeroing process can be considered to be performed on at least part of the input features of at least one hidden layer (namely, the features of each position are regarded as 'noise' to perform the zeroing process, so that the 'noise' is deleted), and as to whether the deleted 'noise' really belongs to the real 'noise', the variation of the model performance of the target neural network before and after the zeroing process can be determined when each pair of hidden layers are subjected to the zeroing process, so that the zeroing position of the input features entering each hidden layer can be determined based on the variation of the model performance corresponding to each zeroing process, and the information flow control of the physiological data of the body surface to be measured in the target neural network can be realized. Therefore, the scheme can delete unnecessary noise components in the body surface physiological data to realize model improvement generalization, so that the black box characteristic of the neural network can be broken (namely, the positions of the zeroed characteristics and the possible commonalities thereof can be known to explain the model).
In one embodiment of the invention, the hidden layer comprises at least one of: convolution layer, activation layer, pooling layer and full connection layer.
In one embodiment of the present invention, the number of the convolution layers is at least two, and the hidden layer performing the zero setting process is the first two convolution layers.
In this embodiment, the inventors have obtained through a large number of experiments that the hidden layer affecting the performance of the model is mainly the first two convolution layers, and the influence of other convolution layers, the activation layer, the pooling layer or the full-connection layer on the performance of the model can be simplified and ignored compared with the first two convolution layers. Therefore, from the viewpoints of improving the model interpretability and reducing the computational resource computation power, the hidden layer subjected to the zeroing processing can be determined as the first two layers of convolution layers.
In one embodiment of the invention, the model properties include at least one of: accuracy, kapa coefficient, and F1 fraction.
In one embodiment of the present invention, the step of "zeroing at least part of the input features of the hidden layer" may specifically include:
carrying out frequency domain transformation processing on the input features of the hidden layer to obtain first frequency spectrum features;
setting zero at a position of which the frequency is lower than a preset frequency in the first frequency spectrum characteristic to obtain a second frequency spectrum characteristic;
and carrying out frequency domain inverse transformation processing on the second frequency spectrum characteristic to obtain a target input characteristic after carrying out zero setting processing on at least part of input characteristics of the hidden layer.
In this embodiment, since the body surface physiological data is usually image data, in order to find the commonality rule of the "noise" thereof more quickly, the inventors creatively think that the image data can be subjected to frequency domain transformation processing to obtain one-dimensional spectrum characteristics. Of course, the frequency domain transformation processing is not required, that is, the matrix form of the image data is directly utilized to perform the zeroing experiment of each position until the zeroing position corresponding to the optimal model performance is found.
Still, the following is illustrative in the sleep stage field.
The sleep science field has a great deal of rich prior knowledge, which has unique advantages over the computer vision and natural language fields. According to the sleep event interpretation rules issued by the AASM american sleep medical institute and the sleep stage criteria, the whole night sleep record is divided into a number of 30 seconds windows called "epochs", each epoch window representing a sleep stage. As the frequency range in which each sleep stage is located is different. Thus, the inventors creatively found that: the frequency information is information effective for the deep sleep network. For this purpose, by converting the image data into the spectral features, the commonality of the "noise" thereof can be found more quickly (i.e., the position in the spectral features where the frequency is lower than the preset frequency is "noise", and the feature of the "noise" position is set to zero).
Wherein, the EEG expert divides the EEG into five basic rhythms of delta (delta) wave, theta (theta) wave, alpha (alpha) wave, sigma (sigma) wave and beta (beta) wave according to the frequency characteristic of the EEG. The alpha rhythm mainly occurs in the awake eye-closing state and in the REM stage, and the occurrence rate in the N1 stage is less than 50%; the beta rhythm mainly appears in a conscious eye-opening state, and the beta rhythm appears more after taking hypnotic drugs; the θ rhythm occurs mainly in the late phase of the N1 phase, with amplitude typically greater than 50 μv; the delta rhythm mainly occurs in the N3 phase, the amplitude is higher (more than or equal to 75 mu V), and the proportion in the N2 phase is less than 20%; the waveform of the sigma rhythm, also called spindle wave (fusiform wave), is a characteristic brain wave of the N2 phase, with a duration amplitude typically less than 50 μv. In addition to the five main rhythms, there are several non-fundamental waveforms with distinct characteristics, such as the K-complex wave occurring primarily in the N2 phase, a steep negative wave (upward) followed by a positive wave (downward) is typically observed; the saw tooth wave has steep wave shape or triangle shape, and is the main waveform in REM period.
In one embodiment of the present invention, the step of determining the zeroing position of the input feature entering each hidden layer based on the variation of the model performance corresponding to each zeroing process may specifically include:
aiming at the zero setting processing performed each time, judging whether the model performance of the target neural network is improved before and after the zero setting processing;
if the model performance is improved, judging whether the variation of the model performance of the target neural network before and after the zero setting treatment is greater than a preset variation;
and if the zero setting position is larger than the preset variation, taking the zero setting position of the current zero setting process as the zero setting position of the input feature entering the hidden layer.
In this embodiment, by sequentially determining whether the model performance of the target neural network before and after the zero setting process is improved and determining whether the variation of the model performance of the target neural network before and after the zero setting process is greater than a preset variation, the zero setting position of the input features entering each hidden layer can be determined, thereby realizing the deletion of "noise" data.
In addition, in general, when classifying time series data, the existing classification model requires that data input to the model be developed strictly in time series, for example, 1000 consecutive polysomnography. However, the above-mentioned technical solution realizes active control of the effective information flow in the deep network through the operation of the zeroing process, which can correct the inductive bias of the existing neural network (i.e. data with different time sequences can be input into the target neural network at will, for example, 500 frames before input), so as to achieve the improvement of the performance of the network for a certain sleep stage, which is difficult to achieve in the existing classification model, i.e. the existing classification model is the indiscriminate feature extraction, and does not perform the preferential feature extraction (i.e. zeroing specific positions) for some specific information.
The foregoing describes certain embodiments of the present invention. Other embodiments are within the scope of the following claims. In some cases, the actions or steps recited in the claims can be performed in a different order than the embodiments and still achieve desirable results. In addition, the processes depicted in the accompanying figures do not necessarily require the particular order shown, or sequential order, to achieve desirable results. In some embodiments, multitasking and parallel processing are also possible or may be advantageous.
According to an embodiment of another aspect, the present invention provides an information flow control apparatus of a neural network. Fig. 2 shows a schematic block diagram of an information flow control apparatus of a neural network according to one embodiment. It will be appreciated that the apparatus may be implemented by any means, device, platform or cluster of devices having computing, processing capabilities. As shown in fig. 2, the apparatus includes: an input unit 200, a zeroing unit 202, a first determining unit 204 and a second determining unit 206. Wherein the main functions of each constituent unit are as follows:
an input unit 200 for inputting sample body surface physiological data into a target neural network; the target neural network comprises an input layer, a plurality of hidden layers and an output layer;
a zeroing unit 202, configured to, for at least one of the hidden layers, perform zeroing processing on at least part of input features of the hidden layer;
a first determining unit 204, configured to determine, when each pair of hidden layers performs a zeroing process, a variation of model performance of the target neural network before and after the zeroing process;
a second determining unit 206, configured to determine, based on a variation of model performance corresponding to each zero setting process performed, a zero setting position of an input feature of each hidden layer, so as to implement information flow control of body surface physiological data to be measured in the target neural network; the sample body surface physiological data and the body surface physiological data to be detected are the same in data type, and are time sequence data.
In one embodiment of the invention, the body surface physiological data includes at least one of: respiratory pressure data, brain electrical data, eye electrical data, myoelectrical data, electrocardiographic data, chest strap data, abdominal strap data, pulse wave data, leg movement data, snore data, pulse rate data, and blood oxygen saturation data.
In one embodiment of the present invention, the hidden layer includes at least one of: convolution layer, activation layer, pooling layer and full connection layer.
In one embodiment of the present invention, the number of the convolution layers is at least two, and the hidden layer performing the zero setting process is the first two convolution layers.
In one embodiment of the invention, the model performance includes at least one of: accuracy, kapa coefficient, and F1 fraction.
In one embodiment of the present invention, the zeroing unit is configured to, when executing the zeroing processing on at least part of the input features of the hidden layer, perform the following operations:
carrying out frequency domain transformation processing on the input features of the hidden layer to obtain first frequency spectrum features;
setting zero at a position of the first frequency spectrum characteristic, the frequency of which is lower than a preset frequency, so as to obtain a second frequency spectrum characteristic;
and carrying out frequency domain inverse transformation processing on the second frequency spectrum characteristic to obtain a target input characteristic after carrying out zero setting processing on at least part of input characteristics of the hidden layer.
In one embodiment of the present invention, the second determining unit is configured to, when executing the change amount based on the model performance corresponding to each time of the zeroing process, determine a zeroing position of an input feature entering each of the hidden layers, execute the following operations:
aiming at each zero setting process, judging whether the model performance of the target neural network is improved before and after the zero setting process;
if the model performance is improved, judging whether the model performance variation of the target neural network before and after the zero setting treatment is greater than a preset variation;
and if the zero setting position is larger than the preset variation, taking the zero setting position of the current zero setting process as the zero setting position of the input feature entering the hidden layer.
According to an embodiment of another aspect, there is also provided a computer-readable storage medium having stored thereon a computer program which, when executed in a computer, causes the computer to perform the method described in connection with fig. 1.
According to an embodiment of yet another aspect, there is also provided an electronic device including a memory having executable code stored therein and a processor that, when executing the executable code, implements the method described in connection with fig. 1.
The embodiments of the present invention are described in a progressive manner, and the same and similar parts of the embodiments are all referred to each other, and each embodiment is mainly described in the differences from the other embodiments. In particular, for the device embodiments, since they are substantially similar to the method embodiments, the description is relatively simple, and reference is made to the description of the method embodiments in part.
Those skilled in the art will appreciate that in one or more of the examples described above, the functions described in the present invention may be implemented in hardware, software, firmware, or any combination thereof. When implemented in software, these functions may be stored on or transmitted over as one or more instructions or code on a computer-readable medium.
The foregoing embodiments have been provided for the purpose of illustrating the general principles of the present invention in further detail, and are not to be construed as limiting the scope of the invention, but are merely intended to cover any modifications, equivalents, improvements, etc. based on the teachings of the invention.

Claims (10)

1. An information flow control method of a neural network, comprising:
inputting physiological data of a sample body surface into a target neural network; the target neural network comprises an input layer, a plurality of hidden layers and an output layer;
zero setting is carried out on at least part of input features of at least one hidden layer;
when each pair of hidden layers carries out zero setting treatment, determining the variation of model performance of the target neural network before and after the zero setting treatment;
based on the variation of model performance corresponding to each zero setting process, determining the zero setting position of the input features entering each hidden layer so as to realize the information flow control of the body surface physiological data to be measured in the target neural network; the sample body surface physiological data and the body surface physiological data to be detected are the same in data type, and are time sequence data.
2. The method of claim 1, wherein the body surface physiological data comprises at least one of: respiratory pressure data, brain electrical data, eye electrical data, myoelectrical data, electrocardiographic data, chest strap data, abdominal strap data, pulse wave data, leg movement data, snore data, pulse rate data, and blood oxygen saturation data.
3. The method of claim 1, wherein the hidden layer comprises at least one of: convolution layer, activation layer, pooling layer and full connection layer.
4. A method according to claim 3, wherein the number of the convolution layers is at least two, and the hidden layer subjected to zero setting is the first two convolution layers.
5. The method of claim 1, wherein the model performance comprises at least one of: accuracy, kapa coefficient, and F1 fraction.
6. The method according to any one of claims 1-5, wherein zeroing at least part of the input features of the hidden layer comprises:
carrying out frequency domain transformation processing on the input features of the hidden layer to obtain first frequency spectrum features;
setting zero at a position of the first frequency spectrum characteristic, the frequency of which is lower than a preset frequency, so as to obtain a second frequency spectrum characteristic;
and carrying out frequency domain inverse transformation processing on the second frequency spectrum characteristic to obtain a target input characteristic after carrying out zero setting processing on at least part of input characteristics of the hidden layer.
7. The method of claim 6, wherein determining the zeroing locations of the input features into each hidden layer based on the amount of change in model performance corresponding to each zeroing process comprises:
aiming at each zero setting process, judging whether the model performance of the target neural network is improved before and after the zero setting process;
if the model performance is improved, judging whether the model performance variation of the target neural network before and after the zero setting treatment is greater than a preset variation;
and if the zero setting position is larger than the preset variation, taking the zero setting position of the current zero setting process as the zero setting position of the input feature entering the hidden layer.
8. An information flow control apparatus of a neural network, comprising:
the input unit is used for inputting the physiological data of the sample body surface into the target neural network; the target neural network comprises an input layer, a plurality of hidden layers and an output layer;
the zero setting unit is used for carrying out zero setting processing on at least part of input features of at least one hidden layer;
the first determining unit is used for determining the variation of the model performance of the target neural network before and after the zero setting processing when the zero setting processing is carried out on each hidden layer;
the second determining unit is used for determining the zero setting position of the input characteristic of each hidden layer based on the variation of the model performance corresponding to each zero setting process so as to realize the information flow control of the physiological data of the body surface to be detected in the target neural network; the sample body surface physiological data and the body surface physiological data to be detected are the same in data type, and are time sequence data.
9. An electronic device comprising a memory and a processor, the memory having stored therein a computer program, the processor implementing the method of any of claims 1-7 when the computer program is executed.
10. A computer readable storage medium, having stored thereon a computer program which, when executed in a computer, causes the computer to perform the method of any of claims 1-7.
CN202310999951.3A 2023-08-10 2023-08-10 Information flow control method, device, equipment and medium of neural network Active CN116720545B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202310999951.3A CN116720545B (en) 2023-08-10 2023-08-10 Information flow control method, device, equipment and medium of neural network

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202310999951.3A CN116720545B (en) 2023-08-10 2023-08-10 Information flow control method, device, equipment and medium of neural network

Publications (2)

Publication Number Publication Date
CN116720545A true CN116720545A (en) 2023-09-08
CN116720545B CN116720545B (en) 2023-10-27

Family

ID=87866496

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202310999951.3A Active CN116720545B (en) 2023-08-10 2023-08-10 Information flow control method, device, equipment and medium of neural network

Country Status (1)

Country Link
CN (1) CN116720545B (en)

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20190005384A1 (en) * 2017-06-29 2019-01-03 General Electric Company Topology aware graph neural nets
CN110300077A (en) * 2019-04-15 2019-10-01 南京邮电大学 The blind modulation recognition algorithm of spacing related MIMO system based on ExtremeLearningMachine
CN115969329A (en) * 2023-02-08 2023-04-18 长春理工大学 Sleep staging method, system, device and medium
CN116430164A (en) * 2023-03-03 2023-07-14 中铁武汉勘察设计院有限公司 Cable online monitoring method based on distributed temperature measurement and fault waveform analysis

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20190005384A1 (en) * 2017-06-29 2019-01-03 General Electric Company Topology aware graph neural nets
CN110300077A (en) * 2019-04-15 2019-10-01 南京邮电大学 The blind modulation recognition algorithm of spacing related MIMO system based on ExtremeLearningMachine
CN115969329A (en) * 2023-02-08 2023-04-18 长春理工大学 Sleep staging method, system, device and medium
CN116430164A (en) * 2023-03-03 2023-07-14 中铁武汉勘察设计院有限公司 Cable online monitoring method based on distributed temperature measurement and fault waveform analysis

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
由育阳; 由书凯; 高健凯; 杨志宏: "《基于正态逆高斯和特征贡献度的睡眠分期实验研究》", 《北京理工大学学报》, pages 1 - 6 *

Also Published As

Publication number Publication date
CN116720545B (en) 2023-10-27

Similar Documents

Publication Publication Date Title
Djemal et al. EEG-based computer aided diagnosis of autism spectrum disorder using wavelet, entropy, and ANN
Güler et al. Classification of EMG signals using PCA and FFT
Liu et al. Electroencephalogram emotion recognition based on empirical mode decomposition and optimal feature selection
Abbasi et al. EEG-based neonatal sleep-wake classification using multilayer perceptron neural network
Kalaivani et al. Prediction of biomedical signals using deep learning techniques
Soni et al. Graphical representation learning-based approach for automatic classification of electroencephalogram signals in depression
Djamal et al. Significant variables extraction of post-stroke EEG signal using wavelet and SOM kohonen
Modak et al. Focal epileptic area recognition employing cross EEG rhythm spectrum images and convolutional neural network
Prakash et al. A system for automatic cardiac arrhythmia recognition using electrocardiogram signal
Khosropanah et al. A hybrid unsupervised approach toward EEG epileptic spikes detection
Murugappan et al. Recurrent quantification analysis-based emotion classification in stroke using electroencephalogram signals
Jameil et al. Efficient cnn architecture on fpga using high level module for healthcare devices
Begum et al. Automated detection of abnormalities in ECG signals using deep neural network
Azar Neuro-fuzzy system for cardiac signals classification
Saini et al. Discriminatory features based on wavelet energy for effective analysis of electroencephalogram during mental tasks
CN116720545B (en) Information flow control method, device, equipment and medium of neural network
Paithane et al. Electroencephalogram signal analysis using wavelet transform and support vector machine for human stress recognition
Boudaya et al. A convolutional neural network for artifacts detection in EEG data
Zhao et al. GTSception: a deep learning eeg emotion recognition model based on fusion of global, time domain and frequency domain feature extraction
Bano et al. Prediction of Covid-19 and post Covid-19 patients with reduced feature extraction using Machine Learning Techniques
Lu Human emotion recognition based on multi-channel EEG signals using LSTM neural network
Umar et al. Epilepsy detection using artificial neural network and grasshopper optimization algorithm (GOA)
Gopan et al. Adaptive neuro-fuzzy classifier for ‘Petit Mal’epilepsy detection using Mean Teager Energy
Vedavathi et al. Wavelet transform based neural network model to detect and characterise ECG and EEG signals simultaneously
Almutairi et al. Classification of sleep stages from EEG, EOG and EMG signals by SSNet

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant