WO2021237917A1 - Self-adaptive cognitive activity recognition method and apparatus, and storage medium - Google Patents

Self-adaptive cognitive activity recognition method and apparatus, and storage medium Download PDF

Info

Publication number
WO2021237917A1
WO2021237917A1 PCT/CN2020/104558 CN2020104558W WO2021237917A1 WO 2021237917 A1 WO2021237917 A1 WO 2021237917A1 CN 2020104558 W CN2020104558 W CN 2020104558W WO 2021237917 A1 WO2021237917 A1 WO 2021237917A1
Authority
WO
WIPO (PCT)
Prior art keywords
eeg
information
reward
model
cognitive activity
Prior art date
Application number
PCT/CN2020/104558
Other languages
French (fr)
Chinese (zh)
Inventor
王洪涛
唐聪
许林峰
岳洪伟
Original Assignee
五邑大学
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 五邑大学 filed Critical 五邑大学
Publication of WO2021237917A1 publication Critical patent/WO2021237917A1/en

Links

Images

Classifications

    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61BDIAGNOSIS; SURGERY; IDENTIFICATION
    • A61B5/00Measuring for diagnostic purposes; Identification of persons
    • A61B5/24Detecting, measuring or recording bioelectric or biomagnetic signals of the body or parts thereof
    • A61B5/316Modalities, i.e. specific diagnostic methods
    • A61B5/369Electroencephalography [EEG]
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61BDIAGNOSIS; SURGERY; IDENTIFICATION
    • A61B5/00Measuring for diagnostic purposes; Identification of persons
    • A61B5/16Devices for psychotechnics; Testing reaction times ; Devices for evaluating the psychological state
    • A61B5/165Evaluating the state of mind, e.g. depression, anxiety
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61BDIAGNOSIS; SURGERY; IDENTIFICATION
    • A61B5/00Measuring for diagnostic purposes; Identification of persons
    • A61B5/24Detecting, measuring or recording bioelectric or biomagnetic signals of the body or parts thereof
    • A61B5/316Modalities, i.e. specific diagnostic methods
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61BDIAGNOSIS; SURGERY; IDENTIFICATION
    • A61B5/00Measuring for diagnostic purposes; Identification of persons
    • A61B5/72Signal processing specially adapted for physiological signals or for diagnostic purposes
    • A61B5/7235Details of waveform analysis
    • A61B5/7264Classification of physiological signals or data, e.g. using neural networks, statistical classifiers, expert systems or fuzzy systems
    • A61B5/7267Classification of physiological signals or data, e.g. using neural networks, statistical classifiers, expert systems or fuzzy systems involving training the classification device
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/24Classification techniques
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/045Combinations of networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods

Definitions

  • the present invention relates to the field of artificial intelligence, in particular to an adaptive cognitive activity recognition method, device and storage medium.
  • Electroencephalogram is an electrophysiological monitoring index that can analyze the state and activity of the brain by measuring the voltage fluctuations of the ion current in the brain neurons.
  • EEG signals can be collected in a non-invasive and non-fixed manner with portable and off-the-shelf equipment.
  • the EEG signal classification algorithm has been studied for a series of practical applications.
  • the accuracy and robustness of the EEG classification model is an important measure of cognitive activities such as movement intention recognition and emotion recognition.
  • the cognitive activity recognition system builds a bridge between the internal cognitive world and the external physical world. They are recently used in assisted living, smart home and entertainment industries; motor image recognition based on EEG can help people with disabilities perform basic activities necessary for life; Emotion recognition based on EEG signals can be used to detect the emotional state of current patients, such as Depression, anxiety, etc.
  • the classification of cognitive activities faces several challenges.
  • most of the existing EEG classification research uses EEG data preprocessing and feature extraction methods (for example, band-pass filtering, discrete wavelet transform, and feature selection) that are time-consuming and highly dependent on professional knowledge.
  • most of the current EEG classification methods are designed based on the knowledge of a specific field, so they may fail or even fail in different situations.
  • the present invention aims to solve at least one of the technical problems existing in the prior art.
  • the present invention proposes an adaptive cognitive activity recognition method, which can recognize cognitive activities more effectively and improve the accuracy of recognition.
  • the present invention also provides an adaptive cognitive activity recognition device applying the above-mentioned adaptive cognitive activity recognition method.
  • the present invention also provides a computer-readable storage medium applying the above-mentioned adaptive cognitive activity recognition method.
  • the best attention area information is input into the reward model to obtain the classification recognition result.
  • the adaptive cognitive activity recognition method has at least the following beneficial effects: a general framework for automatic cognitive activity recognition is proposed to promote the scope of various cognitive application fields, including motor image recognition and emotion recognition . Design a reinforced selective attention model by combining deep reinforcement learning and attention mechanism to automatically extract robust and unique deep features; to encourage the model to select the best attention area that can achieve the highest classification accuracy; in addition, it is customized according to the cognitive activity recognition environment State and action; The reward model is also used to classify the selected original EEG data, which achieves higher recognition accuracy than traditional methods and lower delay.
  • the processing the original EEG data to obtain EEG signals includes:
  • the combined EEG data is selected to obtain EEG signals.
  • the inputting the EEG state information and the reward information into the enhanced selection attention model to obtain the best attention area information includes:
  • the EEG state information and the reward information are received through the enhanced selective attention model to obtain EEG evaluation information;
  • the EEG evaluation information is fed back to the state transition model to drive the state transition model to perform EEG state information conversion until the enhanced selection attention model obtains the best attention area information.
  • the reward model includes a convolutional mapping network and a classifier.
  • the inputting the best attention area information into the reward model to obtain the classification recognition result includes:
  • the best attention area information is input to the convolutional mapping network to obtain the spatial dependence feature
  • the spatially dependent features are input to the classifier to obtain classification recognition results.
  • the convolutional mapping network includes an input layer, a convolutional layer, a fully connected layer, an extraction feature layer, and an output layer.
  • the input layer, the convolutional layer, the fully connected layer, The extracted feature layer and the output layer are sequentially connected.
  • the collection unit is used to collect the original EEG data
  • the processing unit is used to process the original EEG data to obtain EEG signals
  • the detection unit is configured to input the EEG signal into the state transition model and the reward model respectively to obtain EEG state information and reward information;
  • a screening unit configured to input the EEG state information and the reward information into the enhanced selective attention model to obtain the best attention area information
  • the recognition unit is used to input the best attention area information into the reward model to obtain a classification recognition result.
  • the processing unit includes:
  • the copying unit is used for copying the original EEG data
  • the shuffling unit is used for shuffling the original EEG data processed by the copying unit to obtain combined EEG data
  • the selecting unit is used to select the combined EEG data to obtain EEG signals.
  • the detection unit includes:
  • the state transition unit is used to input the EEG signal into the state transition model to obtain EEG state information
  • the reward unit is used to input brain electrical signals into the reward model to obtain reward information.
  • the adaptive cognitive activity recognition device has at least the following beneficial effects: a reinforced selective attention model is designed by combining deep reinforcement learning and an attention mechanism to automatically extract robust and unique deep features; to encourage model selection The best attention area that can achieve the highest classification accuracy; in addition, the state and action are customized according to the cognitive activity recognition environment; the reward model is also used to classify the selected original EEG data, which achieves higher recognition accuracy than traditional methods , And the latency is low.
  • the adaptive cognitive activity recognition method according to the embodiment of the first aspect of the present invention can be applied.
  • the computer-readable storage medium has at least the following beneficial effects: a reinforced selective attention model is designed by combining deep reinforcement learning and an attention mechanism to automatically extract robust and unique deep features; to encourage model selection to achieve The best attention area with the highest classification accuracy; in addition, the state and action are customized according to the recognition environment of cognitive activities; the reward model is also used to classify the selected original EEG data, which achieves higher recognition accuracy than traditional methods, and The latency is low.
  • FIG. 1 is a flowchart of an adaptive cognitive activity recognition method according to Embodiment 1 of the present invention
  • FIG. 2 is a working flow chart of processing raw EEG data in the adaptive cognitive activity recognition method according to the first embodiment of the present invention
  • FIG. 3 is a working flow chart of screening the best attention area in the adaptive cognitive activity recognition method according to the first embodiment of the present invention
  • FIG. 4 is a work flow chart of obtaining classification and recognition results in the adaptive cognitive activity recognition method according to the first embodiment of the present invention
  • FIG. 5 is a schematic diagram of the device structure of an adaptive cognitive activity recognition device according to the second embodiment of the present invention.
  • Fig. 6 is a detailed flowchart of the method for adaptive cognitive activity recognition according to the first embodiment of the present invention.
  • Embodiment 1 of the present invention provides an adaptive cognitive activity recognition method, one of which includes but is not limited to the following steps:
  • Step S100 collecting raw brain electricity data.
  • the original EEG data is collected in this step to prepare for subsequent adaptive cognitive activity recognition and provide a basis for collecting data.
  • Step S200 processing the original EEG data to obtain EEG signals.
  • this step organizes and processes the collected raw EEG data, and further prepares for adaptive cognitive activity recognition.
  • step S300 the EEG signals are input into the state transition model and the reward model respectively, and the EEG state information and reward information are obtained respectively.
  • the state transition model in this step is used to select a certain behavior in the EEG signal to obtain EEG state information.
  • the reward model also evaluates the quality of the behavior by rewarding the behavior. Reward information, so as to prepare the prerequisites for selecting the best attention area information.
  • Step S400 input the EEG state information and the reward information into the enhanced selective attention model to obtain the best attention area information.
  • the enhanced selected attention model in this step can evaluate EEG state information and reward information. When it is found that it is not the best attention area information, it will be fed back to the state transition model, making the state transition model Reselect another attention area in the EEG signal and re-evaluate until the best attention area information is obtained.
  • Step S500 Input the best attention area information into the reward model to obtain a classification and recognition result.
  • the best attention area information obtained in this step is input into the reward model, and the classification and recognition result of the original EEG data is obtained through the reward model.
  • step S200 in this embodiment may include but is not limited to the following steps:
  • step S210 the original EEG data is copied and shuffled to obtain combined EEG data.
  • step S220 the combined EEG data is selected to obtain EEG signals.
  • this step selects the original EEG data that has been copied and shuffled to obtain the EEG signal, and then input it into the state transition model and the reward model to select the best attention area in the original EEG data .
  • step S400 of this embodiment the following steps may be included but not limited to:
  • step S410 the EEG state information and the reward information are received through the enhanced selective attention model to obtain EEG evaluation information.
  • the enhanced selective attention model in this step receives the EEG state information and reward information, and then comprehensively evaluates the EEG state information and reward information.
  • the reward information can reveal a certain selected behavior of the input EEG signal
  • the enhanced selection attention model can give feedback to the state transition model. When the selected area is not the best attention area, it will drive the state transition model to reselect another attention area.
  • step S420 the EEG evaluation information is fed back to the state transition model to drive the state transition model to perform EEG state information conversion until the enhanced selection attention model obtains the best attention area information.
  • the enhanced selection attention model in this step can feed back to the state transition model.
  • the state transition model will be driven to reselect another attention area until the selected attention area is strengthened.
  • the model selects the best attention area information.
  • step S500 of this embodiment the following steps may be included but not limited to:
  • step S510 the best attention area information is input to the convolutional mapping network to obtain the spatial dependence feature.
  • the best attention region information obtained in this step is input into the convolutional mapping network, and the convolutional mapping network is used to extract spatial dependent features.
  • step S520 the spatially dependent features are input to the classifier to obtain a classification recognition result.
  • this step inputs the spatially dependent features obtained above into the classifier, so that the classifier can be used for adaptive cognitive activity recognition.
  • the reward model includes a convolutional mapping network and a classifier.
  • the convolutional mapping network can perform spatially dependent feature extraction; the classifier can perform cognitive activity recognition based on the spatially dependent feature extracted by the convolutional mapping network.
  • the convolutional mapping network includes an input layer, a convolutional layer, a fully connected layer, an extraction feature layer, and an output layer.
  • the input layer, the convolutional layer, the fully connected layer, The extracted feature layer and the output layer are sequentially connected.
  • the enhanced selective attention model includes a competitive DQN network, a fully connected layer, a value function V, an advantage function A and Q function, and a competitive DQN network and a fully connected layer are connected.
  • the value function V and the advantage function A are connected with the completely The connection layer is connected, and the value function V and the advantage function A are also connected with the Q function to realize a strengthened selection attention mechanism.
  • a method using the spatial relationship between EEG signals is designed.
  • the signals belonging to different brain activities should have different spatial dependencies. Copy and shuffle the input EEG signal according to the dimensions. In this method, all possible dimensional arrangements have the appearance of equal probability.
  • x ik represents the k-th dimension value in the i-th sample.
  • EEG data is usually connected according to the distribution of biomedical EEG channels.
  • the order of biomedical dimensions may not show the best spatial dependence.
  • the exhaust method is computationally too expensive, which makes it impossible to exhaust all possible dimensional arrangements.
  • RS copy and shuffle
  • x'i contains more different combinations of dimensions. Note that this RS operation is only performed once on a specific input data set in order to provide a stable environment for subsequent reinforcement learning.
  • a focus area is introduced to focus on the feature dimension segment.
  • the attention area is optimized through deep reinforcement learning, and it turns out that this area is stable and performs well in strategy learning.
  • the detection of the best attention area includes two key components: environment (including state transition and reward model) and enhanced selective attention mechanism. Three elements (state s, behavior a, and reward r) are exchanged in the interaction between the environment and the subject. These three elements are all customized according to the background of this research. Next, explain the design of the key components of the deep reinforcement learning structure:
  • the state transition selects an operation to be implemented according to the strategy ⁇ that strengthens the selective attention mechanism:
  • the attention area will move a random distance d ⁇ [1,d u ], where d u is the upper limit.
  • d u is the upper limit.
  • the area will move left or right with step d.
  • the state start index or end index exceeds the boundary, a crop operation is performed. For example, if (Below the lower boundary 0), we crop the starting index to
  • F the reward model F as a combination of convolutional mapping and classification. Because in the actual method optimization, the higher the accuracy, the more difficult it is to increase the classification accuracy. To encourage higher levels of accuracy, we designed a non-linear reward function:
  • acc means classification accuracy.
  • This function consists of two parts; the first part is a normalized exponential function with exponent acc ⁇ [0,1]. This part encourages the reinforcement learning algorithm to search for a better st to obtain a higher acc.
  • the motivation of the exponential function is that the growth rate of the reward increases as the accuracy increases.
  • the second part is to pay attention to the length of the zone to keep the penalty factor shorter, and ⁇ is the penalty factor.
  • the purpose of deep reinforcement learning is to learn the best attention area that leads to the greatest reward
  • the ⁇ -greedy method is adopted, which selects random actions with a probability of 1- ⁇ or the optimal Q function with a probability of ⁇ Choose an action behavior.
  • ⁇ [0,1] is a decay parameter that weighs the importance of immediate and future rewards
  • n represents the number of subsequent steps.
  • V( st ) estimates the expected reward.
  • the Q function is related to the pair (s t , a t ), and the value function is only related to s t .
  • ⁇ ' ⁇ ⁇ are the parameters in the confrontation DQN network, and will be automatically optimized.
  • the above formula is not recognized, in fact, can not be recovered uniquely V (s t) and A (s t, a t) given Q (s t, a t) .
  • the advantage function can be forced to be equal to zero on the selected action. In other words, let the network implement forward mapping:
  • the value function V( st ) is forced to learn the estimation of the value function, while the other direction produces the estimation of the advantage function.
  • the gradient update method is
  • a convolutional layer is used with many filters to learn the attention area Perform a scan.
  • the convolutional mapping structure consists of five layers: the input layer receives the learned attention area, the convolutional layer is followed by a fully connected layer, and the output layer. Compare the single hot spot live with the output layer to calculate the training loss.
  • the Relu nonlinear activation function is applied to the convolution output.
  • the convolutional layer is described as follows:
  • the pooling layer aims to reduce the redundant information in the convolution output to reduce the computational cost. As far as we are concerned, try to retain as much information as possible. Therefore, the method does not use a pooling layer. Then, in the fully connected layer and the output layer
  • W f , W o , b f , and b o respectively represent the corresponding weights and deviations.
  • y' represents the predicted label.
  • the cost function is measured by cross entropy, and the l 2 -norm (with parameter ⁇ ) is used as regularization to prevent overfitting:
  • the AdamOptimizer algorithm optimizes the cost function.
  • the fully connected layer extracts features and inputs them into the lightweight nearest neighbor classifier.
  • the convolutional map is updated iteratively for N'times.
  • the state transition and reward model In each step t, the state transition will select a behavior to update according to the feedback of the enhanced selective mechanism.
  • the reward model evaluates the quality of the area of interest through reward scores.
  • the competitive DQN is used to find the best attention area, which will be fed into the convolutional mapping process to extract the spatial dependent representation.
  • the features represented will be used for classification.
  • the reward model is a combination of convolutional mapping and classifier.
  • the framework can directly process raw EEG data without feature extraction. In addition, it can automatically select distinguishable feature dimensions for different EEG data, thereby achieving high availability.
  • the method not only surpasses several latest baselines to a large extent, but also shows low latency and high flexibility in dealing with multiple EEG signal channels and incomplete EEG signals.
  • the method is suitable for a wider range of application scenarios, such as motor image recognition. And emotional state recognition.
  • the enhanced selective attention model is designed by combining deep reinforcement learning and attention mechanism to automatically extract robust and unique deep features; to encourage the model to select the best attention area that can achieve the highest classification accuracy; in addition, according to the recognition Knowing activities recognize the customized state and actions of the environment; it also uses the reward model to classify the selected original EEG data, which achieves higher recognition accuracy than traditional methods and lower delay.
  • the second embodiment of the present invention provides an adaptive cognitive activity recognition device 1000, including:
  • the collecting unit 1100 is used to collect raw brain electricity data
  • the processing unit 1200 is configured to process the original EEG data to obtain EEG signals;
  • the detection unit 1300 is configured to input the EEG signal into the state transition model and the reward model, respectively, to obtain EEG state information and reward information respectively;
  • the screening unit 1400 is configured to input the EEG state information and the reward information into the enhanced selection attention model to obtain the best attention area information;
  • the recognition unit 1500 is configured to input the best attention area information into the reward model to obtain a classification recognition result.
  • the processing unit 1200 includes:
  • the copying unit 1210 is used for copying the original EEG data
  • the shuffling unit 1220 is used for shuffling the original EEG data processed by the copying unit 1210 to obtain combined EEG data;
  • the selecting unit 1230 is used to select the combined EEG data to obtain EEG signals.
  • the detection unit 1300 includes:
  • the state transition unit 1310 is used to input brain electrical signals into the state transition model to obtain brain electrical state information
  • the reward unit 1320 is used to input brain electrical signals into the reward model to obtain reward information.
  • the screening unit 1400 includes:
  • the selected attention unit 1410 which is to strengthen the selected attention model, can receive the EEG state information and the reward information to obtain EEG evaluation information; and then feed back the EEG evaluation information to the state transition model to The state transition model is driven to perform EEG state information conversion until the enhanced selection attention model obtains the best attention area information.
  • the recognition unit 1500 is the above-mentioned reward unit 1320, and can perform adaptive cognitive activity recognition.
  • the enhanced selective attention model is designed by combining deep reinforcement learning and attention mechanism to automatically extract robust and unique deep features; to encourage the model to select the best attention area that can achieve the highest classification accuracy; in addition, according to the recognition Knowing activities recognize the customized state and actions of the environment; it also uses the reward model to classify the selected original EEG data, which achieves higher recognition accuracy than traditional methods and lower delay.
  • the third embodiment of the present invention also provides a computer-readable storage medium that stores executable instructions of the adaptive cognitive activity recognition device, and the executable instructions of the adaptive cognitive activity recognition device are used to enable self
  • the adaptive cognitive activity recognition device executes the above-mentioned adaptive cognitive activity recognition method, for example, executes the method steps S100 to S500 in FIG. 1 described above to realize the functions of the units 1000-1500 in FIG. 5.

Landscapes

  • Health & Medical Sciences (AREA)
  • Engineering & Computer Science (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Physics & Mathematics (AREA)
  • Biophysics (AREA)
  • Artificial Intelligence (AREA)
  • Molecular Biology (AREA)
  • General Health & Medical Sciences (AREA)
  • Biomedical Technology (AREA)
  • Theoretical Computer Science (AREA)
  • Evolutionary Computation (AREA)
  • Medical Informatics (AREA)
  • Veterinary Medicine (AREA)
  • Data Mining & Analysis (AREA)
  • Public Health (AREA)
  • Psychiatry (AREA)
  • Heart & Thoracic Surgery (AREA)
  • Surgery (AREA)
  • Animal Behavior & Ethology (AREA)
  • Pathology (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Mathematical Physics (AREA)
  • Psychology (AREA)
  • Software Systems (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Computing Systems (AREA)
  • Computational Linguistics (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Evolutionary Biology (AREA)
  • Signal Processing (AREA)
  • Physiology (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Child & Adolescent Psychology (AREA)
  • Developmental Disabilities (AREA)
  • Educational Technology (AREA)
  • Hospice & Palliative Care (AREA)
  • Fuzzy Systems (AREA)
  • Social Psychology (AREA)
  • Measurement And Recording Of Electrical Phenomena And Electrical Characteristics Of The Living Body (AREA)

Abstract

Disclosed are a self-adaptive cognitive activity recognition method and apparatus, and a storage medium. The method comprises: collecting original electroencephalogram data; processing the original electroencephalogram data to obtain an electroencephalogram signal; respectively inputting the electroencephalogram signal into a state transition model and a reward model, so as to respectively obtain electroencephalogram state information and reward information; inputting the electroencephalogram state information and the reward information into a reinforced selective attention model to obtain optimal attention area information; and inputting the optimal attention area information into the reward model to obtain a classification recognition result. A cognitive activity can be recognized more effectively, and the recognition accuracy is also improved.

Description

一种自适应认知活动识别方法、装置及存储介质Self-adaptive cognitive activity recognition method, device and storage medium 技术领域Technical field
本发明涉及人工智能领域,特别涉及一种自适应认知活动识别方法、装置及存储介质。The present invention relates to the field of artificial intelligence, in particular to an adaptive cognitive activity recognition method, device and storage medium.
背景技术Background technique
脑电图(EEG)是一种电生理监测指标,可通过测量大脑神经元内离子电流的电压波动来分析大脑状态和活动。在实践中,脑电信号可以通过便携式和现成的设备以非侵入性和非固定的方式进行收集。EEG信号分类算法已针对一系列实际应用进行了研究。脑电分类模型的准确性和鲁棒性是运动意图识别,情绪识别等认知活动的重要衡量指标。认知活动识别系统在内部认知世界和外部物理世界之间架起了一座桥梁。它们最近用于辅助生活,智能家居和娱乐业;基于脑电图的运动想象识别可帮助残障人士进行生活必须的基本活动;基于脑电信号的情绪识别可用于检测当前病患的情感状态,例如抑郁、焦虑等。Electroencephalogram (EEG) is an electrophysiological monitoring index that can analyze the state and activity of the brain by measuring the voltage fluctuations of the ion current in the brain neurons. In practice, EEG signals can be collected in a non-invasive and non-fixed manner with portable and off-the-shelf equipment. The EEG signal classification algorithm has been studied for a series of practical applications. The accuracy and robustness of the EEG classification model is an important measure of cognitive activities such as movement intention recognition and emotion recognition. The cognitive activity recognition system builds a bridge between the internal cognitive world and the external physical world. They are recently used in assisted living, smart home and entertainment industries; motor image recognition based on EEG can help people with disabilities perform basic activities necessary for life; Emotion recognition based on EEG signals can be used to detect the emotional state of current patients, such as Depression, anxiety, etc.
认知活动的分类面临若干挑战。首先,大多数现有的EEG分类研究使用的EEG数据预处理和特征提取方法(例如,带通滤波,离散小波变换和特征选择)既耗时又高度依赖专业知识。其次,目前大多数的EEG分类方法都是基于特定领域的知识设计的,因此在不同情况下可能会失效甚至失败。The classification of cognitive activities faces several challenges. First, most of the existing EEG classification research uses EEG data preprocessing and feature extraction methods (for example, band-pass filtering, discrete wavelet transform, and feature selection) that are time-consuming and highly dependent on professional knowledge. Secondly, most of the current EEG classification methods are designed based on the knowledge of a specific field, so they may fail or even fail in different situations.
发明内容Summary of the invention
本发明旨在至少解决现有技术中存在的技术问题之一。The present invention aims to solve at least one of the technical problems existing in the prior art.
为此,本发明提出一种自适应认知活动识别方法,能够更加有效地认知活动进行识别,提高了识别的准确性。For this reason, the present invention proposes an adaptive cognitive activity recognition method, which can recognize cognitive activities more effectively and improve the accuracy of recognition.
本发明还提出一种应用上述自适应认知活动识别方法的自适应认知活动识别装置。The present invention also provides an adaptive cognitive activity recognition device applying the above-mentioned adaptive cognitive activity recognition method.
本发明还提出一种应用上述自适应认知活动识别方法的计算机可读存储介 质。The present invention also provides a computer-readable storage medium applying the above-mentioned adaptive cognitive activity recognition method.
根据本发明第一方面实施例的自适应认知活动识别方法,包括:The adaptive cognitive activity recognition method according to the embodiment of the first aspect of the present invention includes:
采集原始脑电数据;Collect raw EEG data;
对所述原始脑电数据进行处理,得到脑电信号;Processing the original EEG data to obtain EEG signals;
将所述脑电信号分别输入到状态过渡模型和奖励模型,分别得到脑电状态信息和奖励信息;Inputting the EEG signal into the state transition model and the reward model, respectively, to obtain EEG state information and reward information;
将所述脑电状态信息和所述奖励信息输入到强化选择注意模型,得出最佳注意区域信息;Inputting the EEG state information and the reward information into the enhanced selective attention model to obtain the best attention area information;
将所述最佳注意区域信息输入到奖励模型,得出分类识别结果。The best attention area information is input into the reward model to obtain the classification recognition result.
根据本发明实施例的自适应认知活动识别方法,至少具有如下有益效果:提出一种自动认知活动识别的通用框架,以促进各种认知应用领域的范围,包括运动想象识别和情绪识别。通过结合深度强化学习和注意机制来设计强化选择性注意模型,以自动提取健壮且独特的深度特征;以鼓励模型选择能够达到最高分类精度的最佳注意区域;此外,根据认知活动识别环境定制状态和动作;还利用奖励模型进行选定的原始脑电数据进行分类,实现了比传统方法更高的识别准确性,并且延迟较低。The adaptive cognitive activity recognition method according to the embodiment of the present invention has at least the following beneficial effects: a general framework for automatic cognitive activity recognition is proposed to promote the scope of various cognitive application fields, including motor image recognition and emotion recognition . Design a reinforced selective attention model by combining deep reinforcement learning and attention mechanism to automatically extract robust and unique deep features; to encourage the model to select the best attention area that can achieve the highest classification accuracy; in addition, it is customized according to the cognitive activity recognition environment State and action; The reward model is also used to classify the selected original EEG data, which achieves higher recognition accuracy than traditional methods and lower delay.
根据本发明的一些实施例,所述对所述原始脑电数据进行处理,得到脑电信号,包括:According to some embodiments of the present invention, the processing the original EEG data to obtain EEG signals includes:
对所述原始脑电数据进行复制和洗牌处理,得到组合脑电数据;Copying and shuffling the original EEG data to obtain combined EEG data;
对所述组合脑电数据进行选取,得到脑电信号。The combined EEG data is selected to obtain EEG signals.
根据本发明的一些实施例,所述将所述脑电状态信息和所述奖励信息输入到强化选择注意模型,得出最佳注意区域信息,包括:According to some embodiments of the present invention, the inputting the EEG state information and the reward information into the enhanced selection attention model to obtain the best attention area information includes:
通过强化选择注意模型接收到所述脑电状态信息和所述奖励信息,得出脑电评估信息;The EEG state information and the reward information are received through the enhanced selective attention model to obtain EEG evaluation information;
将所述脑电评估信息反馈到所述状态过渡模型以驱使所述状态过渡模型进行脑电状态信息转换,直至所述强化选择注意模型得出最佳注意区域信息。The EEG evaluation information is fed back to the state transition model to drive the state transition model to perform EEG state information conversion until the enhanced selection attention model obtains the best attention area information.
根据本发明的一些实施例,所述奖励模型包括卷积映射网络和分类器。According to some embodiments of the present invention, the reward model includes a convolutional mapping network and a classifier.
根据本发明的一些实施例,所述将所述最佳注意区域信息输入到奖励模型,得出分类识别结果,包括:According to some embodiments of the present invention, the inputting the best attention area information into the reward model to obtain the classification recognition result includes:
所述最佳注意区域信息输入到卷积映射网络,得出空间依赖特征;The best attention area information is input to the convolutional mapping network to obtain the spatial dependence feature;
所述空间依赖特征输入到分类器以得到分类识别结果。The spatially dependent features are input to the classifier to obtain classification recognition results.
根据本发明的一些实施例,所述卷积映射网络包括输入层、卷积层、完全连接层、提取特征层和输出层,所述输入层、所述卷积层、所述完全连接层、所述提取特征层和所述输出层依次连接。According to some embodiments of the present invention, the convolutional mapping network includes an input layer, a convolutional layer, a fully connected layer, an extraction feature layer, and an output layer. The input layer, the convolutional layer, the fully connected layer, The extracted feature layer and the output layer are sequentially connected.
根据本发明第二方面实施例的自适应认知活动识别装置,包括:An adaptive cognitive activity recognition device according to an embodiment of the second aspect of the present invention includes:
采集单元,用于采集原始脑电数据;The collection unit is used to collect the original EEG data;
处理单元,用于对所述原始脑电数据进行处理,得到脑电信号;The processing unit is used to process the original EEG data to obtain EEG signals;
检测单元,用于将所述脑电信号分别输入到状态过渡模型和奖励模型,分别得到脑电状态信息和奖励信息;The detection unit is configured to input the EEG signal into the state transition model and the reward model respectively to obtain EEG state information and reward information;
筛选单元,用于将所述脑电状态信息和所述奖励信息输入到强化选择注意模型,得出最佳注意区域信息;A screening unit, configured to input the EEG state information and the reward information into the enhanced selective attention model to obtain the best attention area information;
识别单元,用于将所述最佳注意区域信息输入到奖励模型,得出分类识别结果。The recognition unit is used to input the best attention area information into the reward model to obtain a classification recognition result.
根据本发明的一些实施例,所述处理单元包括:According to some embodiments of the present invention, the processing unit includes:
复制单元,用于对所述原始脑电数据进行复制处理;The copying unit is used for copying the original EEG data;
洗牌单元,用于对经过所述复制单元处理后的原始脑电数据进行洗牌处理,得出组合脑电数据;The shuffling unit is used for shuffling the original EEG data processed by the copying unit to obtain combined EEG data;
选取单元,用于对所述组合脑电数据进行选取,得到脑电信号。The selecting unit is used to select the combined EEG data to obtain EEG signals.
根据本发明的一些实施例,所述检测单元包括:According to some embodiments of the present invention, the detection unit includes:
状态过渡单元,用于将脑电信号输入到状态过渡模型,得到脑电状态信息;The state transition unit is used to input the EEG signal into the state transition model to obtain EEG state information;
奖励单元,用于将脑电信号输入到奖励模型,得到奖励信息。The reward unit is used to input brain electrical signals into the reward model to obtain reward information.
根据本发明实施例的自适应认知活动识别装置,至少具有如下有益效果:通过结合深度强化学习和注意机制来设计强化选择性注意模型,以自动提取健壮且 独特的深度特征;以鼓励模型选择能够达到最高分类精度的最佳注意区域;此外,根据认知活动识别环境定制状态和动作;还利用奖励模型进行选定的原始脑电数据进行分类,实现了比传统方法更高的识别准确性,并且延迟较低。The adaptive cognitive activity recognition device according to the embodiment of the present invention has at least the following beneficial effects: a reinforced selective attention model is designed by combining deep reinforcement learning and an attention mechanism to automatically extract robust and unique deep features; to encourage model selection The best attention area that can achieve the highest classification accuracy; in addition, the state and action are customized according to the cognitive activity recognition environment; the reward model is also used to classify the selected original EEG data, which achieves higher recognition accuracy than traditional methods , And the latency is low.
根据本发明第三方面实施例的计算机可读存储介质,能够应用根据本发明上述第一方面实施例的自适应认知活动识别方法。According to the computer-readable storage medium of the embodiment of the third aspect of the present invention, the adaptive cognitive activity recognition method according to the embodiment of the first aspect of the present invention can be applied.
根据本发明实施例的计算机可读存储介质,至少具有如下有益效果:通过结合深度强化学习和注意机制来设计强化选择性注意模型,以自动提取健壮且独特的深度特征;以鼓励模型选择能够达到最高分类精度的最佳注意区域;此外,根据认知活动识别环境定制状态和动作;还利用奖励模型进行选定的原始脑电数据进行分类,实现了比传统方法更高的识别准确性,并且延迟较低。The computer-readable storage medium according to the embodiment of the present invention has at least the following beneficial effects: a reinforced selective attention model is designed by combining deep reinforcement learning and an attention mechanism to automatically extract robust and unique deep features; to encourage model selection to achieve The best attention area with the highest classification accuracy; in addition, the state and action are customized according to the recognition environment of cognitive activities; the reward model is also used to classify the selected original EEG data, which achieves higher recognition accuracy than traditional methods, and The latency is low.
本发明的附加方面和优点将在下面的描述中部分给出,部分将从下面的描述中变得明显,或通过本发明的实践了解到。The additional aspects and advantages of the present invention will be partially given in the following description, and some will become obvious from the following description, or be understood through the practice of the present invention.
附图说明Description of the drawings
本发明的上述和/或附加的方面和优点从结合下面附图对实施例的描述中将变得明显和容易理解,其中:The above and/or additional aspects and advantages of the present invention will become obvious and easy to understand from the description of the embodiments in conjunction with the following drawings, in which:
图1为本发明实施例一的自适应认知活动识别方法的流程图;FIG. 1 is a flowchart of an adaptive cognitive activity recognition method according to Embodiment 1 of the present invention;
图2为本发明实施例一的自适应认知活动识别方法中的对原始脑电数据进行处理的工作流程图;2 is a working flow chart of processing raw EEG data in the adaptive cognitive activity recognition method according to the first embodiment of the present invention;
图3为本发明实施例一的自适应认知活动识别方法中的筛选最佳注意区域的工作流程图;FIG. 3 is a working flow chart of screening the best attention area in the adaptive cognitive activity recognition method according to the first embodiment of the present invention; FIG.
图4为本发明实施例一的自适应认知活动识别方法中的得到分类识别结果的工作流程图;FIG. 4 is a work flow chart of obtaining classification and recognition results in the adaptive cognitive activity recognition method according to the first embodiment of the present invention;
图5为本发明实施例二的自适应认知活动识别装置的装置结构示意图;5 is a schematic diagram of the device structure of an adaptive cognitive activity recognition device according to the second embodiment of the present invention;
图6为本发明实施例一的自适应认知活动识别方法的详细流程图。Fig. 6 is a detailed flowchart of the method for adaptive cognitive activity recognition according to the first embodiment of the present invention.
具体实施方式Detailed ways
下面详细描述本发明的实施例,所述实施例的示例在附图中示出,其中自始至终相同或类似的标号表示相同或类似的元件或具有相同或类似功能的元件。下面通过参考附图描述的实施例是示例性的,仅用于解释本发明,而不能理解为对本发明的限制。The embodiments of the present invention are described in detail below. Examples of the embodiments are shown in the accompanying drawings, in which the same or similar reference numerals indicate the same or similar elements or elements with the same or similar functions. The embodiments described below with reference to the drawings are exemplary, and are only used to explain the present invention, but should not be construed as limiting the present invention.
本发明的描述中,除非另有明确的限定,设置、连接等词语应做广义理解,所属技术领域技术人员可以结合技术方案的具体内容合理确定上述词语在本发明中的具体含义。In the description of the present invention, unless otherwise clearly defined, words such as setting and connection should be understood in a broad sense, and those skilled in the art can reasonably determine the specific meaning of the above words in the present invention in combination with the specific content of the technical solution.
实施例一Example one
参照图1,本发明实施例一提供了一种自适应认知活动识别方法,其中的一种实施例包括但不限于以下步骤:1, Embodiment 1 of the present invention provides an adaptive cognitive activity recognition method, one of which includes but is not limited to the following steps:
步骤S100,采集原始脑电数据。Step S100, collecting raw brain electricity data.
在本实施例中,本步骤采集原始脑电数据,为了后续的自适应认知活动识别做好准备,提供了采集数据基础。In this embodiment, the original EEG data is collected in this step to prepare for subsequent adaptive cognitive activity recognition and provide a basis for collecting data.
步骤S200,对所述原始脑电数据进行处理,得到脑电信号。Step S200, processing the original EEG data to obtain EEG signals.
在本实施例中,本步骤对收集到的原始脑电数据进行整理处理,进一步为了自适应认知活动识别做好准备。In this embodiment, this step organizes and processes the collected raw EEG data, and further prepares for adaptive cognitive activity recognition.
步骤S300,将所述脑电信号分别输入到状态过渡模型和奖励模型,分别得到脑电状态信息和奖励信息。In step S300, the EEG signals are input into the state transition model and the reward model respectively, and the EEG state information and reward information are obtained respectively.
在本实施例中,本步骤状态过渡模型用于选择脑电信号中的某一行为,得到脑电状态信息,以此同时,奖励模型还对该行为进行奖励分数评估这一行为的质量,得到奖励信息,从而能够为选出最佳注意区域信息做好前提准备。In this embodiment, the state transition model in this step is used to select a certain behavior in the EEG signal to obtain EEG state information. At the same time, the reward model also evaluates the quality of the behavior by rewarding the behavior. Reward information, so as to prepare the prerequisites for selecting the best attention area information.
步骤S400,将所述脑电状态信息和所述奖励信息输入到强化选择注意模型,得出最佳注意区域信息。Step S400, input the EEG state information and the reward information into the enhanced selective attention model to obtain the best attention area information.
在本实施例中,本步骤中的强化选择注意模型能够对脑电状态信息和奖励信息进行评估,当发现不为最佳注意区域信息的时候,就会反馈到状态过渡模型,使得状态过渡模型重新选择脑电信号中的另外一个注意区域,重新进行评估,直 至得出最佳注意区域信息。In this embodiment, the enhanced selected attention model in this step can evaluate EEG state information and reward information. When it is found that it is not the best attention area information, it will be fed back to the state transition model, making the state transition model Reselect another attention area in the EEG signal and re-evaluate until the best attention area information is obtained.
步骤S500,将所述最佳注意区域信息输入到奖励模型,得出分类识别结果。Step S500: Input the best attention area information into the reward model to obtain a classification and recognition result.
在本实施例中,本步骤将得出的最佳注意区域信息输入到奖励模型中,通过奖励模型得出对原始脑电数据分类识别结果。In this embodiment, the best attention area information obtained in this step is input into the reward model, and the classification and recognition result of the original EEG data is obtained through the reward model.
参照图2,本实施例的步骤S200中,可以包括但不限于以下步骤:Referring to FIG. 2, step S200 in this embodiment may include but is not limited to the following steps:
步骤S210,对所述原始脑电数据进行复制和洗牌处理,得到组合脑电数据。In step S210, the original EEG data is copied and shuffled to obtain combined EEG data.
在本实施例中,本步骤为了提供尽可能多的信息,通过复制和洗牌处理,能够提供特征维度的更多潜在空间组合,为了后续的检测做好准备。In this embodiment, in this step, in order to provide as much information as possible, through copying and shuffling processing, more potential space combinations of feature dimensions can be provided, so as to prepare for subsequent detection.
步骤S220,对所述组合脑电数据进行选取,得到脑电信号。In step S220, the combined EEG data is selected to obtain EEG signals.
在本实施例中,本步骤对经过复制洗牌处理后的原始脑电数据进行选取,得到脑电信号,然后输入到状态过渡模型和奖励模型中进行原始脑电数据中的最佳注意区域选择。In this embodiment, this step selects the original EEG data that has been copied and shuffled to obtain the EEG signal, and then input it into the state transition model and the reward model to select the best attention area in the original EEG data .
参照图3,在本实施例的步骤S400中,可以包括但不限于以下步骤:Referring to FIG. 3, in step S400 of this embodiment, the following steps may be included but not limited to:
步骤S410,通过强化选择注意模型接收到所述脑电状态信息和所述奖励信息,得出脑电评估信息。In step S410, the EEG state information and the reward information are received through the enhanced selective attention model to obtain EEG evaluation information.
在本实施例中,本步骤强化选择注意模型接收到脑电状态信息和奖励信息,然后对脑电状态信息和奖励信息进行综合评估,奖励信息能够揭示输入的脑电信号的某一选定行为的质量,强化选择注意模型能够向状态过渡模型进行反馈,当选定的区域不是最佳注意区域的时候,就会驱使状态过渡模型重新选择另外一个注意区域。In this embodiment, the enhanced selective attention model in this step receives the EEG state information and reward information, and then comprehensively evaluates the EEG state information and reward information. The reward information can reveal a certain selected behavior of the input EEG signal The enhanced selection attention model can give feedback to the state transition model. When the selected area is not the best attention area, it will drive the state transition model to reselect another attention area.
步骤S420,将所述脑电评估信息反馈到所述状态过渡模型以驱使所述状态过渡模型进行脑电状态信息转换,直至所述强化选择注意模型得出最佳注意区域信息。In step S420, the EEG evaluation information is fed back to the state transition model to drive the state transition model to perform EEG state information conversion until the enhanced selection attention model obtains the best attention area information.
在本实施例中,本步骤强化选择注意模型能够向状态过渡模型进行反馈,当选定的区域不是最佳注意区域的时候,就会驱使状态过渡模型重新选择另外一个注意区域,直至强化选择注意模型选出最佳注意区域信息。In this embodiment, the enhanced selection attention model in this step can feed back to the state transition model. When the selected area is not the best attention area, the state transition model will be driven to reselect another attention area until the selected attention area is strengthened. The model selects the best attention area information.
参照图4,在本实施例的步骤S500中,可以包括但不限于以下步骤:Referring to FIG. 4, in step S500 of this embodiment, the following steps may be included but not limited to:
步骤S510,所述最佳注意区域信息输入到卷积映射网络,得出空间依赖特征。In step S510, the best attention area information is input to the convolutional mapping network to obtain the spatial dependence feature.
在本实施例中,本步骤将得到的最佳注意区域信息输入到卷积映射网络中,利用卷积映射网络进行空间依赖特征提取。In this embodiment, the best attention region information obtained in this step is input into the convolutional mapping network, and the convolutional mapping network is used to extract spatial dependent features.
步骤S520,所述空间依赖特征输入到分类器以得到分类识别结果。In step S520, the spatially dependent features are input to the classifier to obtain a classification recognition result.
在本实施例中,本步骤将上述所得到的空间依赖特征输入到分类器中,从而能够利用分类器进行自适应认知活动识别。In this embodiment, this step inputs the spatially dependent features obtained above into the classifier, so that the classifier can be used for adaptive cognitive activity recognition.
在本实施例中,奖励模型包括卷积映射网络和分类器,卷积映射网络能够进行空间依赖特征提取;分类器能够根据卷积映射网络提取的空间依赖特征进行认知活动识别。In this embodiment, the reward model includes a convolutional mapping network and a classifier. The convolutional mapping network can perform spatially dependent feature extraction; the classifier can perform cognitive activity recognition based on the spatially dependent feature extracted by the convolutional mapping network.
进一步,在本实施例中,所述卷积映射网络包括输入层、卷积层、完全连接层、提取特征层和输出层,所述输入层、所述卷积层、所述完全连接层、所述提取特征层和所述输出层依次连接。Further, in this embodiment, the convolutional mapping network includes an input layer, a convolutional layer, a fully connected layer, an extraction feature layer, and an output layer. The input layer, the convolutional layer, the fully connected layer, The extracted feature layer and the output layer are sequentially connected.
在本实施例中,强化选择注意模型包括了竞争DQN网络、完全连接层、值函数V、优势函数A和Q函数,竞争DQN网络和完全连接层连接,值函数V和优势函数A分别与完全连接层连接,值函数V和优势函数A还均与Q函数连接,实现强化选择注意机制。In this embodiment, the enhanced selective attention model includes a competitive DQN network, a fully connected layer, a value function V, an advantage function A and Q function, and a competitive DQN network and a fully connected layer are connected. The value function V and the advantage function A are connected with the completely The connection layer is connected, and the value function V and the advantage function A are also connected with the Q function to realize a strengthened selection attention mechanism.
下面以一个具体的实施例对自适应认知活动识别方法进行进一步的说明:The following uses a specific embodiment to further illustrate the adaptive cognitive activity recognition method:
参照图6,为了提供尽可能多的信息,设计了一种利用EEG信号之间的空间关系的方法。属于不同大脑活动的信号应该具有不同的空间依赖关系。将输入的EEG信号按照维度进行复制并混洗。在这种方法中,所有可能的维度布置都具有等概率的外观。Referring to Fig. 6, in order to provide as much information as possible, a method using the spatial relationship between EEG signals is designed. The signals belonging to different brain activities should have different spatial dependencies. Copy and shuffle the input EEG signal according to the dimensions. In this method, all possible dimensional arrangements have the appearance of equal probability.
假设输入的原始EEG数据用X={(x i,y i),i=1,2,...,I}表示,其中(x i,y i)表示单个EEG样本,I表示样本数。在每个样本中,特征x i={x ik,k=1,2,...,N},x i∈R N包含与N个EEG通道相对应的N个元素,而y i∈R表示相应的标签。x ik表示第i 个样本中的第k个维度值。 Suppose that the input original EEG data is represented by X={(x i , y i ), i=1, 2,..., I}, where (x i , y i ) represents a single EEG sample, and I represents the number of samples. In each sample, feature x i ={x ik ,k=1, 2,...,N}, x i ∈R N contains N elements corresponding to N EEG channels, and y i ∈R Indicates the corresponding label. x ik represents the k-th dimension value in the i-th sample.
在现实世界的收集场景中,EEG数据通常是根据生物医学EEG通道的分布进行连接的。但是,生物医学维度顺序可能不会表现出最佳的空间依赖性。排气方法在计算上过于昂贵,导致无法排出所有可能的维度布置。In real-world collection scenarios, EEG data is usually connected according to the distribution of biomedical EEG channels. However, the order of biomedical dimensions may not show the best spatial dependence. The exhaust method is computationally too expensive, which makes it impossible to exhaust all possible dimensional arrangements.
为了提供更多潜在的维度组合,提出了一种称为“复制和洗牌(RS)”的方法。RS是一种两步映射方法,可将x i映射到具有完整元素组合的更高维空间x′ iIn order to provide more potential combinations of dimensions, a method called "copy and shuffle (RS)" is proposed. RS is a two-step mapping method that can map x i to a higher-dimensional space x′ i with a complete element combination:
x i∈R N→x′ i∈R N′,N'>N x i ∈R N →x′ i ∈R N′ ,N'>N
在第一步(复制)中,将x i复制h=N'/N+1次。然后,得到一个新的向量,其长度为h*N,不小于N';在第二步(洗牌)中,随机的将复制的矢量在第一步中打乱,并截取第一个N'生成x′ i的元素。从理论上讲,与x i相比,x′ i包含更多不同的维度组合。注意,此RS操作仅对特定的输入数据集执行一次,以便为后续的强化学习提供稳定的环境。 In the first step (copy), x i is copied h=N'/N+1 times. Then, get a new vector whose length is h*N, not less than N'; in the second step (shuffle), randomly shuffle the copied vector in the first step, and intercept the first N 'generates x' element i. In theory, compared with x i , x'i contains more different combinations of dimensions. Note that this RS operation is only performed once on a specific input data set in order to provide a stable environment for subsequent reinforcement learning.
受到最佳空间关系仅取决于特征维子集这一事实的启发,引入了一个关注区域来关注特征维的片段。在这里,通过深度强化学习优化了注意区域,事实证明,该区域在策略学习中是稳定且表现良好的。Inspired by the fact that the best spatial relationship only depends on the feature dimension subset, a focus area is introduced to focus on the feature dimension segment. Here, the attention area is optimized through deep reinforcement learning, and it turns out that this area is stable and performs well in strategy learning.
特别是,旨在检测最佳维度组合,其中包括EEG信号之间最显著的空间依赖性。由于N'(x′ i的长度)太大且计算量巨大,无法平衡长度和信息内容,因此引入了注意机制,因为它的有效性已在最近的研究领域(例如语音识别)中得到证明。试图强调x′ i中的信息片段,并用
Figure PCTCN2020104558-appb-000001
表示该片段,这被称为注意区。令
Figure PCTCN2020104558-appb-000002
Figure PCTCN2020104558-appb-000003
表示注意区域的长度,该注意区域由所提出的算法自动学习。采用深度强化学习来发现最佳注意区域。
In particular, it aims to detect the best combination of dimensions, which includes the most significant spatial dependence between EEG signals. Since N '(x' i of length) and a huge amount of calculation is too large, the length and information content can not be balanced, so the introduction of the attention mechanism, since its effectiveness has been demonstrated in a recent study in the art (e.g. voice recognition). Try to emphasize the information fragments in x′ i, and use
Figure PCTCN2020104558-appb-000001
Indicates the segment, which is called the attention zone. make
Figure PCTCN2020104558-appb-000002
with
Figure PCTCN2020104558-appb-000003
Indicates the length of the attention area, which is automatically learned by the proposed algorithm. Use deep reinforcement learning to find the best attention area.
最佳关注区域的检测包括两个关键组件:环境(包括状态转换和奖励模型)和强化选择注意机制。在环境与主体之间的交互中交换了三个要素(状态s,行为a和奖励r)。这三个要素都是根据本研究的背景进行定制的。接下来,阐述深度强化学习结构的关键组件的设计:The detection of the best attention area includes two key components: environment (including state transition and reward model) and enhanced selective attention mechanism. Three elements (state s, behavior a, and reward r) are exchanged in the interaction between the environment and the subject. These three elements are all customized according to the background of this research. Next, explain the design of the key components of the deep reinforcement learning structure:
状态S={s t,t=0,1,...,T},s t∈R 2描述关注区域的位置,其中t表示时间。由于关注区域是1-D x′ i上的移动片段,因此我们设计了两个参数来定义状态:
Figure PCTCN2020104558-appb-000004
其中
Figure PCTCN2020104558-appb-000005
Figure PCTCN2020104558-appb-000006
分别表示注意区域的开始索引和结束索引。在训练中,将s 0初始化为
State S={s t ,t=0,1,...,T}, st ∈R 2 describes the location of the region of interest, where t represents time. Since the area of interest is a moving segment on 1-D x′ i , we design two parameters to define the state:
Figure PCTCN2020104558-appb-000004
in
Figure PCTCN2020104558-appb-000005
with
Figure PCTCN2020104558-appb-000006
Indicates the start index and end index of the attention area respectively. In training, initialize s 0 as
Figure PCTCN2020104558-appb-000007
Figure PCTCN2020104558-appb-000007
行为A={a t,t=0,1,...,T}∈R 4描述了强化选择注意机制可以选择对环境采取的行为。在时间标记t处,状态转换根据强化选择注意机制的策略π选择一个要实施的操作: Behavior A = {a t, t = 0,1, ..., T} ∈R 4 describes enhanced attention selection mechanism can select the behavior of the environment taken. At the time mark t, the state transition selects an operation to be implemented according to the strategy π that strengthens the selective attention mechanism:
s t+1=π(s t,a t) s t+1 =π(s t ,a t )
为注意区域定义了四类动作:向左(想象左手),向右(想象右手),向上(想象舌头)和向下(想象双脚)。对于每个动作,注意区域都会移动一个随机距离d∈[1,d u],其中d u是上限。对于向左移和向右移动作,注意区域会随着步骤d向左或向右移动。对于向上和向下操作,
Figure PCTCN2020104558-appb-000008
Figure PCTCN2020104558-appb-000009
都是移动d。最后,如果状态开始索引或结束索引超出边界,则执行裁剪操作。例如,如果
Figure PCTCN2020104558-appb-000010
(低于下边界0),我们将起始索引裁剪为
Figure PCTCN2020104558-appb-000011
Four types of actions are defined for the attention area: left (imagine left hand), right (imagine right hand), up (imagine tongue) and down (imagine feet). For each action, the attention area will move a random distance d∈[1,d u ], where d u is the upper limit. For moving left and right, note that the area will move left or right with step d. For up and down operations,
Figure PCTCN2020104558-appb-000008
with
Figure PCTCN2020104558-appb-000009
Both are mobile d. Finally, if the state start index or end index exceeds the boundary, a crop operation is performed. For example, if
Figure PCTCN2020104558-appb-000010
(Below the lower boundary 0), we crop the starting index to
Figure PCTCN2020104558-appb-000011
奖励R={r t,t=0,1,...,T}∈R是由奖励模型计算的。奖励模型Φ: The reward R={r t ,t=0,1,...,T}∈R is calculated by the reward model. Reward model Φ:
r t=Φ(s t) r t =Φ(s t )
接收当前状态并返回评估作为奖励。Receive the current status and return the evaluation as a reward.
奖励模型的目的是评估当前状态如何影响分类性能。凭直觉,导致更好的分类性能的状态应具有更高的回报:r t=F(s t)。我们将奖励模型F设置为卷积映射和分类的组合。由于在实际方法优化中,准确度越高,增加分类准确度就越困难。为了鼓励更高级别的准确性,我们设计了非线性奖励函数: The purpose of the reward model is to evaluate how the current state affects classification performance. Intuitively, the state leading to better classification performance should have a higher return: r t =F(s t ). We set the reward model F as a combination of convolutional mapping and classification. Because in the actual method optimization, the higher the accuracy, the more difficult it is to increase the classification accuracy. To encourage higher levels of accuracy, we designed a non-linear reward function:
Figure PCTCN2020104558-appb-000012
Figure PCTCN2020104558-appb-000012
其中acc表示分类准确性。该功能包括两部分;第一部分是具有指数acc∈[0,1]的归一化指数函数,这部分鼓励强化学习算法搜索更好的s t,从而获得更高的acc。指数函数的动机是:奖励的增长率随着准确性的提高而增加。第 二部分是注意区长度以保持更短的惩罚因子,β是惩罚系数。 Where acc means classification accuracy. This function consists of two parts; the first part is a normalized exponential function with exponent acc∈[0,1]. This part encourages the reinforcement learning algorithm to search for a better st to obtain a higher acc. The motivation of the exponential function is that the growth rate of the reward increases as the accuracy increases. The second part is to pay attention to the length of the zone to keep the penalty factor shorter, and β is the penalty factor.
总而言之,深度强化学习的目的是学习导致最大奖励的最佳注意区域
Figure PCTCN2020104558-appb-000013
选择机制总共迭代M=n e*n s次,其中n e和n s分别表示情节和步数。在状态转换中采用ε-贪心方法,该方法选择概率为1-ε的随机动作或根据概率为ε的最优Q函数
Figure PCTCN2020104558-appb-000014
选择一个动作行为。
All in all, the purpose of deep reinforcement learning is to learn the best attention area that leads to the greatest reward
Figure PCTCN2020104558-appb-000013
The selection mechanism iterates M=n e *n s times in total, where n e and n s represent the plot and the number of steps, respectively. In the state transition, the ε-greedy method is adopted, which selects random actions with a probability of 1-ε or the optimal Q function with a probability of ε
Figure PCTCN2020104558-appb-000014
Choose an action behavior.
Figure PCTCN2020104558-appb-000015
Figure PCTCN2020104558-appb-000015
其中ε'∈[0,1]是为每次迭代随机生成的,而
Figure PCTCN2020104558-appb-000016
是在A中随机选择的。
Where ε'∈[0,1] is randomly generated for each iteration, and
Figure PCTCN2020104558-appb-000016
It was randomly selected in A.
为了更好地收敛和更快地进行训练,ε随着迭代而逐渐增加。增量ε 0如下: In order to better converge and train faster, ε gradually increases with iterations. The increment ε 0 is as follows:
ε t+1=ε t0M ε t+1t0 M
竞争DQN(深层Q网络)被用作优化策略π(s t,a t),可以有效地学习状态值函数。我们采用对决DQN来发现最佳关注区域的主要原因是,它在每一步都会更新所有四个Q值,而其他策略在每一步只会更新一个Q值。Q函数会在采取该行并遵循最佳策略时衡量预期的未来奖励总和。特别是对于特定的步骤t,我们有: Competitive DQN (deep Q network) is used as an optimization strategy π(s t , a t ), which can effectively learn the state value function. The main reason why we use the duel DQN to find the best area of interest is that it updates all four Q values at each step, while other strategies only update one Q value at each step. The Q function measures the expected sum of future rewards when taking the line and following the best strategy. Especially for a specific step t, we have:
Figure PCTCN2020104558-appb-000017
Figure PCTCN2020104558-appb-000017
其中γ∈[0,1]是权衡立即和未来奖励的重要性的衰减参数,而n表示后续步骤的数量。当处于状态s时,值函数V(s t)估计预期奖励。Q函数与该对(s t,a t)相关,而value函数仅与s t相关。 Where γ∈[0,1] is a decay parameter that weighs the importance of immediate and future rewards, and n represents the number of subsequent steps. When in state s, the value function V( st ) estimates the expected reward. The Q function is related to the pair (s t , a t ), and the value function is only related to s t .
竞争DQN通过值函数V(s t)和优势函数A(s t,a t)学习Q函数,并通过以下公式进行组合 Competitive DQN learns the Q function through the value function V(s t ) and the advantage function A(s t , a t ), and combines them with the following formula
Q(s t,a t)=θV(s t)+θ'A(s t,a t) Q(s t ,a t )=θV(s t )+θ'A(s t ,a t )
其中θ,θ'∈Θ是对决DQN网络中的参数,并且会自动进行优化。上述公式是不可识别的,事实上,不能用给定的Q(s t,a t)唯一地恢复V(s t)和A(s t,a t)。为了解决这个问题,可以在选定的动作上强制优势函数等于零。也就是说,让网络实现前向映射: Among them θ, θ'∈ Θ are the parameters in the confrontation DQN network, and will be automatically optimized. The above formula is not recognized, in fact, can not be recovered uniquely V (s t) and A (s t, a t) given Q (s t, a t) . To solve this problem, the advantage function can be forced to be equal to zero on the selected action. In other words, let the network implement forward mapping:
Figure PCTCN2020104558-appb-000018
Figure PCTCN2020104558-appb-000018
因此,对于特定动作a *,如果 Therefore, for a specific action a * , if
Figure PCTCN2020104558-appb-000019
Figure PCTCN2020104558-appb-000019
然后有Then there is
Q(s t+1,a*)=V(s t) Q(s t+1 ,a*)=V(s t )
因此,值函数V(s t)被迫学习价值函数的估计,而另一方向产生优势函数的估计。 Therefore, the value function V( st ) is forced to learn the estimation of the value function, while the other direction produces the estimation of the advantage function.
为了评估Q函数,我们在第i次迭代中优化了以下成本函数:In order to evaluate the Q function, we optimized the following cost functions in the i-th iteration:
Figure PCTCN2020104558-appb-000020
Figure PCTCN2020104558-appb-000020
其中,in,
Figure PCTCN2020104558-appb-000021
Figure PCTCN2020104558-appb-000021
渐变更新方法是The gradient update method is
Figure PCTCN2020104558-appb-000022
Figure PCTCN2020104558-appb-000022
对于每个注意区域,进一步挖掘选定特征
Figure PCTCN2020104558-appb-000023
的潜在空间依赖性。由于只关注单个样本,因此EEG样本仅包含具有非常有限信息的数值向量,并且容易被噪声破坏。为了弥补这一缺陷,尝试通过CNN结构将EEG单个样本从原始空间
Figure PCTCN2020104558-appb-000024
映射到稀疏空间Γ∈R M
For each attention area, further mining selected features
Figure PCTCN2020104558-appb-000023
The potential spatial dependence of. Since only a single sample is concerned, EEG samples only contain numerical vectors with very limited information, and are easily corrupted by noise. In order to make up for this shortcoming, try to extract a single sample of EEG from the original space through the CNN structure
Figure PCTCN2020104558-appb-000024
Map to the sparse space Γ∈R M.
为了尽可能多地提取潜在的空间依赖性,使用了一个卷积层,该卷积层带有许多过滤器,以对学习注意区域
Figure PCTCN2020104558-appb-000025
进行扫描。卷积映射结构包含五层:输入层接收学习的注意区域,卷积层后跟一个完全连接层,以及输出层。将单热点实况与输出层进行比较,以计算训练损失。
In order to extract as many potential spatial dependencies as possible, a convolutional layer is used with many filters to learn the attention area
Figure PCTCN2020104558-appb-000025
Perform a scan. The convolutional mapping structure consists of five layers: the input layer receives the learned attention area, the convolutional layer is followed by a fully connected layer, and the output layer. Compare the single hot spot live with the output layer to calculate the training loss.
Relu非线性激活函数应用于卷积输出。将卷积层描述如下:The Relu nonlinear activation function is applied to the convolution output. The convolutional layer is described as follows:
Figure PCTCN2020104558-appb-000026
Figure PCTCN2020104558-appb-000026
其中
Figure PCTCN2020104558-appb-000027
表示卷积层的结果,而
Figure PCTCN2020104558-appb-000028
和W c分别表示滤波器的长度和滤波器的权重。池化层旨在减少卷积输出中的冗余信息,以降低计算成本。就我们而言,试图保留尽可能多的信息。因此,的方法不采用池化层。然后,在全连接层和输出层
in
Figure PCTCN2020104558-appb-000027
Represents the result of the convolutional layer, and
Figure PCTCN2020104558-appb-000028
And W c respectively represent the length of the filter and the weight of the filter. The pooling layer aims to reduce the redundant information in the convolution output to reduce the computational cost. As far as we are concerned, try to retain as much information as possible. Therefore, the method does not use a pooling layer. Then, in the fully connected layer and the output layer
Figure PCTCN2020104558-appb-000029
Figure PCTCN2020104558-appb-000029
Figure PCTCN2020104558-appb-000030
Figure PCTCN2020104558-appb-000030
其中W f,W o,b f,b o分别表示相应的权重和偏差。y'表示预测标签。代价函数通过交叉熵来度量,并且采用l 2-范数(带有参数λ)作为正则化以防止过度拟合: Among them, W f , W o , b f , and b o respectively represent the corresponding weights and deviations. y'represents the predicted label. The cost function is measured by cross entropy, and the l 2 -norm (with parameter λ) is used as regularization to prevent overfitting:
Figure PCTCN2020104558-appb-000031
Figure PCTCN2020104558-appb-000031
AdamOptimizer算法优化了成本函数。完全连接层提取特征,并将其输入到轻量级最近邻分类器中。卷积映射为N′次迭代更新。The AdamOptimizer algorithm optimizes the cost function. The fully connected layer extracts features and inputs them into the lightweight nearest neighbor classifier. The convolutional map is updated iteratively for N'times.
复制并打乱输入的原始EEG单个样本,以提供特征维度的更多潜在空间组合。然后,选择一个注意区。所选择的关注区域被输入到状态转变和奖励模型。在每个步骤t中,状态转换都会选择一个行为,以根据强化选择性机制的反馈更新。奖励模型通过奖励分数评估关注区域的质量。竞争DQN用于发现最佳注意区域,该注意区域将被馈送到卷积映射过程中以提取空间依赖表示。所表示的特征将用于分类。奖励模型是卷积映射和分类器的组合。Copy and scramble the input original EEG single sample to provide more potential spatial combinations of feature dimensions. Then, select an attention zone. The selected area of interest is input to the state transition and reward model. In each step t, the state transition will select a behavior to update according to the feedback of the enhanced selective mechanism. The reward model evaluates the quality of the area of interest through reward scores. The competitive DQN is used to find the best attention area, which will be fed into the convolutional mapping process to extract the spatial dependent representation. The features represented will be used for classification. The reward model is a combination of convolutional mapping and classifier.
该框架可直接处理原始EEG数据,而无需特征提取。此外,它可以针对不同的EEG数据自动选择可区分的特征维度,从而实现高可用性。方法不仅在很大程度上超越了几个最新的基线,而且在应对多个EEG信号通道和不完整的EEG信号方面显示出低延迟和高弹性方法适用于更广泛的应用场景,例如运动想象识别和情绪状态识别。The framework can directly process raw EEG data without feature extraction. In addition, it can automatically select distinguishable feature dimensions for different EEG data, thereby achieving high availability. The method not only surpasses several latest baselines to a large extent, but also shows low latency and high flexibility in dealing with multiple EEG signal channels and incomplete EEG signals. The method is suitable for a wider range of application scenarios, such as motor image recognition. And emotional state recognition.
通过上述方案可知,通过结合深度强化学习和注意机制来设计强化选择性注意模型,以自动提取健壮且独特的深度特征;以鼓励模型选择能够达到最高分类精度的最佳注意区域;此外,根据认知活动识别环境定制状态和动作;还利用奖励模型进行选定的原始脑电数据进行分类,实现了比传统方法更高的识别准确性,并且延迟较低。It can be seen from the above scheme that the enhanced selective attention model is designed by combining deep reinforcement learning and attention mechanism to automatically extract robust and unique deep features; to encourage the model to select the best attention area that can achieve the highest classification accuracy; in addition, according to the recognition Knowing activities recognize the customized state and actions of the environment; it also uses the reward model to classify the selected original EEG data, which achieves higher recognition accuracy than traditional methods and lower delay.
实施例二Example two
参照图5,本发明实施例二提供了一种自适应认知活动识别装置1000,包括:5, the second embodiment of the present invention provides an adaptive cognitive activity recognition device 1000, including:
采集单元1100,用于采集原始脑电数据;The collecting unit 1100 is used to collect raw brain electricity data;
处理单元1200,用于对所述原始脑电数据进行处理,得到脑电信号;The processing unit 1200 is configured to process the original EEG data to obtain EEG signals;
检测单元1300,用于将所述脑电信号分别输入到状态过渡模型和奖励模型,分别得到脑电状态信息和奖励信息;The detection unit 1300 is configured to input the EEG signal into the state transition model and the reward model, respectively, to obtain EEG state information and reward information respectively;
筛选单元1400,用于将所述脑电状态信息和所述奖励信息输入到强化选择注意模型,得出最佳注意区域信息;The screening unit 1400 is configured to input the EEG state information and the reward information into the enhanced selection attention model to obtain the best attention area information;
识别单元1500,用于将所述最佳注意区域信息输入到奖励模型,得出分类识别结果。The recognition unit 1500 is configured to input the best attention area information into the reward model to obtain a classification recognition result.
在本实施例中,处理单元1200包括:In this embodiment, the processing unit 1200 includes:
复制单元1210,用于对所述原始脑电数据进行复制处理;The copying unit 1210 is used for copying the original EEG data;
洗牌单元1220,用于对经过所述复制单元1210处理后的原始脑电数据进行洗牌处理,得出组合脑电数据;The shuffling unit 1220 is used for shuffling the original EEG data processed by the copying unit 1210 to obtain combined EEG data;
选取单元1230,用于对所述组合脑电数据进行选取,得到脑电信号。The selecting unit 1230 is used to select the combined EEG data to obtain EEG signals.
在本实施例中,检测单元1300包括:In this embodiment, the detection unit 1300 includes:
状态过渡单元1310,用于将脑电信号输入到状态过渡模型,得到脑电状态信息;奖励单元1320,用于将脑电信号输入到奖励模型,得到奖励信息。The state transition unit 1310 is used to input brain electrical signals into the state transition model to obtain brain electrical state information; the reward unit 1320 is used to input brain electrical signals into the reward model to obtain reward information.
在本实施例中,筛选单元1400包括:In this embodiment, the screening unit 1400 includes:
选择注意单元1410,即为强化选择注意模型,能够接收到所述脑电状态信息和所述奖励信息,得出脑电评估信息;然后将所述脑电评估信息反馈到所述状态过渡模型以驱使所述状态过渡模型进行脑电状态信息转换,直至所述强化选择注意模型得出最佳注意区域信息。The selected attention unit 1410, which is to strengthen the selected attention model, can receive the EEG state information and the reward information to obtain EEG evaluation information; and then feed back the EEG evaluation information to the state transition model to The state transition model is driven to perform EEG state information conversion until the enhanced selection attention model obtains the best attention area information.
在本实施例中,识别单元1500即为上述的奖励单元1320,能够进行自适应认知活动识别。In this embodiment, the recognition unit 1500 is the above-mentioned reward unit 1320, and can perform adaptive cognitive activity recognition.
需要说明的是,由于本实施例中的自适应认知活动识别装置与上述实施例一 中的自适应认识活动识别方法基于相同的发明构思,因此,方法实施例一中的相应内容同样适用于本系统实施例,此处不再详述。It should be noted that, since the adaptive cognitive activity recognition device in this embodiment and the adaptive cognitive activity recognition method in the first embodiment are based on the same inventive concept, the corresponding content in the first method embodiment is also applicable to This system embodiment will not be described in detail here.
通过上述方案可知,通过结合深度强化学习和注意机制来设计强化选择性注意模型,以自动提取健壮且独特的深度特征;以鼓励模型选择能够达到最高分类精度的最佳注意区域;此外,根据认知活动识别环境定制状态和动作;还利用奖励模型进行选定的原始脑电数据进行分类,实现了比传统方法更高的识别准确性,并且延迟较低。It can be seen from the above scheme that the enhanced selective attention model is designed by combining deep reinforcement learning and attention mechanism to automatically extract robust and unique deep features; to encourage the model to select the best attention area that can achieve the highest classification accuracy; in addition, according to the recognition Knowing activities recognize the customized state and actions of the environment; it also uses the reward model to classify the selected original EEG data, which achieves higher recognition accuracy than traditional methods and lower delay.
实施例三Example three
本发明实施例三还提供了一种计算机可读存储介质,所述计算机可读存储介质存储有自适应认知活动识别装置可执行指令,自适应认知活动识别装置可执行指令用于使自适应认知活动识别装置执行上述的自适应认知活动识别方法,例如,执行以上描述的图1中的方法步骤S100至S500,实现图5中的单元1000-1500的功能。The third embodiment of the present invention also provides a computer-readable storage medium that stores executable instructions of the adaptive cognitive activity recognition device, and the executable instructions of the adaptive cognitive activity recognition device are used to enable self The adaptive cognitive activity recognition device executes the above-mentioned adaptive cognitive activity recognition method, for example, executes the method steps S100 to S500 in FIG. 1 described above to realize the functions of the units 1000-1500 in FIG. 5.
在本说明书的描述中,参考术语“一个实施例”、“一些实施例”、“示意性实施例”、“示例”、“具体示例”、或“一些示例”等的描述意指结合该实施例或示例描述的具体特征、结构、材料或者特点包含于本发明的至少一个实施例或示例中。在本说明书中,对上述术语的示意性表述不一定指的是相同的实施例或示例。而且,描述的具体特征、结构、材料或者特点可以在任何的一个或多个实施例或示例中以合适的方式结合。In the description of this specification, the description with reference to the terms "one embodiment", "some embodiments", "exemplary embodiments", "examples", "specific examples", or "some examples" etc. means to incorporate the implementation The specific features, structures, materials, or characteristics described by the examples or examples are included in at least one embodiment or example of the present invention. In this specification, the schematic representations of the above-mentioned terms do not necessarily refer to the same embodiment or example. Moreover, the described specific features, structures, materials or characteristics can be combined in any one or more embodiments or examples in a suitable manner.
尽管已经示出和描述了本发明的实施例,本领域的普通技术人员可以理解:在不脱离本发明的原理和宗旨的情况下可以对这些实施例进行多种变化、修改、替换和变型,本发明的范围由权利要求及其等同物限定。Although the embodiments of the present invention have been shown and described, those of ordinary skill in the art can understand that various changes, modifications, substitutions and modifications can be made to these embodiments without departing from the principle and purpose of the present invention. The scope of the present invention is defined by the claims and their equivalents.

Claims (10)

  1. 一种自适应认知活动识别方法,其特征在于,包括:An adaptive cognitive activity recognition method, which is characterized in that it includes:
    采集原始脑电数据;Collect raw EEG data;
    对所述原始脑电数据进行处理,得到脑电信号;Processing the original EEG data to obtain EEG signals;
    将所述脑电信号分别输入到状态过渡模型和奖励模型,分别得到脑电状态信息和奖励信息;Inputting the EEG signal into the state transition model and the reward model, respectively, to obtain EEG state information and reward information;
    将所述脑电状态信息和所述奖励信息输入到强化选择注意模型,得出最佳注意区域信息;Inputting the EEG state information and the reward information into the enhanced selective attention model to obtain the best attention area information;
    将所述最佳注意区域信息输入到奖励模型,得出分类识别结果。The best attention area information is input into the reward model to obtain the classification recognition result.
  2. 根据权利要求1所述的一种自适应认知活动识别方法,其特征在于,所述对所述原始脑电数据进行处理,得到脑电信号,包括:An adaptive cognitive activity recognition method according to claim 1, wherein the processing the original EEG data to obtain EEG signals comprises:
    对所述原始脑电数据进行复制和洗牌处理,得到组合脑电数据;Copying and shuffling the original EEG data to obtain combined EEG data;
    对所述组合脑电数据进行选取,得到脑电信号。The combined EEG data is selected to obtain EEG signals.
  3. 根据权利要求1所述的一种自适应认知活动识别方法,其特征在于,所述将所述脑电状态信息和所述奖励信息输入到强化选择注意模型,得出最佳注意区域信息,包括:An adaptive cognitive activity recognition method according to claim 1, wherein said inputting said EEG state information and said reward information into an enhanced selective attention model to obtain best attention area information, include:
    通过强化选择注意模型接收到所述脑电状态信息和所述奖励信息,得出脑电评估信息;The EEG state information and the reward information are received through the enhanced selective attention model to obtain EEG evaluation information;
    将所述脑电评估信息反馈到所述状态过渡模型以驱使所述状态过渡模型进行脑电状态信息转换,直至所述强化选择注意模型得出最佳注意区域信息。The EEG evaluation information is fed back to the state transition model to drive the state transition model to perform EEG state information conversion until the enhanced selection attention model obtains the best attention area information.
  4. 根据权利要求1所述的一种自适应认知活动识别方法,其特征在于:所述奖励模型包括卷积映射网络和分类器。An adaptive cognitive activity recognition method according to claim 1, wherein the reward model includes a convolutional mapping network and a classifier.
  5. 根据权利要求4所述的一种自适应认知活动识别方法,其特征在于,所述将所述最佳注意区域信息输入到奖励模型,得出分类识别结果,包括:An adaptive cognitive activity recognition method according to claim 4, wherein said inputting said best attention area information into a reward model to obtain a classification recognition result comprises:
    所述最佳注意区域信息输入到卷积映射网络,得出空间依赖特征;The best attention area information is input to the convolutional mapping network to obtain the spatial dependence feature;
    所述空间依赖特征输入到分类器以得到分类识别结果。The spatially dependent features are input to the classifier to obtain classification recognition results.
  6. 根据权利要求4所述的一种自适应认知活动识别方法,其特征在于:所述卷积映射网络包括输入层、卷积层、完全连接层、提取特征层和输出层,所述输入层、所述卷积层、所述完全连接层、所述提取特征层和所述输出层依次连接。An adaptive cognitive activity recognition method according to claim 4, wherein the convolutional mapping network includes an input layer, a convolutional layer, a fully connected layer, an extraction feature layer, and an output layer, and the input layer , The convolutional layer, the fully connected layer, the extracted feature layer and the output layer are connected in sequence.
  7. 一种自适应认知活动识别装置,其特征在于,包括:An adaptive cognitive activity recognition device, which is characterized in that it comprises:
    采集单元,用于采集原始脑电数据;The collection unit is used to collect the original EEG data;
    处理单元,用于对所述原始脑电数据进行处理,得到脑电信号;The processing unit is used to process the original EEG data to obtain EEG signals;
    检测单元,用于将所述脑电信号分别输入到状态过渡模型和奖励模型,分别得到脑电状态信息和奖励信息;The detection unit is configured to input the EEG signal into the state transition model and the reward model respectively to obtain EEG state information and reward information;
    筛选单元,用于将所述脑电状态信息和所述奖励信息输入到强化选择注意模型,得出最佳注意区域信息;A screening unit, configured to input the EEG state information and the reward information into the enhanced selective attention model to obtain the best attention area information;
    识别单元,用于将所述最佳注意区域信息输入到奖励模型,得出分类识别结果。The recognition unit is used to input the best attention area information into the reward model to obtain a classification recognition result.
  8. 根据权利要求7所述的一种自适应认知活动识别装置,其特征在于,所述处理单元包括:An adaptive cognitive activity recognition device according to claim 7, wherein the processing unit comprises:
    复制单元,用于对所述原始脑电数据进行复制处理;The copying unit is used for copying the original EEG data;
    洗牌单元,用于对经过所述复制单元处理后的原始脑电数据进行洗牌处理,得出组合脑电数据;The shuffling unit is used for shuffling the original EEG data processed by the copying unit to obtain combined EEG data;
    选取单元,用于对所述组合脑电数据进行选取,得到脑电信号。The selecting unit is used to select the combined EEG data to obtain EEG signals.
  9. 根据权利要求7所述的一种自适应认知活动识别装置,其特征在于,所述检测单元包括:An adaptive cognitive activity recognition device according to claim 7, wherein the detection unit comprises:
    状态过渡单元,用于将脑电信号输入到状态过渡模型,得到脑电状态信息;The state transition unit is used to input the EEG signal into the state transition model to obtain EEG state information;
    奖励单元,用于将脑电信号输入到奖励模型,得到奖励信息。The reward unit is used to input brain electrical signals into the reward model to obtain reward information.
  10. 一种计算机可读存储介质,其特征在于:所述计算机可读存储介质存储有自适应认知活动识别装置可执行指令,自适应认知活动识别装置可执行指令用于使自适应认知活动识别装置执行如权利要求1至6任一所述的自适应认知 活动识别方法。A computer-readable storage medium is characterized in that: the computer-readable storage medium stores executable instructions of an adaptive cognitive activity recognition device, and the executable instructions of the adaptive cognitive activity recognition device are used to enable the adaptive cognitive activity The recognition device executes the adaptive cognitive activity recognition method according to any one of claims 1 to 6.
PCT/CN2020/104558 2020-05-25 2020-07-24 Self-adaptive cognitive activity recognition method and apparatus, and storage medium WO2021237917A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202010449674.5 2020-05-25
CN202010449674.5A CN111543988B (en) 2020-05-25 2020-05-25 Adaptive cognitive activity recognition method and device and storage medium

Publications (1)

Publication Number Publication Date
WO2021237917A1 true WO2021237917A1 (en) 2021-12-02

Family

ID=72005138

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2020/104558 WO2021237917A1 (en) 2020-05-25 2020-07-24 Self-adaptive cognitive activity recognition method and apparatus, and storage medium

Country Status (2)

Country Link
CN (1) CN111543988B (en)
WO (1) WO2021237917A1 (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115238835A (en) * 2022-09-23 2022-10-25 华南理工大学 Electroencephalogram emotion recognition method, medium and equipment based on double-space adaptive fusion
CN116392149A (en) * 2023-03-03 2023-07-07 南京左右脑医疗科技集团有限公司 Brain cognitive state recognition method, device and storage medium

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112732846A (en) * 2021-01-27 2021-04-30 深圳市科荣软件股份有限公司 Water affair operation analysis system, method, electronic equipment and storage medium

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102542283A (en) * 2010-12-31 2012-07-04 北京工业大学 Optimal electrode assembly automatic selecting method of brain-machine interface
CN104127179A (en) * 2014-04-13 2014-11-05 北京工业大学 Electroencephalogram (EEG) feature extraction method based on dominant electrode combination and empirical mode decomposition (EMD)
EP3143933B1 (en) * 2015-09-15 2018-08-22 BrainSigns s.r.l. Method for estimating a mental state, in particular a workload, and related apparatus
CN110353673A (en) * 2019-07-16 2019-10-22 西安邮电大学 A kind of brain electric channel selection method based on standard mutual information
WO2020023232A1 (en) * 2018-07-24 2020-01-30 Keane Christopher Multiple frequency neurofeedback brain wave training techniques, systems, and methods
CN111184511A (en) * 2020-02-04 2020-05-22 西安交通大学 Electroencephalogram signal classification method based on attention mechanism and convolutional neural network

Family Cites Families (21)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102499676B (en) * 2011-11-03 2014-01-29 北京工业大学 Effective time sequence and electrode recombination based electroencephalograph signal categorizing system and method
CN102722728B (en) * 2012-06-11 2014-05-28 杭州电子科技大学 Motion image electroencephalogram classification method based on channel weighting supporting vector
AU2013100576A4 (en) * 2013-04-23 2013-06-06 Huang, Xu PROF Human Identification with Electroencephalogram (EEG) for the Future Network Security
CN105094324B (en) * 2015-07-14 2018-02-23 南京航空航天大学 Brain states recognition methods based on right-hand man's Mental imagery EEG signals
CN105654063B (en) * 2016-01-08 2019-04-19 东南大学 Mental imagery brain power mode recognition methods based on the optimization of artificial bee colony time and frequency parameter
CN106886792B (en) * 2017-01-22 2020-01-17 北京工业大学 Electroencephalogram emotion recognition method for constructing multi-classifier fusion model based on layering mechanism
CN107016345A (en) * 2017-03-08 2017-08-04 浙江大学 A kind of demand model construction method applied to Product Conceptual Design
CN107184207B (en) * 2017-05-12 2020-06-05 河海大学常州校区 Channel optimization method for epilepsia EEG automatic detection based on sparse common spatial mode
KR102152785B1 (en) * 2017-11-27 2020-09-07 고려대학교 산학협력단 Apparatus and method for classifying motor imagery based on local channel region analysis using common spatial pattern
CN108182470A (en) * 2018-01-17 2018-06-19 深圳市唯特视科技有限公司 A kind of user identification method based on the recurrent neural network for paying attention to module
CN108509869A (en) * 2018-03-14 2018-09-07 中南大学 Feature set based on OpenBCI optimizes on-line training method
CN108681394B (en) * 2018-04-19 2021-03-16 北京工业大学 Electrode optimization method based on brain source imaging technology
CN109009098B (en) * 2018-07-18 2021-05-28 大连交通大学 Electroencephalogram signal feature identification method under motor imagery state
CN109124625B (en) * 2018-09-04 2021-07-20 大连理工大学 Driver fatigue state level grading method
US11141088B2 (en) * 2018-10-09 2021-10-12 Sony Corporation Electronic device for recognition of mental behavioral attributes based on deep neural networks
CN110244854A (en) * 2019-07-16 2019-09-17 湖南大学 A kind of artificial intelligence approach of multi-class eeg data identification
CN110251124B (en) * 2019-07-19 2022-02-18 太原理工大学 Method and system for determining effective brain network
CN110610168B (en) * 2019-09-20 2021-10-26 合肥工业大学 Electroencephalogram emotion recognition method based on attention mechanism
CN110876626B (en) * 2019-11-22 2022-08-12 兰州大学 Depression detection system based on optimal lead selection of multi-lead electroencephalogram
CN111091074B (en) * 2019-12-02 2023-04-25 杭州电子科技大学 Motor imagery electroencephalogram signal classification method of optimal region co-space mode
CN111134666B (en) * 2020-01-09 2021-05-11 中国科学院软件研究所 Emotion recognition method of multi-channel electroencephalogram data and electronic device

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102542283A (en) * 2010-12-31 2012-07-04 北京工业大学 Optimal electrode assembly automatic selecting method of brain-machine interface
CN104127179A (en) * 2014-04-13 2014-11-05 北京工业大学 Electroencephalogram (EEG) feature extraction method based on dominant electrode combination and empirical mode decomposition (EMD)
EP3143933B1 (en) * 2015-09-15 2018-08-22 BrainSigns s.r.l. Method for estimating a mental state, in particular a workload, and related apparatus
WO2020023232A1 (en) * 2018-07-24 2020-01-30 Keane Christopher Multiple frequency neurofeedback brain wave training techniques, systems, and methods
CN110353673A (en) * 2019-07-16 2019-10-22 西安邮电大学 A kind of brain electric channel selection method based on standard mutual information
CN111184511A (en) * 2020-02-04 2020-05-22 西安交通大学 Electroencephalogram signal classification method based on attention mechanism and convolutional neural network

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115238835A (en) * 2022-09-23 2022-10-25 华南理工大学 Electroencephalogram emotion recognition method, medium and equipment based on double-space adaptive fusion
CN116392149A (en) * 2023-03-03 2023-07-07 南京左右脑医疗科技集团有限公司 Brain cognitive state recognition method, device and storage medium

Also Published As

Publication number Publication date
CN111543988A (en) 2020-08-18
CN111543988B (en) 2021-06-08

Similar Documents

Publication Publication Date Title
WO2021237917A1 (en) Self-adaptive cognitive activity recognition method and apparatus, and storage medium
Balın et al. Concrete autoencoders: Differentiable feature selection and reconstruction
CN113693563B (en) Brain function network classification method based on hypergraph attention network
CN113421652A (en) Method for analyzing medical data, method for training model and analyzer
Gao et al. Fusion of medical images based on salient features extraction by PSO optimized fuzzy logic in NSST domain
CN112674720B (en) Alzheimer disease pre-judgment method based on 3D convolutional neural network
CN111967362B (en) Hypergraph feature fusion and ensemble learning human behavior identification method for wearable equipment
Rahman et al. Fuzzy unordered rules induction algorithm used as missing value imputation methods for k-mean clustering on real cardiovascular data
CN114391826A (en) Human characterization prediction method and device based on edge-driven graph neural network
Liu et al. BrainTGL: A dynamic graph representation learning model for brain network analysis
Pal Identification of paddy leaf diseases using a supervised neural network
Cao et al. Temporal graph representation learning for autism spectrum disorder brain networks
Yao et al. A robust approach to estimating vigilance from EEG with neural processes
CN111882045B (en) Brain time-space network decomposition method and system based on micronerve structure search
Angayarkanni Predictive analytics of chronic kidney disease using machine learning algorithm
Wu et al. Genetic-algorithm-based Convolutional Neural Network for Robust Time Series Classification with Unreliable Data.
Lu et al. Brain Effective Connectivity Learning with Deep Reinforcement Learning
Chen et al. Graph learning approaches for graph with noise: application to disease prediction in population graph
Yulita et al. Sequence-based sleep stage classification using conditional neural fields
Sundaramoorthy et al. Hybrid Optimization with Recurrent Neural Network-based Medical Image Processing for Predicting Interstitial Lung Disease
Hong et al. A novel and efficient neuro-fuzzy classifier for medical diagnosis
CN116130089B (en) Hypergraph neural network-based multi-mode depression detection system, device and medium
Zhang et al. Efficient and generalizable cross-patient epileptic seizure detection through a spiking neural network
CN116226629B (en) Multi-model feature selection method and system based on feature contribution
Kumar et al. Visual In-Context Learning for Few-Shot Eczema Segmentation

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 20937933

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 20937933

Country of ref document: EP

Kind code of ref document: A1