CN101894295A - A Method for Simulating Attention Shifts with Neural Networks - Google Patents

A Method for Simulating Attention Shifts with Neural Networks Download PDF

Info

Publication number
CN101894295A
CN101894295A CN2010101992275A CN201010199227A CN101894295A CN 101894295 A CN101894295 A CN 101894295A CN 2010101992275 A CN2010101992275 A CN 2010101992275A CN 201010199227 A CN201010199227 A CN 201010199227A CN 101894295 A CN101894295 A CN 101894295A
Authority
CN
China
Prior art keywords
neuron
attention
network
neural
image
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN2010101992275A
Other languages
Chinese (zh)
Other versions
CN101894295B (en
Inventor
段立娟
房法明
乔元华
王海丽
苗军
吴春鹏
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Hongzhou Culture Co ltd
Original Assignee
Beijing University of Technology
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing University of Technology filed Critical Beijing University of Technology
Priority to CN201010199227.5A priority Critical patent/CN101894295B/en
Publication of CN101894295A publication Critical patent/CN101894295A/en
Application granted granted Critical
Publication of CN101894295B publication Critical patent/CN101894295B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Landscapes

  • Image Analysis (AREA)

Abstract

本发明公开了一种用神经网络模拟注意转移的方法,包括视觉图像输入层,神经元振子网络振荡层,注意转移实现层。视觉图像输入层将灰度图像的灰度值输入到神经动力网络中,神经元振子网络振荡层,将神经网络中的每个振子根据FitzHugh-Nagumo模型建立起来的动力学系统模型耦合形成神经动力网络,注意转移实现层通过改变参数实现在已经同步的不同物体间实现注意转移,当前受到关注的物体神经发放的频率会增大。本发明立足于神经动力学系统,通过对FitzHugh-Nagumo模型的分析和改造形成对人眼注意转移视觉处理的简单模拟,对进一步研究人的视觉处理机制有重要的理论和现实意义。

Figure 201010199227

The invention discloses a method for simulating attention transfer by neural network, which includes a visual image input layer, a neuron oscillator network oscillation layer, and an attention transfer realization layer. The visual image input layer inputs the gray value of the grayscale image into the neural dynamic network, and the neuron oscillator network oscillation layer couples each oscillator in the neural network to the dynamic system model established according to the FitzHugh-Nagumo model to form neural dynamics In the network, the attention transfer implementation layer realizes the attention transfer between different synchronized objects by changing the parameters, and the frequency of the nerve firing of the object that is currently concerned will increase. Based on the neurodynamic system, the present invention forms a simple simulation of human attention shifting visual processing by analyzing and transforming the FitzHugh-Nagumo model, which has important theoretical and practical significance for further research on human visual processing mechanism.

Figure 201010199227

Description

用神经网络模拟注意转移的方法 A Method for Simulating Attention Shifts with Neural Networks

技术领域technical field

本发明涉及以神经动力学为基础的图像处理和神经网络技术,特别是涉及一种以神经动力学模型描述的神经元振子形成的神经动力网络,用于模拟简单人造灰度图像中人的注意转移的方法。The invention relates to image processing and neural network technology based on neural dynamics, in particular to a neural dynamic network formed by neuron oscillators described by a neural dynamics model, which is used to simulate human attention in simple artificial grayscale images method of transfer.

背景技术Background technique

视觉注意选择和视觉注意转移是保证生物系统以有限的处理能力完成任务的重要机制。从图像或者感受野中提取显著的特征,并把它们分成不同的区域,从中选择显著的区域是感知理解的根本任务。这种能力就是视觉理解中的视觉注意选择。当显著的区域被选择之后,由于视觉系统的适应性,注意力会从当前显著的区域转到下一个显著的区域,这是注意转移。图像处理中基于同步振荡的神经元网络被越来越广泛的应用,脉冲耦合的神经元模型已经获得了良好的成果,但是在视觉注意选择和视觉注意转移中的应用对当前的研究来说仍然是非常重要的Visual attention selection and visual attention shift are important mechanisms to ensure that biological systems complete tasks with limited processing power. Extracting salient features from images or receptive fields and dividing them into different regions, selecting salient regions is the fundamental task of perceptual understanding. This ability is the visual attentional selection in visual comprehension. When the salient area is selected, due to the adaptability of the visual system, the attention will shift from the current salient area to the next salient area, which is attention shift. In image processing, neuron networks based on synchronous oscillations are more and more widely used, and the pulse-coupled neuron model has achieved good results, but the application in visual attention selection and visual attention transfer is still unreliable for current research. is very important

人眼处理视觉图像时可以在很短的时间内轻松的进行感知和理解,并且实现从一个显著的区域转移到下一个显著的区域,尽管这一方面的研究甚多,然而研究者对其中的机理至今仍然知之甚少。研究者们提出了很多神经元振子网络模型模拟真实的人脑中神经元行为,其中1952年在大量实验的基础上由Hodgkin和Huxley提出的Hodgkin-Huxley模型(以下简称HH模型)是第一个被广泛引用的模型。FitzHugh-Nagumo(以下简称FHN)模型是从HH模型简化而来,而且被多个研究者证明能较好的模拟神经元的电位发放,是模拟人的视觉的重要模型。When the human eye processes visual images, it can easily perceive and understand in a short period of time, and realize the transfer from one salient area to the next salient area. The mechanism is still poorly understood. Researchers have proposed many neuron oscillator network models to simulate the behavior of neurons in the real human brain, among which the Hodgkin-Huxley model (hereinafter referred to as the HH model) proposed by Hodgkin and Huxley on the basis of a large number of experiments in 1952 is the first widely cited model. The FitzHugh-Nagumo (hereinafter referred to as FHN) model is simplified from the HH model, and it has been proved by many researchers that it can better simulate the potential firing of neurons, and is an important model for simulating human vision.

FHN模型描述了一个神经元由输入电流Iinput驱动的膜电位V和内部状态变量R之间的关系。V和R对时间t的变化率分别为:The FHN model describes the relationship between the membrane potential V and the internal state variable R of a neuron driven by the input current I input . The rate of change of V and R with respect to time t is:

dVdV dtdt == 1010 (( VV -- VV 33 33 -- αRαR ++ II inputinput )) -- -- -- (( 11 ))

dRd dtdt == 0.80.8 (( -- RR ++ βVβV ++ 1.51.5 ))

V 是神经元细胞膜两侧的电位差,称为膜电位,R是代表电位阈值的内部状态变量,Iinput是神经元接受到的输入电流,也是外界对此神经元的光刺激。这里10和0.8是V和R的时间常数的倒数,并且α>0,β>0分别描述了从R到V和从V到R的动作强度。V的时间常数是R的12.5倍,这反应了轴突中激活过程要比恢复过程快得多的事实。假设(V,R)是平衡点,只有当平衡点(V,R)是不稳定的时候,在其周围产生稳定的极限环。因此α和β的取值必须满足公式(2)V is the potential difference on both sides of the neuron cell membrane, called the membrane potential, R is the internal state variable representing the potential threshold, I input is the input current received by the neuron, and it is also the external light stimulus to the neuron. Here 10 and 0.8 are the reciprocals of the time constants of V and R, and α > 0, β > 0 describe the motion intensity from R to V and from V to R, respectively. The time constant of V is 12.5 times that of R, reflecting the fact that the activation process in axons is much faster than the recovery process. Assuming (V, R) is an equilibrium point, only when the equilibrium point (V, R) is unstable, a stable limit cycle is generated around it. Therefore, the values of α and β must satisfy the formula (2)

8(V2+αβ-1)<0(2)8(V 2 +αβ-1)<0(2)

或者公式(3)or formula (3)

8(V2+αβ-1)>08(V 2 +αβ-1)>0

-(10V2-9.2)>0(3)-(10V 2 -9.2)>0(3)

改变方程(1)中的参数α和β可以控制发放频率。α或者β的增加都会引起发放频率的增加,但当α或者β是一个相当小的值时(例如α=1,β=0.6或者α=0.3,β=2或者α=0.3,β=0.6),方程(1)不产生发放。The firing frequency can be controlled by changing the parameters α and β in equation (1). The increase of α or β will cause the increase of firing frequency, but when α or β is a relatively small value (such as α=1, β=0.6 or α=0.3, β=2 or α=0.3, β=0.6) , Equation (1) does not produce a release.

发明内容Contents of the invention

本发明的目的在于,通过提供一种用神经网络模拟注意转移的方法,应用改造的FHN模型形成神经动力网络模拟人眼在感受野中从一个显著性区域到另一个的过程,并构建简单感受野中注意转移系统,对单个神经元进行模型构建,并基于单个模型构造生物视觉神经网络。The purpose of the present invention is, by providing a method for simulating attention transfer with a neural network, applying the transformed FHN model to form a neural dynamic network to simulate the process of the human eye from one salient region to another in the receptive field, and construct a simple receptive field Attention transfer system, model construction of single neuron, and construct biological visual neural network based on single model.

本发明用神经网络模拟注意转移的方法,是采用以下技术手段实现的。The method for simulating attention transfer by neural network in the present invention is realized by adopting the following technical means.

步骤1、视觉图像输入层将灰度图像的灰度值输入到神经动力网络中,图像中的像素点与神经动力网络上的神经元振子存在一一对应关系,每个像素对应的神经动力学系统由FHN模型描述。Step 1. The visual image input layer inputs the grayscale value of the grayscale image into the neural dynamic network. There is a one-to-one correspondence between the pixels in the image and the neuron oscillators on the neural dynamic network, and the neural dynamics corresponding to each pixel The system is described by the FHN model.

步骤2、神经元振子网络振荡层,将神经网络中的每个振子根据FHN模型建立起来的动力学系统模型耦合形成神经动力网络,其中的第i行、第j列个神经元振子的膜电位V和电压阈值R对时间的变化率分别为;Step 2, neuron oscillator network oscillation layer, coupling each oscillator in the neural network with the dynamic system model established according to the FHN model to form a neural dynamic network, in which the membrane potential of the i-th row and j-th column neuron oscillator The rate of change of V and the voltage threshold R to time are respectively;

dVdV ii ,, jj dtdt == 1010 (( (( VV ii ,, jj ++ ΔΔ VV ii ,, jj )) -- (( VV ii ,, jj ++ ΔΔ VV ii ,, jj )) 33 33 -- αα RR ii ,, jj ++ II ii ,, jj )) dd RR ii ,, jj dtdt == 0.80.8 (( -- (( RR ii ,, jj ++ ΔΔ RR ii ,, jj )) ++ ββ VV ii ,, jj ++ 1.51.5 )) -- -- -- (( 44 ))

其中,V是神经元细胞膜两侧的电位差,称为膜电位,R是代表电位阈值的内部状态变量,以下的公式中V和R代表相同的含义,Ii,j表示第i行、第j列个神经元接受到的来自外界的光刺激,数值上等于灰度图像中的灰度值。Among them, V is the potential difference on both sides of the neuron cell membrane, called the membrane potential, and R is the internal state variable representing the potential threshold. In the following formulas, V and R represent the same meaning, and I i, j represent the i-th row, the The light stimulus received by the j neurons from the outside world is numerically equal to the gray value in the gray image.

步骤3、注意转移实现层根据以下公式(5)和(6)的定义在振荡中调整α和β,从而实现当前注意的物体对应神经元群发放频率升高,并对其他物体对应的神经元群产生抑制性信号,使其发放频率放缓,以此区别当前关注物体与其他物体。Step 3, the attention transfer implementation layer adjusts α and β in the oscillation according to the definition of the following formulas (5) and (6), so as to increase the firing frequency of the neuron group corresponding to the object currently paying attention, and increase the firing frequency of the neuron group corresponding to other objects. The swarm produces inhibitory signals that slow down its firing frequency, thereby distinguishing the currently focused object from other objects.

αα pp ,, qq (( ττ )) == αα pp ,, qq (( ττ -- 11 )) ++ hh 11 (( αα pp ,, qq (( ττ -- 11 )) )) Mm (( ττ )) ΣΣ ii ,, jj ∈∈ ΔΔ (( ττ )) II ii ,, jj ff 11 (( || || VV ii ,, jj -- VV pp ,, qq || || )) -- -- -- (( 55 ))

ββ pp ,, qq (( ττ )) == ββ pp ,, qq (( ττ -- 11 )) ++ hh 22 (( ββ pp ,, qq (( ττ -- 11 )) )) Mm (( ττ )) ΣΣ ii ,, jj ∈∈ ΔΔ (( ττ )) II ii ,, jj ff 22 (( || || VV ii ,, jj -- VV pp ,, qq || || )) -- -- -- (( 66 ))

其中,(p,q)表示图像中p行q列的像素,τ是至少有一个神经元在发放的时刻,τ-1是τ的前一个时刻,M(τ)是时刻τ在发放状态的神经元的数量,Δ(τ)是时刻τ在发放状态的神经元的集合,Ii,j表示第i行、第j列个神经元接受到的来自外界的光刺激,数值上等于灰度图像中的灰度值。Among them, (p, q) represents the pixels in the p row and q column in the image, τ is the moment when at least one neuron is firing, τ-1 is the previous moment of τ, and M(τ) is the time when τ is firing. The number of neurons, Δ(τ) is the set of neurons in firing state at time τ, I i, j represents the light stimulation from the outside world received by neurons in row i and column j, numerically equal to grayscale Grayscale values in the image.

所述的FHN模型为FitzHugh-Nagumo模型的简称。The FHN model is an abbreviation of the FitzHugh-Nagumo model.

前述的视觉图像输入层,在将灰度值输入到神经动力网络之前,进行了归一化,归一化之后的灰度值在[0,1]范围内。The aforementioned visual image input layer is normalized before the gray value is input to the neural dynamic network, and the gray value after normalization is within the range of [0, 1].

只有当平衡点(V,R)是不稳定的时候才会有稳定的极限环产生,α和β的取值必须满足公式(2)Only when the equilibrium point (V, R) is unstable will there be a stable limit cycle, and the values of α and β must satisfy the formula (2)

8(V2+αβ-1)<0(2)8(V 2 +αβ-1)<0(2)

或者公式(3)or formula (3)

8(V2+αβ-1)>08(V 2 +αβ-1)>0

-(10V2-9.2)>0°(3)-(10V 2 -9.2)>0°(3)

前述的神经元振子网络振荡层,在所述的公式(4)中(i,j)表示图像中的第i行、第j列,1≤i ≤M,1≤j≤N;The aforementioned neuron oscillator network oscillation layer, in the formula (4), (i, j) represents the i-th row and j-th column in the image, 1≤i≤M, 1≤j≤N;

其中,M和N是图像的宽和高;ΔVi,j和ΔRi,j代表周围神经元的影响,它们由以下式子定义:Among them, M and N are the width and height of the image; ΔV i,j and ΔR i,j represent the influence of surrounding neurons, which are defined by the following formula:

Δxi,j=γi-1,j-1;i,j(xi-1,j-1-xi,j)+γi-1,j;i,j(xi-1,j-xi,j)+Δxi , j = γ i-1, j-1; i, j (xi -1, j-1 -xi , j )+γ i-1, j; i, j (xi -1, j -x i, j )+

γi-1,j+1;i,j(xi-1,j+1-xi,j)+γi,j-1;i,j(xi,j-1-xi,j)+γ i-1, j+1; i, j (xi -1, j+1 -xi , j )+γ i, j-1; i, j (xi , j-1 -xi , j )+

γi,j+1;i,j(xi,j+1-xi,j)+γi+1,j-1;i,j(xi+1,j-1-xi,j)+γ i, j+1; i, j (xi , j+1 -xi , j )+γ i+1, j-1; i, j (xi +1, j-1 -xi , j )+

                                                                                            (7)...

γi+1,j;i,j(xi+1,j-xi,j)+γi+1,j+1;i,j(xi+1,j+1-xi,j)γ i+1, j; i, j (x i+1, j - x i, j )+γ i+1, j+1; i, j (x i+1, j+1 - x i, j )

其中

Figure BSA00000165340200041
in
Figure BSA00000165340200041

其中x表示V或者R。Where x represents V or R.

前述的注意转移实现层,在以下公式(9)和(10)中θ1>θ2>0In the aforementioned attention transfer implementation layer, θ 12 >0 in the following formulas (9) and (10)

hh 11 (( &alpha;&alpha; )) == &theta;&theta; 11 &alpha;&alpha; << &theta;&theta; &alpha;&alpha; &theta;&theta; 22 &alpha;&alpha; &GreaterEqual;&Greater Equal; &theta;&theta; &alpha;&alpha; -- -- -- (( 99 ))

hh 22 (( &beta;&beta; )) == &theta;&theta; 11 &beta;&beta; &GreaterEqual;&Greater Equal; &theta;&theta; &beta;&beta; &theta;&theta; 22 &beta;&beta; << &theta;&theta; &beta;&beta; -- -- -- (( 1010 ))

f1(x)=a1x+b1(a1>0,b1<0)(11)f 1 (x)=a 1 x+b 1 (a 1 >0, b 1 <0)(11)

f2(x)=a2x+b2(a2<0,b2>0)(12)f 2 (x)=a 2 x+b 2 (a 2 <0, b 2 >0)(12)

实施过程参数为a1=2,b1=-4,a2=-4,b2=2,θ1=0.1,θ2=0.01,θα=0.8,θβ=4。The implementation process parameters are a 1 =2, b 1 =-4, a 2 =-4, b 2 =2, θ 1 =0.1, θ 2 =0.01, θ α =0.8, θ β =4.

本发明一种用神经网络模拟注意转移的方法,与以往文献中使用FHN模型的方法及其应用领域相比,具有以下明显的优势和有益的效果:A kind of method of neural network simulation attention transfer of the present invention, compared with the method and its application field using FHN model in the past literature, has the following obvious advantages and beneficial effects:

1、神经元之间的耦合方式不同于以往文献。1. The coupling mode between neurons is different from the previous literature.

2、把FHN模型用于模拟人的注意转移机制。2. The FHN model is used to simulate the human attention shifting mechanism.

附图说明Description of drawings

图1注意转移系统结构示意图;Fig. 1 is a schematic structural diagram of attention transfer system;

图2为图像与神经动力网络对应示意图;Fig. 2 is a schematic diagram corresponding to an image and a neural dynamic network;

图3为输入图像示意图;Fig. 3 is a schematic diagram of an input image;

图4为本发明中方法对图3的依次关注物体图。Fig. 4 is a sequential attention object map of Fig. 3 by the method in the present invention.

具体实施方式Detailed ways

以下结合附图对本发明的具体实施方式加以说明。Specific embodiments of the present invention will be described below in conjunction with the accompanying drawings.

该系统分为三个部分,视觉图像输入层,神经元振子网络振荡层,注意转移实现层。三层之间的关系如图1注意转移系统结构示意图所示。The system is divided into three parts, the visual image input layer, the neuron oscillator network oscillation layer, and the attention transfer implementation layer. The relationship between the three layers is shown in Fig. 1, the structural diagram of the attention transfer system.

该系统首先将一幅视觉图像中的各个像素与神经动力网络中的所有神经元振子建立一对一关联,然后让这一组神经元振子产生振荡。由于亮度大的物体对人眼视网膜的光刺激更强烈,所以Iinput的增大导致发放频率的增大,而发放过程中会对其他区域产生抑制。The system first establishes a one-to-one association between each pixel in a visual image and all the neuron oscillators in the neurodynamic network, and then makes this group of neuron oscillators oscillate. Since objects with high brightness stimulate the retina of the human eye more strongly, the increase in I input leads to an increase in the firing frequency, and other areas will be inhibited during the firing process.

把单个的由FHN模型描述的神经元以矩阵的形式排列并进行模型重构,形成神经动力网络。新建立的模型中每一个神经元振子除边缘外都与其周边8领域内的神经元有耦合关系,边缘的神经元振子根据其相邻振子的不同分别与3个或者5个神经元振子耦合。神经动力网络中第i行、第j列的神经元振子的膜电位V和电压阈值R对时间的变化率分别为:Arrange the individual neurons described by the FHN model in the form of a matrix and reconstruct the model to form a neural dynamic network. In the newly established model, each neuron oscillator has a coupling relationship with the neurons in the surrounding 8 fields except the edge, and the neuron oscillators at the edge are coupled with 3 or 5 neuron oscillators according to the difference between the adjacent oscillators. The rate of change of the membrane potential V and the voltage threshold R of the neuron oscillator in the i-th row and j-th column of the neurodynamic network with respect to time are:

dVdV ii ,, jj dtdt == 1010 (( (( VV ii ,, jj ++ &Delta;&Delta; VV ii ,, jj )) -- (( VV ii ,, jj ++ &Delta;&Delta; VV ii ,, jj )) 33 33 -- &alpha;&alpha; RR ii ,, jj ++ II ii ,, jj )) dd RR ii ,, jj dtdt == 0.80.8 (( -- (( RR ii ,, jj ++ &Delta;&Delta; RR ii ,, jj )) ++ &beta;&beta; VV ii ,, jj ++ 1.51.5 )) -- -- -- (( 44 ))

方程中(i,j)表示图像中第i行、第j列的像素,也代表神经动力网络中第i行、第j列的神经元振子,1≤i≤M,1≤j≤N(M和N是图像的宽和高)。ΔVi,j和ΔRi,j代表周围8邻域的神经元的影响,它们由以下式子定义:In the equation (i, j) represents the pixel in the i-th row and j-th column in the image, and also represents the neuron oscillator in the i-th row and j-th column in the neural dynamic network, 1≤i≤M, 1≤j≤N( M and N are the width and height of the image). ΔV i,j and ΔR i,j represent the influence of neurons in the surrounding 8 neighborhoods, which are defined by the following formula:

Δxi,j=γi-1,j-1,i,j(xi-1,j-1-xi,j)+γi-1,j;i,j(xi-1,j-xi,j)+Δx i,ji-1,j-1,i,j (xi -1,j-1 -xi ,j )+γ i-1,j; i,j (xi -1,j -x i, j )+

γi-1,j+1;i,j(xi-1,j+1-xi,j)+γi,j-1;i,j(xi,j-1-xi,j)+γ i-1, j+1; i, j (xi -1, j+1 -xi , j )+γ i, j-1; i, j (xi , j-1 -xi , j )+

γi,j+1;i,j(xi,j+1-xi,j)+γi+1,j-1;i,j(xi+1,j-1-xi,j)+γ i, j+1; i, j (xi , j+1 -xi , j )+γ i+1, j-1; i, j (xi +1, j-1 -xi , j )+

                                                                                            (7)...

γi+1,j;i,j(xi+1,j-xi,j)+γi+1,j+1;i,j(xi+1,j+1-xi,j)γ i+1, j; i, j (x i+1, j - x i, j )+γ i+1, j+1; i, j (x i+1, j+1 - x i, j )

其中

Figure BSA00000165340200052
其中x表示V或者R。in
Figure BSA00000165340200052
Where x represents V or R.

为了实现对应显著物体的神经元更频繁的发放,其他物体的神经元以较低频率发放或者不发放这样的事实,我们首先让神经动力网络带固定参数α和β运行,直到对应相同区域的神经元同步,这也表明完成了分割任务。之后,不论何时任何神经元发放,它会对自己和其他神经元产生两种信号:对自己和一起发放的神经元产生兴奋性的信号,没有随它一起发放的是抑制性的信号。To realize the fact that neurons corresponding to salient objects fire more frequently and neurons on other objects fire less frequently or not, we first run the neural dynamics network with fixed parameters α and β until neurons corresponding to the same region Meta-sync, which also indicates the completion of the split task. Then, whenever any neuron fires, it generates two kinds of signals to itself and to other neurons: an excitatory signal to itself and to the neuron that fires with it, and an inhibitory signal to the neuron that doesn't fire with it.

系统运行时,我们可以控制参数α和β的改变,使得α的增加速度比β的减小速度大的多。这样最显著的区域将会跳到高频周期振荡相位,而其他区域会相对安静,注意也会转移到最显著的区域。当接受到注意之后,此区域会受到抑制以允许其他区域成为显著的。When the system is running, we can control the changes of parameters α and β, so that the increasing speed of α is much larger than the decreasing speed of β. In this way, the most prominent area will jump to the high-frequency periodic oscillation phase, while other areas will be relatively quiet, and attention will also be shifted to the most prominent area. After receiving attention, this area is suppressed to allow other areas to become salient.

从上面对方程(1)的分析我们可以得出,α和β可以控制神经元的活动性。From the above analysis of equation (1), we can conclude that α and β can control the activity of neurons.

一个神经元发放后会引起周边神经元α和β参数的改变,从而实现了注意的转移。其中α和β的变化由以下方程定义:After a neuron fires, it will cause changes in the α and β parameters of peripheral neurons, thereby realizing the transfer of attention. where the variation of α and β is defined by the following equations:

&alpha;&alpha; pp ,, qq (( &tau;&tau; )) == &alpha;&alpha; pp ,, qq (( &tau;&tau; -- 11 )) ++ hh 11 (( &alpha;&alpha; pp ,, qq (( &tau;&tau; -- 11 )) )) Mm (( &tau;&tau; )) &Sigma;&Sigma; ii ,, jj &Element;&Element; (( &tau;&tau; )) II ii ,, jj ff 11 (( || || VV ii ,, jj -- VV pp ,, qq || || )) -- -- -- (( 55 ))

&beta;&beta; pp ,, qq (( &tau;&tau; )) == &beta;&beta; pp ,, qq (( &tau;&tau; -- 11 )) ++ hh 22 (( &beta;&beta; pp ,, qq (( &tau;&tau; -- 11 )) )) Mm (( &tau;&tau; )) &Sigma;&Sigma; ii ,, jj &Element;&Element; &Delta;&Delta; (( &tau;&tau; )) II ii ,, jj ff 22 (( || || VV ii ,, jj -- VV pp ,, qq || || )) -- -- -- (( 66 ))

其中in

hh 11 (( &alpha;&alpha; )) == &theta;&theta; 11 &alpha;&alpha; << &theta;&theta; &alpha;&alpha; &theta;&theta; 22 &alpha;&alpha; &GreaterEqual;&Greater Equal; &theta;&theta; &alpha;&alpha; ,, &theta;&theta; 11 >> &theta;&theta; 22 >> 00 -- -- (( 99 ))

hh 22 (( &beta;&beta; )) == &theta;&theta; 11 &beta;&beta; &GreaterEqual;&Greater Equal; &theta;&theta; &beta;&beta; &theta;&theta; 22 &beta;&beta; << &theta;&theta; &beta;&beta; &theta;&theta; 11 >> &theta;&theta; 22 >> 00 -- -- -- (( 1010 ))

f1(x)=a1x+b1(a1>0,b1<0)(11)f 1 (x)=a 1 x+b 1 (a 1 >0, b 1 <0)(11)

f2(x)=a2x+b2(a2<0,b2>0)(12)f 2 (x)=a 2 x+b 2 (a 2 <0, b 2 >0)(12)

(p,q)表示图像中p行q列的像素,τ是至少有一个神经元在发放的时刻,M(τ)是时刻τ在发放状态的神经元的数量,Δ(τ)是时刻τ在发放状态的神经元的集合,Ii,j表示神经元接受到的来自外界的光刺激,数值上等于灰度图像中的灰度值。通过设置a1>0,b1<0和a2<0,b2>0,以及函数h1,h2,每个发放中的神经元(i,j)向(p,q)发放兴奋性或者抑制性的信号。(p, q) represents the pixel in row p and column q in the image, τ is the moment when at least one neuron is firing, M(τ) is the number of neurons firing at time τ, Δ(τ) is the moment τ The set of neurons in the firing state, I i, j represents the light stimulus received by the neuron from the outside world, which is numerically equal to the gray value in the gray image. By setting a 1 >0, b 1 <0 and a 2 <0, b 2 >0, and functions h 1 , h 2 , neuron (i, j) in each firing fires to (p, q) Sexual or inhibitory signals.

首先是建立视觉图像输入层。根据公式(1)和公式(4)确立的系统,建立神经动力网络,并将网络上的每个神经元与图像中的像素建立一一对应的关系,而图像的灰度值也成为神经动力系统的输入值,对应公式(1)中的Iinput。附图2为神经动力网络与灰度图像一一对应图,每一个小圆圈既代表图像中的一个像素,也代表图像对应的神经动力网络中的一个神经元,黑点代表可数个数的神经元,横线代表神经元之间的连接。The first is to build the visual image input layer. According to the system established by formula (1) and formula (4), establish a neural power network, and establish a one-to-one correspondence between each neuron on the network and the pixel in the image, and the gray value of the image also becomes the neural power The input value of the system corresponds to I input in formula (1). Attached Figure 2 is a one-to-one correspondence between the neural dynamic network and the grayscale image. Each small circle represents not only a pixel in the image, but also a neuron in the corresponding neural dynamic network of the image. Black dots represent countable Neurons, horizontal lines represent connections between neurons.

其次是神经元振子网络振荡层。通过对整个的神经动力网络中每个神经元振子用4阶Runge-Kutta法求解,实现同一物体内的振子达到同步振荡,而不同物体间的振子则去同步。The second is the neuron oscillator network oscillation layer. By using the fourth-order Runge-Kutta method to solve each neuron oscillator in the entire neural dynamic network, the oscillators in the same object can achieve synchronous oscillation, while the oscillators between different objects are desynchronized.

最后是注意转移实现层。根据公式(5)、公式(6)中的定义改变α和β,结果是按照显著性的大小图像中的物体依次选出,选出的物体被注意到时对应神经振子的发放频率会变大。The last is the attention transfer implementation layer. Change α and β according to the definition in formula (5) and formula (6), the result is that the objects in the image are selected in order according to the size of the salience, and the firing frequency of the corresponding neural oscillator will become larger when the selected object is noticed .

接下来用一幅具体图像图3说明实施流程。图3是一幅灰度图,由4个目标组成,其中①是太阳,对应灰度值255,②是树,对应灰度值195,③是山峦,对应灰度值130,④是天空,对应灰度值65。首先将图3进行归一化操作,也就是将其上像素值取值范围从[0,255]变为[0,1]。将归一化后的视觉图像输入到建立好的动力学系统(4)中,建立好的动力学系统是指按照公式(4),(5),(6)建立起来的系统。接下来对(4)式利用4阶经典Runge-Kutta法进行求解,求解过程中参数α=1,β=1.25首先自行保持不变,直至神经动力网络中的局部区域中的神经元振子达到同步共振,在求解过程中求解结果V和R得到周期解。在注意转移实现层中使用的参数分别为a1=2,b1=-4,a2=-4,b2=2,θ1=0.1,θ2=0.01,θα=0.8,θβ=4。通过迭代求解,图3中4个物体对应的发放频率依照①②③④的顺序分别增大,在某个物体增大的同时抑制其他物体的发放频率,是其他3个物体的发放频率变缓,由此实现注意转移在①②③④4个物体间不断切换。在图3中依次选出的物体如图4依次关注物体图所示。Next, a specific image, Figure 3, is used to illustrate the implementation process. Figure 3 is a grayscale image consisting of 4 objects, among which ① is the sun, corresponding to a grayscale value of 255, ② is a tree, corresponding to a grayscale value of 195, ③ is a mountain, corresponding to a grayscale value of 130, ④ is the sky, Corresponds to a grayscale value of 65. Firstly, the normalization operation is performed on Figure 3, that is, the range of pixel values on it is changed from [0, 255] to [0, 1]. The normalized visual image is input into the established dynamic system (4), and the established dynamic system refers to the system established according to formulas (4), (5), and (6). Next, formula (4) is solved by the 4th-order classic Runge-Kutta method. During the solution process, the parameters α=1 and β=1.25 are kept unchanged until the neuron oscillators in the local area of the neural dynamical network are synchronized. Resonance, during the solution process, solve the results V and R to obtain a periodic solution. The parameters used in the attention transfer realization layer are a 1 =2, b 1 =-4, a 2 =-4, b 2 =2, θ 1 =0.1, θ 2 =0.01, θ α =0.8, θ β =4. Through iterative solution, the release frequencies corresponding to the four objects in Figure 3 increase in the order of ①②③④. When an object increases, the release frequency of other objects is suppressed, and the release frequency of the other 3 objects slows down. Realize attention transfer and switch continuously among ①②③④4 objects. The objects sequentially selected in Fig. 3 are shown in Fig. 4, which is followed by objects sequentially.

最后应说明的是:以上实施例仅用以说明本发明而并非限制本发明所描述的技术方案;因此,尽管本说明书参照上述的各个实施例对本发明已进行了详细的说明,但是,本领域的普通技术人员应当理解,仍然可以对本发明进行修改或等同替换;而一切不脱离发明的精神和范围的技术方案及其改进,其均应涵盖在本发明的权利要求范围当中。Finally, it should be noted that: the above embodiments are only used to illustrate the present invention rather than limit the technical solutions described in the present invention; Those of ordinary skill in the art should understand that the present invention can still be modified or equivalently replaced; and all technical solutions and improvements that do not depart from the spirit and scope of the invention should be covered by the claims of the present invention.

Claims (5)

1.一种用神经网络模拟注意转移的方法,其特征在于包括以下步骤:1. A method for simulating attention transfer with neural network, is characterized in that comprising the following steps: 步骤1、视觉图像输入层将灰度图像的灰度值输入到神经动力网络中,图像中的像素点与神经动力网络上的神经元振子存在一一对应关系,每个像素对应的神经动力学系统由FHN模型描述;Step 1. The visual image input layer inputs the grayscale value of the grayscale image into the neural dynamic network. There is a one-to-one correspondence between the pixels in the image and the neuron oscillators on the neural dynamic network, and the neural dynamics corresponding to each pixel The system is described by the FHN model; 步骤2、神经元振子网络振荡层,将神经网络中的每个振子根据FHN模型建立起来的动力学系统模型耦合形成神经动力网络,其中的第i行、第j列个神经元振子的膜电位V和电压阈值R对时间的变化率分别为:Step 2, neuron oscillator network oscillation layer, coupling each oscillator in the neural network with the dynamic system model established according to the FHN model to form a neural dynamic network, in which the membrane potential of the i-th row and j-th column neuron oscillator The rate of change of V and the voltage threshold R with respect to time are:
Figure FSA00000165340100011
Figure FSA00000165340100011
其中,V是神经元细胞膜两侧的电位差,称为膜电位,R是代表电位阈值的内部状态变量,以下的公式中V和R代表相同的含义,Ii,j表示神经元接受到的来自外界的光刺激,数值上等于灰度图像中的灰度值;Among them, V is the potential difference on both sides of the neuron cell membrane, called the membrane potential, and R is the internal state variable representing the potential threshold. In the following formulas, V and R represent the same meaning, and I i, j represent the neuron received The light stimulus from the outside is numerically equal to the gray value in the gray image; 步骤3、注意转移实现层根据以下公式(2)和(3)的定义在振荡中调整α和β,从而实现当前注意的物体对应神经元群发放频率升高,并对其他物体对应的神经元群产生抑制性信号,使其发放频率放缓,以此区别当前关注物体与其他物体;Step 3. The attention transfer implementation layer adjusts α and β in the oscillation according to the definition of the following formulas (2) and (3), so as to increase the firing frequency of the neuron group corresponding to the object currently paying attention, and to increase the firing frequency of the neuron group corresponding to other objects. The group produces inhibitory signals to slow down its emission frequency, so as to distinguish the current attention object from other objects;
Figure FSA00000165340100012
Figure FSA00000165340100012
Figure FSA00000165340100013
Figure FSA00000165340100013
其中,(p,q)表示图像中p行q列的像素,τ是至少有一个神经元在发放的时刻,τ-1是τ的前一个时刻,M(τ)是时刻τ在发放状态的神经元的数量,Δ(τ)是时刻τ在发放状态的神经元的集合,Ii,j表示第i行、第j列个神经元接受到的来自外界的光刺激,数值上等于灰度图像中的灰度值;Among them, (p, q) represents the pixels in the p row and q column in the image, τ is the moment when at least one neuron is firing, τ-1 is the previous moment of τ, and M(τ) is the time when τ is firing. The number of neurons, Δ(τ) is the set of neurons in firing state at time τ, I i, j represents the light stimulation from the outside world received by neurons in row i and column j, numerically equal to grayscale the grayscale value in the image; 所述的FHN模型为FitzHugh-Nagumo模型的简称。The FHN model is an abbreviation of the FitzHugh-Nagumo model.
2.根据权利要求1所述的用神经网络模拟注意转移的方法,其特征在于: 所述的视觉图像输入层,在将灰度值输入到神经动力网络之前,进行了归一化,归一化之后的灰度值在[0,1]范围内。2. the method for simulating attention shifting with neural network according to claim 1, is characterized in that: described visual image input layer, before grayscale value is input to neural power network, has carried out normalization, normalization The gray value after conversion is in the range of [0, 1]. 3.根据权利要求1所述的用神经网络模拟注意转移的方法,其特征在于:只有当平衡点(V,R)是不稳定的时候才会有稳定的极限环产生,α和β的取值必须满足公式(4)3. according to claim 1, use neural network simulation to pay attention to the method of shifting, it is characterized in that: only when equilibrium point (V, R) is unstable time just can have stable limit cycle to produce, the getting of α and β The value must satisfy formula (4) 8(V2+αβ-1)<0(4)8(V 2 +αβ-1)<0(4) 或者公式(5)or formula (5) 8(V2+αβ-1)>08(V 2 +αβ-1)>0 -(10V2-9.2)>0°(5)。-(10V 2 -9.2) > 0° (5). 4.根据权利要求1所述的用神经网络模拟注意转移的方法,其特征在于:所述的神经元振子网络振荡层,在所述的公式(1)中(i,j)表示图像中的第i行、第j列,1≤i≤M,1≤j≤N;4. the method for imitating attention with neural network according to claim 1 is characterized in that: described neuron oscillator network oscillation layer, in described formula (1), (i, j) represent in the image row i, column j, 1≤i≤M, 1≤j≤N; 其中,M和N是图像的宽和高;ΔVi,j和ΔRi,j代表周围神经元的影响,它们由以下式子定义:Among them, M and N are the width and height of the image; ΔV i,j and ΔR i,j represent the influence of surrounding neurons, which are defined by the following formula: Δxi,j=γi-1,j-1;i,j(xi-1,j-1-xi,j)+γi-1,j;i,j(xi-1,j-xi,j)+Δxi , j = γ i-1, j-1; i, j (xi -1, j-1 -xi , j )+γ i-1, j; i, j (xi -1, j -x i, j )+ γi-1,j+1;i,j(xi-1,j+1-xi,j)+γi,j-1;i,j(xi,j-1-xi,j)+γ i-1, j+1; i, j (xi -1, j+1 -xi , j )+γ i, j-1; i, j (xi , j-1 -xi , j )+ γi,j+1;i,j(xi,j+1-xi,j)+γi+1,j-1;i,j(xi+1,j-1-xi,j)+γ i, j+1; i, j (xi , j+1 -xi , j )+γ i+1, j-1; i, j (xi +1, j-1 -xi , j )+                                                                         (6)... γi+1,j;i,j(xi+1,j-xi,j)+γi+1,j+1;i,j(xi+1,j+1-xi,j)γ i+1, j; i, j (x i+1, j - x i, j )+γ i+1, j+1; i, j (x i+1, j+1 - x i, j ) 其中 
Figure FSA00000165340100021
其中x表示V或者R。
in
Figure FSA00000165340100021
Where x represents V or R.
5.根据权利要求1所述的用神经网络模拟注意转移的方法,其特征在于:所述的注意转移实现层,在以下公式(8)和(9)中θ1>θ2>05. the method for simulating attention transfer with neural network according to claim 1, is characterized in that: described attention transfer realizes layer, in following formula (8) and (9) θ 1 > θ 2 > 0
Figure FSA00000165340100023
Figure FSA00000165340100023
f1(x)=a1x+b1(a1>0,b1<0)(10)f 1 (x)=a 1 x+b 1 (a 1 >0, b 1 <0)(10) f2(x)=a2x+b2(a2<0,b2>0)(11) f 2 (x)=a 2 x+b 2 (a 2 <0, b 2 >0)(11) 实施过程参数为a1=2,b1=-4,a2=-4,b2=2,θ1=0.1,θ2=0.01,θα=0.8,θβ=4。 The implementation process parameters are a 1 =2, b 1 =-4, a 2 =-4, b 2 =2, θ 1 =0.1, θ 2 =0.01, θ α =0.8, θ β =4.
CN201010199227.5A 2010-06-04 2010-06-04 Method for simulating attention mobility by using neural network Active CN101894295B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201010199227.5A CN101894295B (en) 2010-06-04 2010-06-04 Method for simulating attention mobility by using neural network

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201010199227.5A CN101894295B (en) 2010-06-04 2010-06-04 Method for simulating attention mobility by using neural network

Publications (2)

Publication Number Publication Date
CN101894295A true CN101894295A (en) 2010-11-24
CN101894295B CN101894295B (en) 2014-07-23

Family

ID=43103482

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201010199227.5A Active CN101894295B (en) 2010-06-04 2010-06-04 Method for simulating attention mobility by using neural network

Country Status (1)

Country Link
CN (1) CN101894295B (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107179705A (en) * 2017-05-25 2017-09-19 江西理工大学 A kind of annular coupled oscillator system and method for realizing that explosion type is synchronous
CN107909151A (en) * 2017-07-02 2018-04-13 小蚁科技(香港)有限公司 Method and system for implementing an attention mechanism in an artificial neural network

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6504871B1 (en) * 1997-07-31 2003-01-07 Lsi Logic Corporation IDCT processor for use in decoding MPEG compliant video bitstreams meeting 2-frame and letterboxing requirements
CN101447077A (en) * 2008-12-18 2009-06-03 浙江大学 Edge detection method of color textile texture image oriented to textile industry
CN101587590A (en) * 2009-06-17 2009-11-25 复旦大学 Selective visual attention computation model based on pulse cosine transform

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6504871B1 (en) * 1997-07-31 2003-01-07 Lsi Logic Corporation IDCT processor for use in decoding MPEG compliant video bitstreams meeting 2-frame and letterboxing requirements
CN101447077A (en) * 2008-12-18 2009-06-03 浙江大学 Edge detection method of color textile texture image oriented to textile industry
CN101587590A (en) * 2009-06-17 2009-11-25 复旦大学 Selective visual attention computation model based on pulse cosine transform

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107179705A (en) * 2017-05-25 2017-09-19 江西理工大学 A kind of annular coupled oscillator system and method for realizing that explosion type is synchronous
CN107179705B (en) * 2017-05-25 2019-12-31 江西理工大学 A Ring-Coupled Oscillator System and Method for Realizing Explosive Synchronization
CN107909151A (en) * 2017-07-02 2018-04-13 小蚁科技(香港)有限公司 Method and system for implementing an attention mechanism in an artificial neural network
CN107909151B (en) * 2017-07-02 2020-06-02 小蚁科技(香港)有限公司 Method and system for implementing an attention mechanism in an artificial neural network

Also Published As

Publication number Publication date
CN101894295B (en) 2014-07-23

Similar Documents

Publication Publication Date Title
US20200342301A1 (en) Convolutional neural network on-chip learning system based on non-volatile memory
CN104809501B (en) A kind of computer system based on class brain coprocessor
WO2022078334A1 (en) Processing method for processing signals using neuron model and network, medium and device
US9014416B1 (en) Sensory processing apparatus and methods
CN109284812B (en) Video game simulation method based on improved DQN
WO2015020802A2 (en) Computed synapses for neuromorphic systems
Tanaka et al. Reservoir computing with diverse timescales for prediction of multiscale dynamics
CN109165730A (en) State quantifies network implementation approach in crossed array neuromorphic hardware
CN101894295B (en) Method for simulating attention mobility by using neural network
Câteau et al. Interplay between a phase response curve and spike-timing-dependent plasticity leading to wireless clustering
CN116080688B (en) A brain-inspired intelligent driving visual aid method, device and storage medium
CN103679670A (en) A PCNN multisource image fusion method based on an improved model
Wang et al. A reconfigurable graphene-based spiking neural network architecture
CN106681149A (en) Bumblebee robot abdomen swinging control method based on virtual reality and reinforcement learning
CN101505423B (en) Non-uniform sampling method for video image
CN101017535A (en) Implementation method of retina encoder using space time filter
CN116369945A (en) EEG cognitive recognition method based on 4D spiking neural network
Pashaie et al. Self-organization in a parametrically coupled logistic map network: A model for information processing in the visual cortex
CN101814180A (en) Simple gray-scale image segmentation method based on locally coupled neural oscillator network
Shao et al. Artificial sensory neurons and their applications
Oubari Precise timing and computationally efficient learning in neuromorphic systems
CN101751671A (en) Visual attention computing method and system based on bit entropy rate
CN114638286A (en) A visual channel coding method based on spiking neural network
Chen et al. Relativity modeling of work motivation and human error probability based on neural network
CN106444431A (en) Neural oscillation signal simulator

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant
TR01 Transfer of patent right

Effective date of registration: 20230918

Address after: Room 603, 6th Floor, Building 6, Yard A1, Guanghua Road, Tongzhou District, Beijing, 101113

Patentee after: Beijing Hongzhou Culture Co.,Ltd.

Address before: 100124 No. 100 Chaoyang District Ping Tian Park, Beijing

Patentee before: Beijing University of Technology

TR01 Transfer of patent right