WO2023082554A1

WO2023082554A1 - Adaptive network switching method and system, and storage medium

Info

Publication number: WO2023082554A1
Application number: PCT/CN2022/089236
Authority: WO
Inventors: 路永玲; 王真; 胡成博; 朱雪琼; 杨景刚; 贾骏; 孙蓉; 刘子全; 薛海; 张东磊
Original assignee: 国网江苏省电力有限公司电力科学研究院; 国网江苏省电力有限公司; 江苏省电力试验研究院有限公司; 国网智能电网研究院有限公司
Priority date: 2021-11-10
Filing date: 2022-04-26
Publication date: 2023-05-19
Also published as: CN114125962B; KR20230070405A; CN114125962A

Abstract

Disclosed in the present invention are a self-adaptive network switching method and system, and a storage medium. According to the present invention, a device service type and a device network standard are determined on the basis of a RBF neural network so as to determine a deep Q network used for switching, and the environmental state of the current heterogeneous network or network access point is taken as an input into the deep Q network to obtain an optimal network and an optimal access point. Therefore, false switching caused by the fact that a switching index of a traditional switching algorithm is undiversified is avoided, the number of instances of device switching is reduced, and switching is more rational.

Description

An adaptive network switching method, system and storage medium

technical field

The invention relates to an adaptive network switching method, system and storage medium, and belongs to the technical field of wireless networks.

Background technique

With the deployment of multiple wireless networks in various application scenarios in the future, heterogeneous wireless networks with multi-network integration and seamless roaming will become an inevitable trend. As the core of heterogeneous wireless network mobility management technology, network handover technology is one of the key technologies to ensure session continuity when devices move across heterogeneous networks, and has important research significance.

At present, traditional handover methods include Received Signal Strength (RSS) algorithm and Multi-criteria Decision Making (MCDM) algorithm; among them, the RSS algorithm will cause frequent handover of equipment, affecting the quality of user service. In MCDM algorithm, various decision-making criteria depend on each other and interact to influence the relative weight of handover decision criteria, which eventually leads to unreasonable handover.

Contents of the invention

The invention provides an adaptive network switching method, system and storage medium, which solves the problems of frequent equipment switching and unreasonable switching criteria caused by the traditional method.

In order to solve the problems of the technologies described above, the technical solution adopted in the present invention is:

A method for adaptive network switching, comprising:

Input the status information of the equipment in the network environment into the pre-trained RBF neural network to obtain the equipment business type and equipment network standard;

If the device network standard supports multiple network standards, the network environment state parameters are used as input, and the pre-trained first deep Q network and second deep Q network are used to obtain the selected network and the access point of the selected network, according to the selection Connected network and select the access point connected to the network to switch the network;

If the network standard of the device is a single network standard and the business type of the device is non-fixed business, the network environment status parameters are used as input, and the pre-trained second depth Q network is used to obtain the network access point for the selected connection. Access point for network switching.

The status information of the device includes the change value of the received power of the device, the change value of the received delay of the device and the inherent parameters of the device.

If the device network standard supports multiple network standards, use the pre-trained first deep Q network and second deep Q network to obtain the selected network and the access point of the selected network, according to the selected network and the selected connection network Access point for network switching, including:

If the network standard of the device supports multiple network standards, use the pre-trained first depth Q network to obtain the network to be selected for connection, and use the pre-trained second depth Q network to obtain the access point to select the network to connect to. and select the access point connected to the network to switch the network.

The input of the first deep Q network is the network environment state parameters, including:

The bandwidth, delay, bit error rate and jitter of each wireless network in the environment;

The device's demand matrix for network bandwidth, delay, bit error rate, and jitter.

The network switching reward in the first deep Q network is:

Among them, r ₁ is the network switching reward in the first deep Q network,

Select weights for networks with bandwidth when switching between networks,

Select weights for networks with delays in network switching,

Select the weight for the network for the bit error rate when the network is switched,

The network selection weight for the jitter during network switching, f _1B (S ₁ ,n) is the network selection revenue function of bandwidth during network switching, f _1τ (S ₁ ,n) is the network selection revenue function of network switching delay, f _1e (S ₁ ,n) is the network selection benefit function of bit error rate during network switching, f _1J (S ₁ ,n) is the network selection benefit function of jitter during network switching, and S ₁ is the network environment input to the first deep Q network State parameter set.

Among them, B _n is the bandwidth provided by wireless network n, τ _n is the delay provided by wireless network n, e _n is the bit error rate provided by wireless network n, J _n is the jitter provided by wireless network n, qB is the bandwidth required by wireless network n, qτ is the delay required by wireless network n, qe is the bit error rate required by wireless network n, and qJ is the jitter required by wireless network n.

The input of the second deep Q network is the network environment state parameters, including:

Bandwidth, delay, bit error rate and jitter of each access point in the network;

The device receives the receiving power of each access point;

The network switching reward in the second deep Q network is:

Among them, r ₂ is the network switching reward in the second deep Q network,

Select weights for networks with bandwidth when selecting access points,

Select weights for networks with delays when selecting access points,

Select the weight for the network for the bit error rate when selecting an access point,

Select the weight for the jitter network when selecting the access point, f _2B (S ₂ ,m) is the network selection revenue function of the bandwidth when selecting the access point, and f _2τ (S ₂ ,m) is the network delay when selecting the access point Select the income function, f _2e (S ₂ ,m) is the network selection income function of bit error rate when selecting the access point, f _2J (S ₂ ,m) is the network selection income function of jitter when selecting the access point, S ₂ is the network environment state parameter set input to the second deep Q network, P _m is the received power of access point m, and P _th is the sensitivity of the received power

Among them, B′ _m is the bandwidth provided by the access point m, τ′ _m is the delay provided by the access point m, e′ _m is the bit error rate provided by the access point m, and J′ _m is the access point The jitter provided by point m, qB' is the network bandwidth required by access point m, qτ' is the network delay required by access point m, qe' is the network bit error rate required by access point m, and qJ' is the network bandwidth required by access point m. Network jitter required by entry point m.

An adaptive network switching system, comprising:

RBF neural network module: Input the status information of the equipment in the network environment into the pre-trained RBF neural network to obtain the equipment business type and equipment network standard;

The first switching module: if the network standard of the device supports multiple network standards, the pre-trained first deep Q network and the second deep Q network are used to obtain the selected network and the access point of the selected network, according to the selected network And select the access point connected to the network to switch the network;

The second switching module: if the device network standard is a single network standard and the device business type is non-fixed business, use the pre-trained second depth Q network to obtain the selected network access point, and select the connected network access point to perform network switching.

The first switching module: if the network standard of the device supports multi-network standard, the pre-trained first deep Q network is used to obtain the network to be selected for connection, and the pre-trained second deep Q network is used to obtain the access point of the selected network, Network switching is performed according to the network selected for connection and the access point selected for connection to the network.

A computer-readable storage medium storing one or more programs including instructions that, when executed by a computing device, cause the computing device to perform an adaptive network switching method.

The beneficial effects achieved by the present invention: the present invention judges the service type of the equipment and the network standard of the equipment based on the RBF neural network, thereby determining the depth Q network used for switching, and using the current heterogeneous network or the environment state of the network access point as the depth Q network Input, obtain the optimal network and access point, avoid the wrong handover caused by the single handover index of the traditional handover algorithm, reduce the number of equipment handovers, and handover is more reasonable.

Description of drawings

Fig. 1 is the flowchart of the inventive method;

Figure 2 is a preset rule table;

Figure 3 Schematic diagram of loss function construction;

Figure 4 shows a heterogeneous wireless network scenario.

Detailed ways

The present invention will be further described below in conjunction with the accompanying drawings. The following examples are only used to illustrate the technical solution of the present invention more clearly, but not to limit the protection scope of the present invention.

As shown in Figure 1, a method for adaptive network handover includes the following steps:

Step 1. Input the status information of the equipment in the network environment into the pre-trained RBF neural network to obtain the equipment service type and equipment network standard;

Step 2, if the network standard of the device supports multiple network standards, the network environment state parameters are used as input, and the pre-trained first depth Q network and second depth Q network are used to obtain the network to be connected and the access point to be connected to the network , switch the network according to the selected network and the selected access point to connect to the network;

Step 3. If the device network standard is a single network standard and the device service type is non-fixed service, the network environment status parameters are used as input, and the pre-trained second depth Q network is used to obtain the network access point for the selected connection. According to the selected The connected network access point performs network switching.

The above method is based on the RBF neural network to judge the device service type and device network standard, thereby determining the deep Q network used for switching, and using the current heterogeneous network or the environment state of the network access point as the input of the deep Q network to obtain the optimal network and access point. The entry point avoids false switching caused by the single switching index of the traditional switching algorithm, reduces the number of device switching times, and makes the switching more reasonable.

Before implementing the above method, it is necessary to pre-train the RBF neural network, the first deep Q network and the second deep Q network.

Training RBF neural network: Initialize the RBF neural network with 3 input nodes (received power change value, delay change value and device inherent parameters), number of hidden nodes (the number of hidden nodes is determined by the error back propagation algorithm) and the number of output nodes 2 (output device service type and device network standard); the actual measured power in the wireless network, delay variation, device intrinsic parameters, device service type and device network standard are used as samples to train the RBF neural network.

When the RBF neural network is in use, directly input the status information of the device, that is, the change value of the device receiving power, the change value of the device receiving delay, and the inherent parameters of the device, into the RBF neural network to obtain the device service type and device network standard, using the formula can be expressed as:

RBF _in ={ΔP,Δτ,F}

RBF _out = {State1, State2}, State1 = 0, 1, State2 = 0, 1

Among them, RBF _in and RBF _out are the input and output of the RBF neural network, respectively, ΔP, Δτ, and F are the device receiving power change value, device receiving delay change value and device inherent parameters, State1 is the device service type, State1=1 Indicates that the service type of the device is a mobile service, State1=0 indicates that the service type of the device is a fixed service, State2 indicates the network standard of the device, State2=1 indicates that the network standard of the device supports multiple networks, and State1=0 indicates that the network standard of the device is a single network format.

For example, there are 5G, WiFi, and LoRa in the environment. If the device supports multiple network standards, the device can switch between multiple networks. If the device supports a single network standard, the device can only switch in a single network, that is, access Click Switch.

Therefore, after obtaining the device service type and device network standard, the preset rules in Figure 2 can be used to determine the network used for network switching. The details can be as follows:

1) If the device network standard supports multiple network standards, no matter whether the device business type is fixed business (such as temperature, humidity, pressure sensor, etc. transmission business) or mobile business (such as smart label, operator smart helmet, etc. transmission business), First use the first depth Q network to obtain the network selected for connection, that is, the network after switching, and then use the second depth Q network to obtain the access point of the selected connection network, that is, the access point of the network after switching;

2) If the device network standard is a single network standard and the device service type is a fixed service, then the device cannot perform network switching; if the device network standard is a single network standard and the device service type is a non-fixed service (ie mobile service) , directly adopting the second deep Q network to obtain the network access point selected for connection, that is, the switched network access point.

The above-mentioned first deep Q-network is only responsible for vertical switching (i.e. inter-network switching), and the second deep Q-network is only responsible for access point switching. The first deep Q-network and the second deep-Q network use the same deep Q-network, including real The Q-network and the target Q-network also have network environment state parameters as their inputs, but because the purposes of the two are different, the input parameters and network switching reward functions are different.

The actual Q network is interacted with the wireless network environment, that is, the network environment state parameters are input into the actual Q network, wherein, the network environment state parameters input into the first deep Q network can be:

A. The bandwidth, delay, bit error rate and jitter of each wireless network in the environment;

B. The device's demand matrix for network bandwidth, delay, bit error rate and jitter.

Assuming that there are N heterogeneous wireless networks in the environment, such as WiFi, LoRa, 5G, etc., the above parameters can be expressed as:

S ₁ ＝{B ₁ ,τ ₁ ,e ₁ ,J ₁ ,B ₂ ,τ ₂ ,e ₂ ,J ₂ ,...,B _N ,τ _N ,e _N ,J _N ,X}

Among them, S ₁ is the network environment state parameter set input to the first depth Q network, B _n is the bandwidth provided by wireless network n, τ _n is the time delay provided by wireless network n, e _n is the time delay provided by wireless network n Bit error rate, J _n is the jitter provided by wireless network n, n∈[1,N], X is the demand matrix of equipment for network bandwidth, delay, bit error rate and jitter.

The network environment state parameter input into the second depth Q network can be:

A. The bandwidth, delay, bit error rate and jitter of each access point in the network;

B. The device receives the receiving power of each access point;

C. The device's demand matrix for network bandwidth, delay, bit error rate and jitter.

Assuming that the number of access points in the network is M, the above parameters can be expressed as:

S ₂ ＝{P ₁ ,B′ ₁ ,τ′ ₁ ,e′ ₁ ,J′ ₁ ,P ₁ ,B′ ₂ ,τ′ 2 ,e′ ₂ ,J′ ₂ _, ...,P _M ,B ′ _M ,τ′ _M ,e′ _M ,J′ _M ,X}

Among them, s ₂ is the network environment state parameter set input into the second depth Q network, P _m is the received power of access point m, B′ _m is the bandwidth provided by access point m, τ′ _m is the access point m The provided delay, e' _m is the bit error rate provided by the access point m, and J' _m is the jitter provided by the access point m.

Input the network environment state parameters into the actual Q network to get Q to, use the ε-greedy method to select the action, where, for the first deep Q network, the action is to select the connected network, and for the second deep Q network, the action is to select The connected network access point can be expressed by the following formula:

Among them, a ₁ and a ₂ are network switching actions and network access point switching actions respectively, θ ₁ and θ ₁ are the parameters of the first deep Q network and the second deep Q network respectively, and α is a generated random number, ε is the probability of exploration.

By executing the action, the environment will return to the state S' ₁ or S' ₂ at the next moment, and the reward for the device performing the network switching action.

Compared with the first deep Q network, the network switching reward function can be expressed by the following formula:

Among them, r ₁ is the network switching reward in the first deep Q network,

Select weights for networks with bandwidth when switching between networks,

Select weights for networks with delays in network switching,

The network selection weight for the jitter during network switching, f _1B (S ₁ ,n) is the network selection revenue function of bandwidth during network switching, f _1τ (S ₁ ,n) is the network selection revenue function of network switching delay, f _1e (S ₁ ,n) is the network selection revenue function of bit error rate during network switching, f _1J (S ₁ ,n) is the network selection revenue function of jitter during network switching, qB is the bandwidth required by wireless network n, and qτ is the wireless The delay required by network n, qe is the bit error rate required by wireless network n, and qJ is the jitter required by wireless network n.

Compared to the second deep Q network, the network switching reward function can be expressed by the following formula:

Among them, r ₂ is the network switching reward in the second deep Q network,

Select weights for networks with bandwidth when selecting access points,

Select weights for networks with delays when selecting access points,

Select weights for the network for bit error rates when selecting access points,

It is the network selection weight of the jitter when selecting the access point, f _2B (S ₂ ,m) is the network selection benefit function of the bandwidth when selecting the access point, and f _2τ (S ₂ ,m) is the time delay when selecting the access point Network selection revenue function, f _2e (S ₂ ,m) is the network selection revenue function of bit error rate when selecting an access point, f _2J (S ₂ ,m) is the network selection revenue function of jitter when selecting an access point, P _th is the sensitivity of receiving power, qB' is the network bandwidth required by access point m, qτ' is the network delay required by access point m, qe' is the network bit error rate required by access point m, and qJ' is the network bandwidth required by access point m. Network jitter required by entry point m.

The above four-dimensional data (S ₁ , a ₁ , r ₁ , S′ ₁ ) of the first deep Q network or the four-dimensional data (S ₂ , a ₂ , r ₂ , S′ ₂ ) of the second deep Q network will be stored in the empirical In the pool, the deep Q network is trained through the data in the experience pool.

Input the network environment state parameters (S ₁ , S ₂ ) in the four-dimensional data into the actual Q network to obtain the real Q value, and input the state (S′ ₁ , S′ ₂ ) in the four-dimensional data into the target Q network, The target Q value of the next state is obtained, and the switching action that maximizes the target Q value is selected as the action of the next state:

a′∈{a′ ₁ ,a′ ₂ }

a∈{a ₁ ,a ₂ }

S′∈{S′ ₁ ,S′ ₂ }

Among them, a' indicates the switching action of the next state, θ ^- indicates the network parameters of the target Q network, the subscript 1 indicates that it corresponds to the first deep Q network, and 2 indicates that it corresponds to the second deep Q network.

After obtaining the next state action, the target Q value can be updated as:

Q'=r+γQ(S',a'; θ)

r∈{r ₁ ,r ₂ }

θ∈{θ ₁ ,θ ₂ }

Among them, γ represents the discount factor and r represents the reward.

The loss function can be calculated using the updated target Q value, as shown in Figure 3, the formula can be expressed as:

Loss(θ)=E[r+γQ(S′,a′;θ)-Q(S,a;θ)] ²

Among them, θ is the deep Q network parameter, and E is the expected operation.

The network parameters of the real Q network are updated through the backpropagation of the loss function. Specifically, the network parameters of the current real Q network are copied to the target Q network every certain number of steps.

Use the network parameters of multi-network standard equipment and single-network standard equipment in a certain environment to train the first deep Q network and the second deep Q network until the network results converge, and apply the trained network to vertical switching of heterogeneous wireless networks .

The above method can be implemented in the heterogeneous wireless network scenario shown in Figure 4. Assuming that the scenario is a smart factory, substation, underground pipe corridor or large sports field, etc., first construct and train the RBF neural network and deep Q network, and then the trained The network is built into devices with computing power. Every time period, the RBF neural network will judge whether the device currently supports multi-network standards and the type of business transmitted according to the real-time device status information, and select the corresponding deep Q network to judge the switching action through the method in Figure 2.

The above method is oriented to different devices and different services in the wireless sensor network, combines the RBF neural network and the deep Q network, and judges the device service type and device network standard based on the RBF neural network, so as to determine the deep Q network used for switching. A deep Q network is responsible for switching different network standards, and a second deep Q network is responsible for switching different access points under the same network. Both deep Q-networks are trained through the network state parameters and constructed reward functions, and the current heterogeneous network or the environmental state of the network access point is used as the input of the deep Q-network to judge the network switching and obtain the optimal network and access points. point, avoiding the wrong switching caused by the single switching index of the traditional switching algorithm, reducing the number of equipment switching, and switching more reasonable. The method can realize accurate and effective network switching of different networks, and improve the service quality of users.

Based on the same technical solution, the present invention also discloses a software system corresponding to the above method, that is, an adaptive network switching system, including:

The first switching module: if the network standard of the device supports multi-network standard, the pre-trained first deep Q network is used to obtain the network to be selected for connection, and the pre-trained second deep Q network is used to obtain the access point of the selected network, Perform network switching according to the selected network and the access point selected to connect to the network;

Based on the same technical solution, the present invention also discloses a computer-readable storage medium that stores one or more programs, and the one or more programs include instructions that, when executed by a computing device, cause the computing The device implements an adaptive network switching method.

Based on the same technical solution, the present invention also discloses a computing device, including one or more processors, one or more memories, and one or more programs, wherein one or more programs are stored in the one or more stored in the memory and configured to be executed by the one or more processors, the one or more programs include instructions for executing the adaptive network handover method.

Those skilled in the art should understand that the embodiments of the present invention may be provided as methods, systems, or computer program products. Accordingly, the present invention can take the form of an entirely hardware embodiment, an entirely software embodiment, or an embodiment combining software and hardware aspects. Furthermore, the present invention may take the form of a computer program product embodied on one or more computer-usable storage media (including but not limited to disk storage, CD-ROM, optical storage, etc.) having computer-usable program code embodied therein.

The present invention is described with reference to flowchart illustrations and/or block diagrams of methods, apparatus (systems), and computer program products according to embodiments of the invention. It should be understood that each procedure and/or block in the flowchart and/or block diagram, and a combination of procedures and/or blocks in the flowchart and/or block diagram can be realized by computer program instructions. These computer program instructions may be provided to a general purpose computer, special purpose computer, embedded processor, or processor of other programmable data processing equipment to produce a machine such that the instructions executed by the processor of the computer or other programmable data processing equipment produce a An apparatus for realizing the functions specified in one or more procedures of the flowchart and/or one or more blocks of the block diagram.

These computer program instructions may also be stored in a computer-readable memory capable of directing a computer or other programmable data processing apparatus to operate in a specific manner, such that the instructions stored in the computer-readable memory produce an article of manufacture comprising instruction means, the instructions The device realizes the function specified in one or more procedures of the flowchart and/or one or more blocks of the block diagram.

These computer program instructions can also be loaded onto a computer or other programmable data processing device, causing a series of operational steps to be performed on the computer or other programmable device to produce a computer-implemented process, thereby The instructions provide steps for implementing the functions specified in the flow chart or blocks of the flowchart and/or the block or blocks of the block diagrams.

The above is only an embodiment of the present invention, and is not intended to limit the present invention. Any modifications, equivalent replacements, improvements, etc. made within the spirit and principles of the present invention are included in the pending application of the present invention. within the scope of the claims.

Claims

A method for adaptive network switching, characterized in that it comprises:

Input the status information of the equipment in the network environment into the pre-trained RBF neural network to obtain the equipment business type and equipment network standard;

If the device network standard supports multiple network standards, the network environment state parameters are used as input, and the pre-trained first deep Q network and second deep Q network are used to obtain the selected network and the access point of the selected network, according to the selection Connected network and select the access point connected to the network to switch the network;

If the network standard of the device is a single network standard and the business type of the device is non-fixed business, the network environment status parameters are used as input, and the pre-trained second depth Q network is used to obtain the network access point for the selected connection. Access point for network switching.
The adaptive network switching method according to claim 1, wherein the status information of the device includes a change value of device receiving power, a change value of device receiving delay and inherent parameters of the device.
A method for adaptive network switching according to claim 1, wherein, if the network standard of the device supports multiple network standards, the pre-trained first depth Q network and the second depth Q network are used to obtain a network for selective connection And select the access point to connect to the network, and perform network switching according to the selected network and the selected access point to connect to the network, including:

If the network standard of the device supports multiple network standards, use the pre-trained first depth Q network to obtain the network to be selected for connection, and use the pre-trained second depth Q network to obtain the access point to select the network to connect to. and select the access point connected to the network to switch the network.
A kind of adaptive network switching method according to claim 3, is characterized in that, the input of the first depth Q network is network environment state parameter, comprises:

The bandwidth, delay, bit error rate and jitter of each wireless network in the environment;

The device's demand matrix for network bandwidth, delay, bit error rate, and jitter.
A method for adaptive network switching according to claim 4, wherein the network switching reward in the first deep Q network is:

Among them, r 1 is the network switching reward in the first deep Q network,
Select weights for networks with bandwidth when switching between networks,
Select weights for networks with delays in network switching,
Select the weight for the network for the bit error rate when the network is switched,
The network selection weight for the jitter during network switching, f 1B (S 1 ,n) is the network selection revenue function of bandwidth during network switching, f 1τ (S 1 ,n) is the network selection revenue function of network switching delay, f 1e (S 1 ,n) is the network selection benefit function of bit error rate during network switching, f 1J (S 1 ,n) is the network selection benefit function of jitter during network switching, and S 1 is the network environment input to the first deep Q network State parameter set.

Among them, B n is the bandwidth provided by wireless network n, τ n is the delay provided by wireless network n, e n is the bit error rate provided by wireless network n, J n is the jitter provided by wireless network n, qB is the bandwidth required by wireless network n, qτ is the delay required by wireless network n, qe is the bit error rate required by wireless network n, and qJ is the jitter required by wireless network n.
A kind of adaptive network switching method according to claim 3, is characterized in that, the input of the second depth Q network is network environment state parameter, comprises:

Bandwidth, delay, bit error rate and jitter of each access point in the network;

The device receives the receiving power of each access point;

The device's demand matrix for network bandwidth, delay, bit error rate, and jitter.
A kind of adaptive network switching method according to claim 6, is characterized in that, the network switching award in the second depth Q network is:

Among them, r 2 is the network switching reward in the second deep Q network,
Select weights for networks with bandwidth when selecting access points,
Select weights for networks with delays when selecting access points,
Select the weight for the network for the bit error rate when selecting an access point,
Select the weight for the jitter network when selecting the access point, f 2B (S 2 ,m) is the network selection revenue function of the bandwidth when selecting the access point, and f 2τ (S 2 ,m) is the network delay when selecting the access point Select the income function, f 2e (S 2 ,m) is the network selection income function of bit error rate when selecting the access point, f 2J (S 2 ,m) is the network selection income function of jitter when selecting the access point, S 2 is the network environment state parameter set input to the second deep Q network, P m is the received power of access point m, and P th is the sensitivity of the received power

Among them, B′ m is the bandwidth provided by the access point m, τ′ m is the delay provided by the access point m, e′ m is the bit error rate provided by the access point m, and J′ m is the access point The jitter provided by point m, qB' is the network bandwidth required by access point m, qτ' is the network delay required by access point m, qe' is the network bit error rate required by access point m, and qJ' is the network bandwidth required by access point m. Network jitter required by entry point m.
An adaptive network switching system is characterized in that it includes:

RBF neural network module: Input the status information of the equipment in the network environment into the pre-trained RBF neural network to obtain the equipment business type and equipment network standard;

The first switching module: if the network standard of the device supports multiple network standards, the pre-trained first deep Q network and the second deep Q network are used to obtain the selected network and the access point of the selected network, according to the selected network And select the access point connected to the network to switch the network;

The second switching module: if the device network standard is a single network standard and the device business type is non-fixed business, use the pre-trained second depth Q network to obtain the selected network access point, and select the connected network access point to perform network switching.
An adaptive network switching system according to claim 8, characterized in that, the first switching module: if the device network standard supports multi-network standard, the pre-trained first depth Q network is used to obtain the network for selective connection, The pre-trained second deep Q network is used to obtain the access point of the selected network, and the network switching is performed according to the selected network and the selected access point of the network.
A computer-readable storage medium storing one or more programs, wherein the one or more programs comprise instructions which, when executed by a computing device, cause the computing device to perform the Any one of the methods described in 7.