CN113010986B - Antenna array design method based on reinforcement learning and random optimization algorithm - Google Patents
Antenna array design method based on reinforcement learning and random optimization algorithm Download PDFInfo
- Publication number
- CN113010986B CN113010986B CN202110284035.2A CN202110284035A CN113010986B CN 113010986 B CN113010986 B CN 113010986B CN 202110284035 A CN202110284035 A CN 202110284035A CN 113010986 B CN113010986 B CN 113010986B
- Authority
- CN
- China
- Prior art keywords
- random
- antenna array
- algorithm
- reward
- optimization algorithm
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F30/00—Computer-aided design [CAD]
- G06F30/10—Geometric CAD
- G06F30/18—Network design, e.g. design based on topological or interconnect aspects of utility systems, piping, heating ventilation air conditioning [HVAC] or cabling
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F30/00—Computer-aided design [CAD]
- G06F30/20—Design optimisation, verification or simulation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/004—Artificial life, i.e. computing arrangements simulating life
- G06N3/006—Artificial life, i.e. computing arrangements simulating life based on simulated virtual individual or collective life forms, e.g. social simulations or particle swarm optimisation [PSO]
Abstract
The invention discloses an antenna array design method based on reinforcement learning and random optimization algorithm, which comprises the following steps: step 1, selecting a random optimization algorithm, and establishing an initial antenna array model to be optimized; step 2, designing a fitness function; step 3, compiling VB scripts in MATLAB; step 4, creating a plurality of HFSS threads to process VB scripts in parallel to obtain results corresponding to the random particles; step 5, generating a plurality of groups of xt by iteration; step 6, obtaining the optimal solution of all fitness functions by utilizing multiple groups of xt as a training set; step 7, designing a Q-Learning algorithm to train the training set to obtain a Q table; and 8, inputting the new cache reward and the Q table into a Q-Learning algorithm to obtain the optimal result of the antenna array to be designed. The method provided by the invention does not need additional human intervention, and completely depends on the reinforcement learning intelligent agent to judge the antenna optimization result, so that the overall efficiency of the method is improved to more than 50% compared with the traditional antenna optimization design mode.
Description
Technical Field
The invention belongs to the field of antenna array optimization application, and relates to an antenna array design method based on reinforcement learning and random optimization algorithm.
Background
With the development of wireless communication technology, communication devices have gone into thousands of households, and become a part of people's daily life. The performance of the antenna, which is an important medium in the wireless communication system, directly affects the transmission quality of the whole wireless communication system. The single antenna can realize basic electromagnetic wave radiation and wireless transmission of signals, but for specific functions and great improvement of performance, the single antenna is somewhat more involved. The array antenna can perfectly make up for the defects of a single antenna, greatly improve the radiation gain of the antenna, and realize the functions of beam forming and the like which need to be realized in a specific scene.
The most obvious advantage of the antenna array is its high gain and directivity, and in a point-to-point communication system, the antenna is often required to have strong directivity, that is, the antenna is required to concentrate most of energy to a point and transmit the energy, so that both directivity and high gain are realized. For the adjustment of the directivity, the phase adjustment is reflected on the antenna, for the array, the phase adjustment involves more parameters, and when the array elements are more or the optimization target is more complex, the method of manual adjustment is too slow, so that some commonly used random optimization algorithms are used for optimizing partial parameters of the antenna, and the commonly used random optimization algorithms include a particle swarm algorithm, a simulated annealing algorithm, a differential evolution algorithm and the like.
The speed of algorithm optimization parameters is greatly improved compared with that of manpower, but no matter which random optimization algorithm is adopted, because the optimal fitness function is difficult to describe once for optimization by the optimal target of the antenna, a designer needs to observe the optimal result after the fitness function is optimized for many times to modify function parameters in the process of optimizing the antenna array, and the optimization time of modifying the fitness function for optimization needs several or dozens of hours each time. In particular, for multi-parameter, multi-objective optimization processes, manual operations may require many attempts to achieve a superior result due to the number of variables involved. Therefore, in this way, since human intervention is always required to continuously adjust the fitness function, the fitness function is relatively long and random in optimization time, and the optimization efficiency is low.
Disclosure of Invention
Aiming at the problem of low efficiency of array optimization design of the traditional optimization algorithm, the invention aims to provide an antenna array design method based on reinforcement learning and random optimization algorithm.
In order to solve the technical problems, the invention adopts the following technical scheme:
an antenna array design method based on reinforcement learning and random optimization algorithm specifically comprises the following steps:
step 1, selecting a random optimization algorithm, establishing an initial antenna array model to be optimized by using HFSS (high frequency signal-to-satellite) and deriving a corresponding radiation and scattering result x in a simulation manner;
step 2, designing a fitness function f (x) = a 1. F1+ a 2. F2+ \ 8230, + an. Fn, and determining the values of a1-an as [ a1 ] respectively according to the requirement of matching the magnitude of f1-fn 1 -a1 k ]、[a2 1 - a2 k ]…[an 1 -an k ];
Where x comprises the radiation pattern x of the array 1 And scattering pattern x 2 (ii) a f1-fn are respectively different evaluation functions of the evaluation x; a1-an are coefficients of different evaluation functions respectively, and have k n A fitness function;
step 3, compiling the selected random optimization algorithm in MATLAB, and compiling a plurality of VB scripts according to initial random particles of the random optimization algorithm; operating the step 4.2;
step 4, creating a plurality of HFSS threads to process VB scripts in parallel to obtain a group of radiation and scattering results xt corresponding to random particles, and specifically comprising the following substeps:
step 4.1, compiling a plurality of VB scripts according to random particles of a random optimization algorithm;
step 4.2, a plurality of HFSS threads are created by adopting a parallel computing function of MATLAB, VB scripts are processed by the HFSS threads, and radiation and scattering results xt led out by each HFSS thread are obtained, wherein xt corresponds to each particle in random particles, so that a group of xt corresponding to the current random particles one by one is obtained;
step 5, randomly selecting f (x) in the fitness function f (x), respectively substituting the xt obtained in the step 4 into the selected f (x) for calculation to obtain a new xt group, thereby obtaining a group of random particles corresponding to the xt group, taking the new random particles as random particles of a random optimization algorithm, returning to and iteratively executing the step 4 until the obtained current generation random particles meet a convergence condition, and finally obtaining a plurality of groups of xt generated in an iterative process;
and 6, substituting multiple groups of new xt into all fitness functions obtained in the step 2 as x respectively, taking the minimum value of the result of each fitness function as the optimal solution of the fitness function, and thus obtaining the optimal solutions of all the fitness functions with k in total n A, connect this k n Taking the optimal solution as a training set;
step 7, designing a Q-Learning algorithm to train a training set to obtain an intelligent agent optimizing strategy Q table;
step 8, a fitness function corresponding to the state s at the random position of the Q table is used in a random optimization algorithm, the random optimization algorithm is used for optimizing the array to be designed to obtain an optimized radiation and scattering result x ' derived from the HFSS, and the difference between the x ' and a target beam is used for obtaining a new cache reward tr ' at the state s; and inputting the new buffer reward tr' and the Q table into a Q-Learning algorithm to obtain the optimal result of the antenna array to be designed.
Further, in the step 1, the random optimization algorithm is a particle swarm algorithm, a simulated annealing algorithm or a differential evolution algorithm.
Further, the step 7 specifically includes the following sub-steps:
step 7.1, determining a state s, an action a and an award r in a Q-Learning algorithm; wherein the state s is k obtained in step 6 n The optimal solution, action a is the transition of state s, and action a comprises 2 (k-1) n different actions;
generating a cache reward tr: each data in the state s set is differentiated from the target wave beam to obtain all buffer reward values tr, and normalization processing is carried out on all the buffer reward values tr;
setting a reward r: taking the minimum value of the normalized tr, setting all rewards r at the action a transferred to the minimum tr as 100, comparing the state correspondence tr before and after all actions a, if the transferred tr is smaller, setting the reward r as +1, otherwise, setting the reward r as-1; simultaneously setting the reward r of all the pointing actions a with the larger state difference with the target beam as-50;
and 7.2, starting from the initial position, gradually and iteratively searching for an optimal solution path by applying the state s, the action a and the reward r determined in the step 7.1 to obtain an agent optimization strategy Q table obtained by a Q-Learning algorithm.
Further, in step 7.1, the generating function of the target beam is as follows:
in the formula, target is a design-required pointing angle, and any angle between 45 degrees and 135 degrees is substituted into a function; deg is degree, the value is 0-180, and the step length is 1, and all the functions are substituted to obtain the target wave beam.
Further, in step 8, the antenna array to be designed has a similar arrangement to the initial antenna array in step 1.
Further, the similar arrangement means similar arrangement with the initial antenna array, including: (1) The number of the transverse or longitudinal antennas is not changed, and only the other one-dimensional number is modified; (2) Different array elements are used under the condition that the arrangement state of the antenna array is unchanged.
Compared with the prior art, the method adopts the reinforcement learning method to replace the function of human decision in the antenna array optimization process, so that the parameters of the fitness function needing human intervention automatically generate relative optimal values through reinforcement learning, the whole antenna design process does not need additional human intervention, and the antenna optimization result is judged completely by means of an intelligent body of reinforcement learning. Through experiments, compared with the traditional antenna optimization design mode, the method provided by the invention has the advantage that the overall efficiency is improved by more than 50%.
Drawings
FIG. 1 is a flow chart of the method of the present invention.
Fig. 2 shows the original 2 x 4 antenna array model and parameters in an embodiment.
FIG. 3 is a schematic diagram of a2 × 6 array model to be optimized after optimization.
Fig. 4 is a schematic diagram illustrating the consistency between the optimized result of the array model to be optimized and the designed target beam in the embodiment.
FIG. 5 is a comparison of the time consumption of the conventional optimization method and the reinforcement learning method of the present invention.
The present invention will be explained in further detail with reference to examples.
Detailed Description
The invention relates to an antenna array design method based on reinforcement learning and random optimization algorithm, which comprises the following steps:
step 1, selecting a random optimization algorithm, establishing an initial antenna array model to be optimized by using HFSS (high frequency signal-to-satellite) and deriving a corresponding radiation and scattering result x in a simulation manner;
the random optimization algorithm can be any conventional random optimization algorithm, such as a particle swarm algorithm, a simulated annealing algorithm, a differential evolution algorithm and the like.
Step 2, designing a fitness function f (x) = a 1. F1+ a 2. F2+ \ 8230, + an. Fn, and determining the values of a1-an as [ a1 ] respectively according to the requirement of matching the magnitude of f1-fn 1 -a1 k ]、[a2 1 - a2 k ]…[an 1 -an k ];
Where x comprises the radiation pattern x of the array 1 And scattering pattern x 2 (ii) a f1-fn are different evaluation functions of the evaluation x, such as the maximum peak value of the main lobe, the mean value of the side lobes, the width of the main lobe and the like; a1-an are respectively coefficients of different evaluation functions, the coefficients are used for balancing magnitude and changing action of different evaluation functions in the whole fitness function, therefore, k is shared according to different value combinations of the coefficients of different evaluation functions n And the fitness function is used for enabling the random optimization algorithm to obtain different optimal results.
Step 3, writing the selected random optimization algorithm in MATLAB, and writing a plurality of VB scripts according to the initial random particles of the random optimization algorithm (the number of the VB scripts is the same as that of the initial random particles); the VB script is used for controlling the HFSS to conduct parameter adjustment and result derivation of the initial antenna array model. Operating the step 4.2;
and 4, creating a plurality of HFSS threads to process the VB script in parallel to obtain a group of radiation and scattering results xt corresponding to the random particles. The method comprises the following specific steps:
and 4.1, writing a plurality of VB scripts according to the random particles of the random optimization algorithm. The VB script is used for controlling the HFSS to carry out parameter adjustment and result derivation on the initial antenna array model (the number of the VB script is the same as that of the random particles);
and 4.2, creating a plurality of HFSS threads by adopting a parallel computing function of MATLAB (the number of the HFSS threads is determined by the processing capacity of a CPU (Central processing Unit) of the machine), processing VB scripts by the HFSS threads to obtain radiation and scattering results xt led out by each HFSS thread, wherein the xt corresponds to each particle in random particles, and thus a group of xt corresponding to the current random particles one by one is obtained.
The step calls a plurality of solving processes in parallel, so that the convergence rate of the whole optimization algorithm is improved, and each generation of algorithm can be solved more quickly.
Step 5, randomly selecting an f (x) from the fitness function f (x), respectively substituting the xt obtained in the step 4 into the selected f (x) for calculation, and conventionally screening and transforming the calculation result to obtain a new xt group so as to obtain a group of random particles corresponding to the xt group (the screening and transforming process is determined by the selected random optimization algorithm), taking the new random particles as the random particles of the random optimization algorithm, returning and iteratively executing the step 4 until the obtained current random particles meet the convergence condition, and finally obtaining a plurality of groups of xt generated in the iterative process;
and 6, substituting multiple groups of new xt into all fitness functions obtained in the step 2 as x respectively, taking the minimum value of the result of each fitness function as the optimal solution of the fitness function, and thus obtaining the total k of the optimal solutions of all the fitness functions n A, connect this k n The optimal solution is used as a training set for reinforcement learning training;
and 7, designing a Q-Learning algorithm to train the training set to obtain an intelligent agent optimizing strategy Q table. The method comprises the following substeps:
step 7.1, determining a state s, an action a and an award r in a Q-Learning algorithm; wherein the state s is k obtained in step 6 n The optimal solution, action a is the transition of state s, namely the change types of the coefficients of different fitness functions, and action a comprises 2 (k-1) n different actions;
function g (deg) to generate the target beam:
in the formula, target is a designed required pointing angle, and any angle between 45 degrees and 135 degrees is substituted into a function; deg is degree, the value is 0-180, and the step length is 1, and all the steps are substituted into the function to obtain a target wave beam trans;
generating a cache reward tr: each data in the state s set is differentiated from the target wave beam to obtain all buffer reward values tr, and normalization processing is carried out on all the buffer reward values tr;
setting a reward r: taking the minimum value of the normalized tr, setting the reward r at the action a transferred to the minimum tr as 100, comparing the state corresponding to tr before and after all the actions a, if the transferred tr is smaller, setting the reward r as +1, otherwise, setting the reward r as-1; while the reward r at all pointing actions a that differ from the target beam by a large amount (tr is greater than 0.7 in the example) is set to-50 so that the algorithm avoids changing the fitness function to that state.
Step 7.2, starting from the initial position, the agent gradually and iteratively searches for an optimal solution path by applying the state s, the action a and the reward r determined in the step 7.1 to obtain an agent optimization strategy Q table obtained by a Q-Learning algorithm;
step 8, a fitness function corresponding to the state s at the random position of the Q table is used in a random optimization algorithm, the random optimization algorithm is used for optimizing the array to be designed to obtain an optimized radiation and scattering result x ' derived from the HFSS, and the difference between the x ' and a target beam is used for obtaining a new cache reward tr ' at the state s; and inputting the new buffer reward tr' and the Q table into a Q-Learning algorithm to obtain the optimal result of the antenna array to be designed.
Having a similar arrangement means that the antenna array is similar in arrangement to the original antenna array, including:
(1) The case where the number of horizontal or vertical antennas is not changed but the number of the other one-dimension is modified. For example, for a2 × 4 original array, the method can be universally deployed and optimized on a2 × 6 universal array, a2 × 8 universal array and the like;
(2) And different array elements are used under the condition that the arrangement state of the antenna array is not changed. If the array element of the original array is a microstrip antenna, the antenna to be designed is a slot antenna.
Example 1:
step 1, according to the specific design requirements of a communication system (such as the limitation of factors such as the type, number and integral size of an antenna element), using HFSS to establish an initial antenna array model to be optimized.
Taking a2 × 4 microstrip patch antenna array as an example, the parameters of each array element include the lengths of the impedance line and the delay line. Spacing d of the antennas in this embodiment 1 -d 10 Length il of impedance line of eight antenna elements including 6 transverse and 4 longitudinal spacings of the elements 1 -il 8 And the length dl of the delay line 1 -dl 8 Are set as variables in the HFSS for subsequent optimization modification using MATLAB generation scripts.
Step 2, designing an optimization algorithm fitness function, wherein f (x) = a 1. F1+ a 2. F2+ \8230, + an. Fn and the variation range [ a1-an ] of a1-an 1 -a1 5 ]、[a2 1 -a2 5 ]…[an 1 -an 5 ]。
For a2 x 4 microstrip patch array antenna, the optimization objective is to increase the scattering intensity and shift the radiation pattern. The fitness function is designed as follows:
f(x)=-0.3*max(x 2 )+1*len(x 1 )+50*std(crest(x 1 ))-0.3*max(x 1 ) +3 × diff (x 1-target), where max () is the function taking the maximum value, len () is the function taking the width of the main lobe, std (crest ()) is the function taking the variance of the peak and valley of the side lobe, diff (x 1-target) isThe difference between the main lobe orientation and the design orientation.
And 3, selecting a random optimization algorithm as a learning basis, taking a differential evolution algorithm as an example, writing the differential evolution algorithm in MATLAB, setting the number of random particles to be 50, setting the iteration number to be 200, and setting the convergence condition to be that the optimal difference between two generations is less than 0.002.
And 4, creating a plurality of HFSS threads to process the VB script in parallel to obtain a group of radiation and scattering results xt corresponding to the random particles.
In steps 5 and 6, because the number of changes of f (x) is too large, a great amount of time is consumed if all generated fitness functions are traversed and solved. A modified solution process is therefore employed here to generate all the training sets. The method specifically comprises the steps of randomly selecting a fitness function, solving the fitness function, and storing all intermediate results xt. And then generating the optimal solution corresponding to all fitness functions. Finally, k is put n And taking the optimal solution as a training set for later reinforcement learning training.
And 7, designing a Q-Learning algorithm to train the training set to obtain an intelligent agent optimizing strategy Q table. The method specifically comprises the following steps: a state s, an action a, and a reward r are determined, wherein in the reward r setting, a function is used to generate the target beam, as shown by the dotted line in fig. 4, and the actual result is subtracted from the optimal value to obtain the cache reward tr.
And 8, applying the Q table to the simulation design antenna array, and enabling the intelligent agent to automatically optimize the antenna according to the Q table value and the really obtained reward. Particularly for the Q table obtained by training with the 2 × 4 array, the method can be universally deployed and optimized on the universal arrays such as 2 × 6, 2 × 8 and the like.
Fig. 4 shows simulation results (indicated by solid lines) obtained by optimizing the Q-table obtained by training the 2 × 4 array on the 2 × 6 microstrip array shown in fig. 3, wherein the dotted line is a target beam of the design, and it can be seen from fig. 4 that the solid line and the dotted line have a higher degree of coincidence, and thus substantially meet design requirements. Meanwhile, from the design time shown in fig. 5, the whole optimization speed is improved by 50% by optimizing the antenna array by using the reinforcement learning method of the invention. In addition, the overall optimization speed was improved by 70% when the Q-table obtained by training the 2 x 4 array was applied to the 2 x 8 array.
Claims (3)
1. An antenna array design method based on reinforcement learning and random optimization algorithm is characterized by comprising the following steps:
step 1, selecting a random optimization algorithm, establishing an initial antenna array model to be optimized by using HFSS (high frequency signal-to-satellite) and deriving a corresponding radiation and scattering result x in a simulation manner;
step 2, designing a fitness function f (x) = a 1. F1+ a 2. F2+ \ 8230, + an. Fn, and determining the values of a1-an as [ a1 ] respectively according to the requirement of matching the magnitude of f1-fn 1 -a1 k ]、[a2 1 -a2 k ]…[an 1 -an k ];
Where x comprises the radiation pattern x of the array 1 And scattering pattern x 2 (ii) a f1-fn are respectively different evaluation functions of the evaluation x; a1-an are coefficients of different evaluation functions respectively, and have k n A fitness function;
step 3, compiling the selected random optimization algorithm in MATLAB, and compiling a plurality of VB scripts according to initial random particles of the random optimization algorithm; operating the step 4.2;
step 4, creating a plurality of HFSS threads to process the VB script in parallel to obtain a group of radiation and scattering results xt corresponding to random particles, and specifically comprising the following sub-steps:
step 4.1, compiling a plurality of VB scripts according to random particles of a random optimization algorithm;
step 4.2, a plurality of HFSS threads are created by adopting a parallel computing function of MATLAB, VB scripts are processed by the HFSS threads, and radiation and scattering results xt led out by each HFSS thread are obtained, wherein xt corresponds to each particle in random particles, so that a group of xt corresponding to the current random particles one by one is obtained;
step 5, randomly selecting f (x) in the fitness function f (x), respectively substituting the xt obtained in the step 4 into the selected f (x) for calculation to obtain a new xt group, thereby obtaining a group of random particles corresponding to the xt group, taking the new random particles as random particles of a random optimization algorithm, returning to and iteratively executing the step 4 until the obtained current generation random particles meet a convergence condition, and finally obtaining a plurality of groups of xt generated in an iterative process;
and 6, substituting multiple groups of new xt into all fitness functions obtained in the step 2 as x respectively, taking the minimum value of the result of each fitness function as the optimal solution of the fitness function, and thus obtaining the optimal solutions of all the fitness functions with k in total n A, will k this n Using the optimal solution as a training set;
step 7, designing a Q-Learning algorithm to train the training set to obtain an intelligent agent optimizing strategy Q table;
the method specifically comprises the following substeps:
step 7.1, determining a state s, an action a and an award r in a Q-Learning algorithm; wherein the state s is k obtained in step 6 n The optimal solution is that action a is the transition of state s, and action a comprises 2 (k-1) n different actions;
generating a cache reward tr: each data in the state s set is differentiated from the target wave beam to obtain all buffer reward values tr, and normalization processing is carried out on all the buffer reward values tr;
setting a reward r: taking the minimum value of the normalized tr, setting all rewards r at the action a transferred to the minimum tr as 100, comparing the state correspondence tr before and after all actions a, if the transferred tr is smaller, setting the reward r as +1, otherwise, setting the reward r as-1; simultaneously setting the reward r of all the pointing actions a with the larger state difference with the target beam as-50;
the state with larger phase difference means that the generated cache reward tr is larger than 0.7;
the generation function of the target beam is as follows:
in the formula, target is a designed required pointing angle, and any angle between 45 degrees and 135 degrees is substituted into a function; deg is degree, the value is 0-180, and the step length is 1 to be substituted into the function to obtain the target wave beam;
step 7.2, starting from the initial position, the agent gradually and iteratively searches for an optimal solution path by applying the state s, the action a and the reward r determined in the step 7.1 to obtain an agent optimization strategy Q table obtained by a Q-Learning algorithm;
step 8, a fitness function corresponding to the state s at the random position of the Q table is used in a random optimization algorithm, the random optimization algorithm is used for optimizing the array to be designed to obtain an optimized radiation and scattering result x ' derived from the HFSS, and the difference between the x ' and a target beam is used for obtaining a new cache reward tr ' at the state s; and inputting the new buffer reward tr' and the Q table into a Q-Learning algorithm to obtain the optimal result of the antenna array to be designed.
2. The method for designing an antenna array based on reinforcement learning and random optimization algorithm according to claim 1, wherein in the step 1, the random optimization algorithm is a particle swarm algorithm, a simulated annealing algorithm or a differential evolution algorithm.
3. The method according to claim 1, wherein in step 8, the antenna array to be designed has a similar arrangement to the initial antenna array in step 1;
the similar arrangement means that the antenna array is similar to the initial antenna array in arrangement, and comprises the following steps: (1) The number of the transverse or longitudinal antennas is not changed, and only the other one-dimensional number is modified; (2) Different array elements are used under the condition that the arrangement state of the antenna array is not changed.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202110284035.2A CN113010986B (en) | 2021-03-17 | 2021-03-17 | Antenna array design method based on reinforcement learning and random optimization algorithm |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202110284035.2A CN113010986B (en) | 2021-03-17 | 2021-03-17 | Antenna array design method based on reinforcement learning and random optimization algorithm |
Publications (2)
Publication Number | Publication Date |
---|---|
CN113010986A CN113010986A (en) | 2021-06-22 |
CN113010986B true CN113010986B (en) | 2023-02-14 |
Family
ID=76408828
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202110284035.2A Active CN113010986B (en) | 2021-03-17 | 2021-03-17 | Antenna array design method based on reinforcement learning and random optimization algorithm |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN113010986B (en) |
Families Citing this family (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN114266199B (en) * | 2021-12-24 | 2023-06-16 | 北部湾大学 | Solving method and system for internal variables in steady-state operation of magnetic saturation synchronous generator |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN104992000A (en) * | 2015-06-18 | 2015-10-21 | 哈尔滨工业大学 | Method for beam forming and beam pattern optimization based on L-shaped array antenna |
CN110147590A (en) * | 2019-04-30 | 2019-08-20 | 中国地质大学(武汉) | A kind of helical antenna design method based on Adaptive evolution optimization algorithm |
CN111199126A (en) * | 2019-11-08 | 2020-05-26 | 江苏科技大学 | Rapid optimization design method of patch antenna |
CN111783301A (en) * | 2020-07-02 | 2020-10-16 | 西北大学 | Design method of anti-interference scattering communication system antenna array |
Family Cites Families (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US8776002B2 (en) * | 2011-09-06 | 2014-07-08 | Variable Z0, Ltd. | Variable Z0 antenna device design system and method |
US8942659B2 (en) * | 2011-09-08 | 2015-01-27 | Drexel University | Method for selecting state of a reconfigurable antenna in a communication system via machine learning |
CN109635343A (en) * | 2018-11-14 | 2019-04-16 | 江苏科技大学 | A kind of antenna fast optimal design method |
CN112100811A (en) * | 2020-08-13 | 2020-12-18 | 西北工业大学 | Antenna array directional diagram synthesis method based on adaptive wind-driven optimization algorithm |
-
2021
- 2021-03-17 CN CN202110284035.2A patent/CN113010986B/en active Active
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN104992000A (en) * | 2015-06-18 | 2015-10-21 | 哈尔滨工业大学 | Method for beam forming and beam pattern optimization based on L-shaped array antenna |
CN110147590A (en) * | 2019-04-30 | 2019-08-20 | 中国地质大学(武汉) | A kind of helical antenna design method based on Adaptive evolution optimization algorithm |
CN111199126A (en) * | 2019-11-08 | 2020-05-26 | 江苏科技大学 | Rapid optimization design method of patch antenna |
CN111783301A (en) * | 2020-07-02 | 2020-10-16 | 西北大学 | Design method of anti-interference scattering communication system antenna array |
Non-Patent Citations (3)
Title |
---|
Paulo Victor Rodrigues Ferreira等.Multiobjective reinforcement learning for cognitive satellite communications using deep neural network ensembles.《IEEE Journal on Selected Areas in Communications》.2018,第36卷(第5期), * |
基于两种智能算法的阵列天线优化;刘佳音;《中国优秀硕士学位论文全文数据库 信息科技辑》;20200215;第2020年卷(第2期);第I136-115页 * |
基于改进反向传播神经网络代理模型的快速多目标天线设计;董健等;《电子与信息学报》;20181214;第2018年卷(第11期);第2712-2719页 * |
Also Published As
Publication number | Publication date |
---|---|
CN113010986A (en) | 2021-06-22 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN113010986B (en) | Antenna array design method based on reinforcement learning and random optimization algorithm | |
CN107798379B (en) | Method for improving quantum particle swarm optimization algorithm and application based on improved algorithm | |
Zhang et al. | Antenna design by an adaptive variable differential artificial bee colony algorithm | |
CN109063266A (en) | A kind of skin antenna electromechanics collaborative design method based on parallel Bayes optimization | |
CN110147590B (en) | Spiral antenna design method based on adaptive evolution optimization algorithm | |
Wei et al. | Fully automated design method based on reinforcement learning and surrogate modeling for antenna array decoupling | |
CN106935973B (en) | Design method of coaxial feed antenna | |
CN112364467B (en) | Method for analyzing electromagnetic grid size by loosening far field of reflector antenna | |
CN115081325B (en) | Lens antenna multi-objective optimization method based on particle swarm and genetic hybrid algorithm | |
WO2024082778A1 (en) | Parameter-topology hybrid optimization method for electromagnetic device design | |
CN108446504B (en) | Near-field array antenna directional pattern comprehensive method based on convex optimization | |
CN113962163A (en) | Optimization method, device and equipment for realizing efficient design of passive microwave device | |
CN115329655A (en) | Lens antenna multi-objective optimization method based on priori knowledge neural network | |
CN112149360A (en) | ALO-LMBP neural network-based double-notch characteristic ultra-wideband antenna reverse modeling research method | |
CN109358503B (en) | Energy-saving-oriented machine tool moving part multi-target structure optimization method | |
CN116720424A (en) | Antenna design method based on multi-self-adaptive agent model assisted evolution algorithm | |
CN107995027B (en) | Improved quantum particle swarm optimization algorithm and method applied to predicting network flow | |
Liu et al. | Microwave integrated circuits design with relational induction neural network | |
CN111725625B (en) | Dual-band Wi-Fi antenna designed based on multi-target evolution algorithm | |
Koziel et al. | Fast and reliable knowledge-based design closure of antennas by means of iterative prediction-correction scheme | |
Gu et al. | Intelligent design of arbitrary bandstop FSS through deep learning and genetic algorithm | |
Poveda-Pulla et al. | SACPDE: Self-Adaptive Control Parameters in Differential Evolution Algorithm for Notch Filter Design in UWB Antenna Applications | |
Zhang et al. | Nonuniform antenna array design by parallelizing three-parent crossover genetic algorithm | |
CN117688901A (en) | Circuit design method, system, device and storage medium for high-frequency discrete filter | |
Koziel | Accelerated optimization of input characteristics of wideband antennas using improved trust-region gradient search |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |