CN113010986B - Antenna array design method based on reinforcement learning and random optimization algorithm

Info

Publication number: CN113010986B
Authority: CN (China)
Prior art keywords: random, antenna array, algorithm, reward, optimization algorithm
Legal status: Active (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Application number: CN202110284035.2A
Other languages: Chinese (zh)
Other versions: CN113010986A (en)
Inventors: 陈晓江, 赵宇航, 王夫蔚, 王基, 房鼎益
Current Assignee: Northwest University (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Original Assignee: Northwest University
Priority date: 2021-03-17 (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date: 2021-03-17
Publication date: 2023-02-14

Application filed by Northwest University
Priority to CN202110284035.2A
Publication of CN113010986A
Application granted
Publication of CN113010986B
Legal status: Active

Classifications

    • G: PHYSICS
    • G06: COMPUTING; CALCULATING OR COUNTING
    • G06F: ELECTRIC DIGITAL DATA PROCESSING
    • G06F30/00: Computer-aided design [CAD]
    • G06F30/10: Geometric CAD
    • G06F30/18: Network design, e.g. design based on topological or interconnect aspects of utility systems, piping, heating ventilation air conditioning [HVAC] or cabling
    • G: PHYSICS
    • G06: COMPUTING; CALCULATING OR COUNTING
    • G06F: ELECTRIC DIGITAL DATA PROCESSING
    • G06F30/00: Computer-aided design [CAD]
    • G06F30/20: Design optimisation, verification or simulation
    • G: PHYSICS
    • G06: COMPUTING; CALCULATING OR COUNTING
    • G06N: COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00: Computing arrangements based on biological models
    • G06N3/004: Artificial life, i.e. computing arrangements simulating life
    • G06N3/006: Artificial life, i.e. computing arrangements simulating life based on simulated virtual individual or collective life forms, e.g. social simulations or particle swarm optimisation [PSO]

Abstract

The invention discloses an antenna array design method based on reinforcement learning and a random optimization algorithm, comprising the following steps: step 1, selecting a random optimization algorithm and establishing an initial antenna array model to be optimized; step 2, designing a fitness function; step 3, writing VB scripts in MATLAB; step 4, creating a plurality of HFSS threads to process the VB scripts in parallel and obtain the results corresponding to the random particles; step 5, generating multiple groups of xt by iteration; step 6, using the multiple groups of xt to obtain the optimal solutions of all fitness functions, which serve as a training set; step 7, designing a Q-Learning algorithm and training it on the training set to obtain a Q table; and step 8, inputting the new cache reward and the Q table into the Q-Learning algorithm to obtain the optimal result for the antenna array to be designed. The method requires no additional human intervention and relies entirely on the reinforcement learning agent to judge the antenna optimization result, so that its overall efficiency is improved by more than 50% compared with the traditional antenna optimization design approach.

Description

Antenna array design method based on reinforcement learning and random optimization algorithm
Technical Field
The invention belongs to the field of antenna array optimization and relates to an antenna array design method based on reinforcement learning and a random optimization algorithm.
Background
With the development of wireless communication technology, communication devices have entered countless households and become part of daily life. The antenna is a key component of a wireless communication system, and its performance directly affects the transmission quality of the whole system. A single antenna can provide basic electromagnetic radiation and wireless signal transmission, but it falls short when specific functions or a large performance improvement are required. An array antenna compensates for these shortcomings: it greatly increases the radiation gain and enables functions such as beamforming that are needed in specific scenarios.
The most obvious advantages of an antenna array are its high gain and directivity. In a point-to-point communication system the antenna is often required to be strongly directional, that is, to concentrate most of its energy toward one point, so that directivity and high gain are achieved together. Directivity is adjusted through the phase of the antenna elements. For an array, phase adjustment involves many parameters, and when the array has many elements or the optimization target is complex, manual tuning is too slow. Random optimization algorithms such as the particle swarm algorithm, the simulated annealing algorithm and the differential evolution algorithm are therefore commonly used to optimize some of the antenna parameters.
Algorithmic parameter optimization is much faster than manual tuning. However, whichever random optimization algorithm is used, the optimization target of the antenna is difficult to capture in a single well-chosen fitness function. The designer must therefore inspect the result of each optimization run and modify the fitness function parameters repeatedly, and every run with a modified fitness function takes several hours or even tens of hours. For multi-parameter, multi-objective optimization in particular, the large number of variables means that many manual attempts may be needed before a good result is obtained. Because human intervention is continually required to adjust the fitness function, the optimization time is long and unpredictable, and the optimization efficiency is low.
Disclosure of Invention
To address the low efficiency of array optimization design with traditional optimization algorithms, the invention provides an antenna array design method based on reinforcement learning and a random optimization algorithm.
To solve the above technical problem, the invention adopts the following technical scheme:
An antenna array design method based on reinforcement learning and a random optimization algorithm specifically comprises the following steps:
step 1, selecting a random optimization algorithm, establishing an initial antenna array model to be optimized in HFSS (High Frequency Structure Simulator), and exporting the corresponding radiation and scattering results x from simulation;
step 2, designing a fitness function $f(x) = a_1 f_1 + a_2 f_2 + \dots + a_n f_n$ and, so that the magnitudes of $f_1$ to $f_n$ are matched, determining the values of $a_1$ to $a_n$ as $[a_{1,1}, \dots, a_{1,k}]$, $[a_{2,1}, \dots, a_{2,k}]$, …, $[a_{n,1}, \dots, a_{n,k}]$ respectively;
where x comprises the radiation pattern $x_1$ and the scattering pattern $x_2$ of the array; $f_1$ to $f_n$ are different evaluation functions of x; $a_1$ to $a_n$ are the coefficients of the respective evaluation functions, giving $k^n$ fitness functions in total;
step 3, writing the selected random optimization algorithm in MATLAB, and writing a plurality of VB scripts according to the initial random particles of the random optimization algorithm; then executing step 4.2;
step 4, creating a plurality of HFSS threads to process the VB scripts in parallel and obtain a group of radiation and scattering results xt corresponding to the random particles, specifically comprising the following sub-steps:
step 4.1, writing a plurality of VB scripts according to the random particles of the random optimization algorithm;
step 4.2, creating a plurality of HFSS threads using the parallel computing function of MATLAB, processing the VB scripts with the HFSS threads, and obtaining the radiation and scattering results xt exported by each HFSS thread, where each xt corresponds to one of the random particles, so that a group of xt in one-to-one correspondence with the current random particles is obtained;
step 5, randomly selecting one fitness function f(x) from the $k^n$ fitness functions, substituting each xt obtained in step 4 into the selected f(x) to obtain a new group of xt and hence the group of random particles corresponding to it, taking these new random particles as the random particles of the random optimization algorithm, and returning to step 4 iteratively until the current generation of random particles meets the convergence condition, finally obtaining the multiple groups of xt generated during the iteration;
step 6, substituting the multiple groups of new xt, as x, into every fitness function obtained in step 2, and taking the minimum value of each fitness function as the optimal solution of that fitness function, thereby obtaining $k^n$ optimal solutions in total, which are used as the training set;
step 7, designing a Q-Learning algorithm and training it on the training set to obtain the agent's optimization strategy Q table;
step 8, taking the fitness function corresponding to a state s at a random position in the Q table and using it in the random optimization algorithm; optimizing the array to be designed with the random optimization algorithm to obtain the optimized radiation and scattering results x′ exported from HFSS; using the difference between x′ and the target beam to obtain a new cache reward tr′ for state s; and inputting the new cache reward tr′ and the Q table into the Q-Learning algorithm to obtain the optimal result for the antenna array to be designed.
Further, in the step 1, the random optimization algorithm is a particle swarm algorithm, a simulated annealing algorithm or a differential evolution algorithm.
Further, the step 7 specifically includes the following sub-steps:
step 7.1, determining the state s, the action a and the reward r in the Q-Learning algorithm; where the state s is one of the $k^n$ optimal solutions obtained in step 6, the action a is a transition of the state s, and the action a comprises 2 (k-1) n different actions;
generating the cache reward tr: taking the difference between each item in the set of states s and the target beam to obtain all cache reward values tr, and normalizing all the cache reward values tr;
setting the reward r: finding the minimum of the normalized tr and setting the reward r of every action a that transitions to the state with the minimum tr to 100; comparing the tr of the states before and after every action a, and setting the reward r to +1 if the tr after the transition is smaller and to -1 otherwise; at the same time, setting the reward r of every action a that points to a state differing greatly from the target beam to -50;
step 7.2, starting from the initial position, the agent iteratively searches for the optimal solution path using the state s, the action a and the reward r determined in step 7.1, yielding the agent's optimization strategy Q table from the Q-Learning algorithm.
Further, in step 7.1, the generating function of the target beam is as follows:
[Equation image RE-GDA0003063554790000041 not reproduced in the text: the generating function of the target beam.]
In the formula, target is the pointing angle required by the design, and any angle between 45 degrees and 135 degrees may be substituted into the function; deg is the angle in degrees, taking values from 0 to 180 with a step of 1, and substituting all of these values into the function yields the target beam.
Further, in step 8, the antenna array to be designed has a similar arrangement to the initial antenna array in step 1.
Further, a similar arrangement means an arrangement similar to that of the initial antenna array, including: (1) the number of antennas in either the transverse or the longitudinal direction is unchanged and only the number in the other dimension is modified; (2) different array elements are used while the arrangement of the antenna array is unchanged.
Compared with the prior art, the method uses reinforcement learning to replace human decision-making in the antenna array optimization process. The fitness function parameters that previously required human intervention are automatically driven to relatively optimal values through reinforcement learning, so the whole antenna design process needs no additional human intervention and the antenna optimization result is judged entirely by the reinforcement learning agent. Experiments show that, compared with the traditional antenna optimization design approach, the overall efficiency of the method is improved by more than 50%.
Drawings
FIG. 1 is a flow chart of the method of the present invention.
Fig. 2 shows the original 2 x 4 antenna array model and parameters in an embodiment.
FIG. 3 is a schematic diagram of the 2 × 6 array model to be optimized, after optimization.
Fig. 4 is a schematic diagram illustrating the consistency between the optimized result of the array model to be optimized and the designed target beam in the embodiment.
FIG. 5 is a comparison of the time consumption of the conventional optimization method and the reinforcement learning method of the present invention.
The present invention will be explained in further detail with reference to examples.
Detailed Description
The invention provides an antenna array design method based on reinforcement learning and a random optimization algorithm, comprising the following steps:
Step 1, selecting a random optimization algorithm, establishing an initial antenna array model to be optimized in HFSS (High Frequency Structure Simulator), and exporting the corresponding radiation and scattering results x from simulation;
The random optimization algorithm can be any conventional random optimization algorithm, such as the particle swarm algorithm, the simulated annealing algorithm or the differential evolution algorithm.
Step 2, designing a fitness function $f(x) = a_1 f_1 + a_2 f_2 + \dots + a_n f_n$ and, so that the magnitudes of $f_1$ to $f_n$ are matched, determining the values of $a_1$ to $a_n$ as $[a_{1,1}, \dots, a_{1,k}]$, $[a_{2,1}, \dots, a_{2,k}]$, …, $[a_{n,1}, \dots, a_{n,k}]$ respectively;
Where x comprises the radiation pattern $x_1$ and the scattering pattern $x_2$ of the array; $f_1$ to $f_n$ are different evaluation functions of x, such as the maximum peak of the main lobe, the mean value of the side lobes, and the width of the main lobe; $a_1$ to $a_n$ are the coefficients of the respective evaluation functions. The coefficients balance the magnitudes of the different evaluation functions and change their weight within the overall fitness function. The different combinations of coefficient values therefore give $k^n$ fitness functions in total, and each fitness function leads the random optimization algorithm to a different optimal result.
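To illustrate how the $k^n$ fitness functions arise from the coefficient combinations, the following Python sketch enumerates one weighted-sum function per combination. It is a minimal sketch only: the two evaluation functions and the coefficient values are hypothetical placeholders, not values taken from the patent, and x is simplified to a plain list of pattern samples.

```python
from itertools import product

def build_fitness_functions(evaluators, coefficient_values):
    """Return one (coeffs, f) pair per combination of coefficient values.

    evaluators: list of n evaluation functions f_i(x).
    coefficient_values: list of n lists, each holding the k candidate
    values of one coefficient a_i.
    """
    functions = []
    for coeffs in product(*coefficient_values):
        # Bind coeffs as a default argument so each closure keeps its own copy.
        def f(x, coeffs=coeffs):
            return sum(a * fi(x) for a, fi in zip(coeffs, evaluators))
        functions.append((coeffs, f))
    return functions

# Hypothetical evaluation functions acting on a sampled pattern.
evaluators = [
    lambda x: -max(x),           # reward a high peak (lower fitness is better)
    lambda x: sum(x) / len(x),   # penalise a high average level
]
coefficient_values = [[0.1, 0.3, 0.5], [1.0, 2.0, 3.0]]  # n = 2, k = 3

fitness_functions = build_fitness_functions(evaluators, coefficient_values)
```

With n = 2 coefficients and k = 3 candidate values each, the list holds 3² = 9 fitness functions, one per coefficient combination.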
Step 3, writing the selected random optimization algorithm in MATLAB, and writing a plurality of VB scripts according to the initial random particles of the random optimization algorithm (the number of VB scripts equals the number of initial random particles). The VB scripts control HFSS to adjust the parameters of the initial antenna array model and export the results. Step 4.2 is then executed;
Step 4, creating a plurality of HFSS threads to process the VB scripts in parallel and obtain a group of radiation and scattering results xt corresponding to the random particles. The specific sub-steps are as follows:
Step 4.1, writing a plurality of VB scripts according to the random particles of the random optimization algorithm. The VB scripts control HFSS to adjust the parameters of the initial antenna array model and export the results (the number of VB scripts equals the number of random particles);
Step 4.2, creating a plurality of HFSS threads using the parallel computing function of MATLAB (the number of HFSS threads is determined by the processing capacity of the machine's CPU), processing the VB scripts with the HFSS threads, and obtaining the radiation and scattering results xt exported by each HFSS thread, where each xt corresponds to one of the random particles, so that a group of xt in one-to-one correspondence with the current random particles is obtained.
This step runs several solving processes in parallel, so each generation of the algorithm is solved faster and the overall optimization converges sooner.
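The parallel dispatch in step 4.2 can be sketched as follows. The patent uses MATLAB's parallel computing function to drive HFSS; this Python sketch swaps in a thread pool and a placeholder `run_simulation` function (an assumption that returns a synthetic pattern, not a real HFSS call), simply to show how one result per particle is collected in particle order.

```python
import math
from concurrent.futures import ThreadPoolExecutor

def run_simulation(particle):
    """Placeholder for one solver run (hypothetical, no HFSS involved).

    Maps a parameter vector to a synthetic pattern sampled at 0..180 degrees.
    """
    shift = sum(particle)
    return [math.cos(math.radians(deg) - shift) ** 2 for deg in range(0, 181)]

def evaluate_generation(particles, max_workers=4):
    """Run one simulation per particle in parallel.

    Executor.map returns results in input order, so results[i] always
    belongs to particles[i], giving the one-to-one correspondence.
    """
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        return list(pool.map(run_simulation, particles))

particles = [[0.1, 0.2], [0.0, 0.0], [1.0, -0.5]]
results = evaluate_generation(particles, max_workers=2)
```

In practice `max_workers` would be chosen according to the CPU capacity of the machine, as the text notes for the number of HFSS threads.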
Step 5, randomly selecting one fitness function f(x) from the $k^n$ fitness functions, substituting each xt obtained in step 4 into the selected f(x), and screening and transforming the calculation results in the conventional way to obtain a new group of xt and hence the group of random particles corresponding to it (the screening and transformation are determined by the selected random optimization algorithm); taking these new random particles as the random particles of the random optimization algorithm, and returning to step 4 iteratively until the current random particles meet the convergence condition, finally obtaining the multiple groups of xt generated during the iteration;
Step 6, substituting the multiple groups of new xt, as x, into every fitness function obtained in step 2, and taking the minimum value of each fitness function as the optimal solution of that fitness function, thereby obtaining $k^n$ optimal solutions in total; these $k^n$ optimal solutions are used as the training set for reinforcement learning;
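Step 6 reuses the stored results rather than re-running the optimizer for each fitness function. The sketch below shows one way this re-scoring could look, assuming the stored xt are available as a list called `archive` and the fitness functions are weighted sums as in step 2; the evaluation functions and coefficient values are hypothetical placeholders.

```python
from itertools import product

def build_training_set(archive, evaluators, coefficient_values):
    """For every coefficient combination, pick the stored result with minimum fitness.

    archive: list of stored results xt collected during the iterations.
    Returns a dict mapping each coefficient tuple to (best_result, best_score).
    """
    # Evaluate every f_i once per stored result; the weighted sums reuse these values.
    features = [[f(x) for f in evaluators] for x in archive]
    training_set = {}
    for coeffs in product(*coefficient_values):
        scores = [sum(a * v for a, v in zip(coeffs, row)) for row in features]
        best = min(range(len(archive)), key=scores.__getitem__)
        training_set[coeffs] = (archive[best], scores[best])
    return training_set

# Hypothetical stored results and evaluation functions.
archive = [[1.0, 4.0], [3.0, 3.0]]
evaluators = [lambda x: -max(x), lambda x: abs(x[0] - x[1])]
coefficient_values = [[1.0, 2.0], [0.0, 1.0]]  # n = 2, k = 2

training_set = build_training_set(archive, evaluators, coefficient_values)
```

Because no new simulation is needed, the cost of this step grows with the number of stored results and fitness functions, not with the solver run time.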
Step 7, designing a Q-Learning algorithm and training it on the training set to obtain the agent's optimization strategy Q table. This comprises the following sub-steps:
Step 7.1, determining the state s, the action a and the reward r in the Q-Learning algorithm; where the state s is one of the $k^n$ optimal solutions obtained in step 6, the action a is a transition of the state s, namely a change in the coefficients of the fitness function, and the action a comprises 2 (k-1) n different actions;
The function g(deg) that generates the target beam is:
[Equation image RE-GDA0003063554790000071 not reproduced in the text: the generating function g(deg) of the target beam.]
In the formula, target is the pointing angle required by the design, and any angle between 45 degrees and 135 degrees may be substituted into the function; deg is the angle in degrees, taking values from 0 to 180 with a step of 1, and substituting all of these values into the function yields the target beam trans;
Generating the cache reward tr: taking the difference between each item in the set of states s and the target beam to obtain all cache reward values tr, and normalizing all the cache reward values tr;
Setting the reward r: finding the minimum of the normalized tr and setting the reward r of every action a that transitions to the state with the minimum tr to 100; comparing the tr of the states before and after every action a, and setting the reward r to +1 if the tr after the transition is smaller and to -1 otherwise; at the same time, setting the reward r of every action a that points to a state differing greatly from the target beam (tr greater than 0.7 in the embodiment) to -50, so that the algorithm avoids changing the fitness function to that state.
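The cache reward and reward rules of step 7.1 can be sketched in Python as below. Several details are assumptions made for illustration: the "difference" is taken as a sum of absolute differences, the normalization is min-max scaling to [0, 1], and the three reward rules are applied in the order 100, then -50, then +1/-1. The text fixes only the reward values and the 0.7 threshold.

```python
def cache_rewards(state_patterns, target_beam):
    """Distance of each state's pattern to the target beam, scaled to [0, 1].

    Assumptions: sum of absolute differences, min-max normalisation.
    """
    raw = [sum(abs(p - t) for p, t in zip(pattern, target_beam))
           for pattern in state_patterns]
    lo, hi = min(raw), max(raw)
    span = (hi - lo) or 1.0  # avoid division by zero when all values are equal
    return [(v - lo) / span for v in raw]

def reward(tr, src, dst, bad_threshold=0.7):
    """Reward for the action that moves from state index src to state index dst."""
    best = min(range(len(tr)), key=tr.__getitem__)
    if dst == best:
        return 100               # transition into the state with minimum tr
    if tr[dst] > bad_threshold:
        return -50               # destination differs greatly from the target beam
    return 1 if tr[dst] < tr[src] else -1

# Tiny hypothetical example: three states, patterns sampled at three angles.
target_beam = [0.0, 1.0, 0.0]
state_patterns = [[0.0, 1.0, 0.0], [0.0, 0.5, 0.0], [1.0, 0.0, 1.0]]
tr = cache_rewards(state_patterns, target_beam)
```

Here state 0 matches the target exactly, so any action leading to it earns 100, while actions leading to state 2 (normalized tr of 1.0) earn -50.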
Step 7.2, starting from the initial position, the agent iteratively searches for the optimal solution path using the state s, the action a and the reward r determined in step 7.1, yielding the agent's optimization strategy Q table from the Q-Learning algorithm;
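A tabular Q-Learning loop for step 7.2 might look like the following Python sketch. It makes several simplifying assumptions that are not in the patent: a state is represented as a tuple of n coefficient indices, each action raises or lowers one index by one (a reduced action set with 2n actions), and the learning rate, discount factor, exploration rate and episode counts are arbitrary illustrative values. The mapping `tr` from state to normalized cache reward is supplied by the caller.

```python
import random
from itertools import product

def move(state, action, k):
    """Apply (coefficient index, +1/-1) to a state, clamped to the range 0..k-1."""
    i, d = action
    idx = min(max(state[i] + d, 0), k - 1)
    return state[:i] + (idx,) + state[i + 1:]

def train_q_table(tr, n, k, episodes=2000, steps=30,
                  alpha=0.5, gamma=0.9, epsilon=0.2, seed=0):
    """Tabular Q-Learning over coefficient-index states (simplified action set)."""
    rng = random.Random(seed)
    states = list(product(range(k), repeat=n))
    actions = [(i, d) for i in range(n) for d in (-1, 1)]
    best = min(states, key=lambda s: tr[s])
    starts = [s for s in states if s != best]
    q = {s: [0.0] * len(actions) for s in states}

    for _ in range(episodes):
        s = rng.choice(starts)
        for _ in range(steps):
            if rng.random() < epsilon:          # explore
                a = rng.randrange(len(actions))
            else:                               # exploit current estimates
                a = max(range(len(actions)), key=lambda j: q[s][j])
            nxt = move(s, actions[a], k)
            if nxt == best:
                r = 100
            elif tr[nxt] > 0.7:
                r = -50
            else:
                r = 1 if tr[nxt] < tr[s] else -1
            # The best state is treated as terminal, so no bootstrap from it.
            target = r if nxt == best else r + gamma * max(q[nxt])
            q[s][a] += alpha * (target - q[s][a])
            s = nxt
            if s == best:
                break
    return q, actions, best

def greedy_path(q, actions, k, start, goal, max_steps=20):
    """Follow the highest-valued action from start until goal or max_steps."""
    path, s = [start], start
    while s != goal and len(path) <= max_steps:
        a = max(range(len(actions)), key=lambda j: q[s][j])
        s = move(s, actions[a], k)
        path.append(s)
    return path
```

After training, `greedy_path` reads the learned policy out of the Q table by following the highest-valued action from a chosen start state.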
Step 8, taking the fitness function corresponding to a state s at a random position in the Q table and using it in the random optimization algorithm; optimizing the array to be designed with the random optimization algorithm to obtain the optimized radiation and scattering results x′ exported from HFSS; using the difference between x′ and the target beam to obtain a new cache reward tr′ for state s; and inputting the new cache reward tr′ and the Q table into the Q-Learning algorithm to obtain the optimal result for the antenna array to be designed.
The antenna array to be designed has an arrangement similar to that of the initial antenna array, which includes the following cases:
(1) The number of antennas in either the transverse or the longitudinal direction is unchanged and only the number in the other dimension is modified. For example, a method trained on a 2 × 4 original array can be deployed and optimized on 2 × 6, 2 × 8 and similar arrays;
(2) Different array elements are used while the arrangement of the antenna array is unchanged. For example, the elements of the original array are microstrip antennas while the elements of the antenna to be designed are slot antennas.
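The deployment described in step 8 can be sketched as a loop that follows the trained Q table on the new array and refines it with freshly measured cache rewards. In the Python sketch below, `measure_tr` is a hypothetical callable standing in for one optimizer-plus-simulation run on the array to be designed; the state encoding (a tuple of coefficient indices), the +1/-1 online reward and the stopping threshold `good_enough` are likewise assumptions for illustration.

```python
def move(state, action, k):
    """Apply (coefficient index, +1/-1) to a state, clamped to the range 0..k-1."""
    i, d = action
    idx = min(max(state[i] + d, 0), k - 1)
    return state[:i] + (idx,) + state[i + 1:]

def deploy(q, actions, k, measure_tr, start, max_steps=20,
           alpha=0.5, gamma=0.9, good_enough=0.05):
    """Follow the Q table greedily on a new array, updating it with measured tr'."""
    cache = {}

    def tr_of(state):
        # Each state is measured at most once; a real run would be expensive.
        if state not in cache:
            cache[state] = measure_tr(state)
        return cache[state]

    s = best = start
    for _ in range(max_steps):
        if tr_of(s) <= good_enough:
            break
        a = max(range(len(actions)), key=lambda j: q[s][j])
        nxt = move(s, actions[a], k)
        r = 1 if tr_of(nxt) < tr_of(s) else -1
        q[s][a] += alpha * (r + gamma * max(q[nxt]) - q[s][a])
        s = nxt
        if tr_of(s) < tr_of(best):
            best = s
    return best, tr_of(best), len(cache)

# Hypothetical one-coefficient example with three candidate values.
actions = [(0, -1), (0, 1)]
q = {(0,): [0.0, 1.0], (1,): [0.0, 1.0], (2,): [0.0, 0.0]}
best_state, best_tr, evaluations = deploy(
    q, actions, 3, lambda s: (2 - s[0]) / 2, start=(0,))
```

Caching the measured values keeps the number of expensive evaluations equal to the number of distinct states visited, which is the point of reusing a Q table trained on a similar array.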
Example 1:
Step 1, according to the specific design requirements of the communication system (such as limits on the type and number of antenna elements and on the overall size), using HFSS to establish the initial antenna array model to be optimized.
Taking a 2 × 4 microstrip patch antenna array as an example, the parameters of each array element include the lengths of its impedance line and delay line. In this embodiment, the antenna spacings $d_1$ to $d_{10}$ (6 transverse and 4 longitudinal element spacings), the impedance line lengths $il_1$ to $il_8$ of the eight antenna elements, and the delay line lengths $dl_1$ to $dl_8$ are all set as variables in HFSS, so that they can subsequently be modified by scripts generated in MATLAB.
Step 2, designing the fitness function of the optimization algorithm, $f(x) = a_1 f_1 + a_2 f_2 + \dots + a_n f_n$, with the value ranges of $a_1$ to $a_n$ being $[a_{1,1}, \dots, a_{1,5}]$, $[a_{2,1}, \dots, a_{2,5}]$, …, $[a_{n,1}, \dots, a_{n,5}]$.
For the 2 × 4 microstrip patch array antenna, the optimization objective is to increase the scattering intensity and shift the radiation pattern. The fitness function is designed as follows:
$f(x) = -0.3 \cdot \max(x_2) + 1 \cdot \mathrm{len}(x_1) + 50 \cdot \mathrm{std}(\mathrm{crest}(x_1)) - 0.3 \cdot \max(x_1) + 3 \cdot \mathrm{diff}(x_1 - \mathrm{target})$, where max() takes the maximum value, len() takes the width of the main lobe, std(crest()) takes the variance of the side-lobe peaks and valleys, and diff(x_1 - target) is the difference between the main-lobe direction and the design direction.
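The example fitness function can be written out in Python as a rough sketch. The coefficients come from the formula above, but the helper definitions are assumptions, since the text does not define them precisely: the main-lobe width is taken as the number of contiguous 1-degree samples within 3 dB of the peak, `crest` collects strict local extrema other than the main peak, and the spread is computed as a population standard deviation. Patterns are assumed to be lists of dB values sampled at 0 to 180 degrees.

```python
import statistics

def main_lobe_direction(pattern):
    """Index (in degrees, for 1-degree sampling) of the strongest sample."""
    return max(range(len(pattern)), key=pattern.__getitem__)

def main_lobe_width(pattern, drop=3.0):
    """Number of contiguous samples around the peak within `drop` dB of it."""
    peak = main_lobe_direction(pattern)
    level = pattern[peak] - drop
    left = peak
    while left > 0 and pattern[left - 1] >= level:
        left -= 1
    right = peak
    while right < len(pattern) - 1 and pattern[right + 1] >= level:
        right += 1
    return right - left + 1

def crest(pattern):
    """Values at strict local peaks and valleys, excluding the main-lobe peak."""
    peak = main_lobe_direction(pattern)
    return [pattern[i] for i in range(1, len(pattern) - 1)
            if i != peak
            and (pattern[i] - pattern[i - 1]) * (pattern[i + 1] - pattern[i]) < 0]

def fitness(x1, x2, target):
    """Weighted sum from the embodiment; lower values are better."""
    extrema = crest(x1)
    spread = statistics.pstdev(extrema) if len(extrema) > 1 else 0.0
    return (-0.3 * max(x2)
            + 1 * main_lobe_width(x1)
            + 50 * spread
            - 0.3 * max(x1)
            + 3 * abs(main_lobe_direction(x1) - target))

# Hypothetical patterns: a triangular main lobe at 90 degrees, flat scattering.
x1 = [-abs(deg - 90) * 1.0 for deg in range(181)]
x2 = [5.0] * 181
on_target = fitness(x1, x2, target=90)
off_target = fitness(x1, x2, target=100)
```

The pointing term dominates when the main lobe is off target: moving the design direction 10 degrees away from the lobe adds 30 to the fitness value in this example.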
Step 3, selecting a random optimization algorithm as the basis for learning. Taking the differential evolution algorithm as an example, the algorithm is written in MATLAB with the number of random particles set to 50, the number of iterations set to 200, and the convergence condition set to the difference between the optimal values of two successive generations being less than 0.002.
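A minimal differential evolution loop with these settings is sketched below in Python (the patent writes the algorithm in MATLAB). The population size, iteration limit and convergence tolerance follow the embodiment; the mutation weight, crossover rate, DE/rand/1/bin variant and the test objective are assumptions, and the objective stands in for a fitness value computed from an HFSS result. Every evaluated candidate is kept in `archive`, mirroring the storage of intermediate results.

```python
import random

def differential_evolution(objective, bounds, pop_size=50, max_iter=200,
                           tol=0.002, f_weight=0.5, cr=0.9, seed=0):
    """DE/rand/1/bin that stops when the best score changes by less than tol."""
    rng = random.Random(seed)
    dim = len(bounds)
    pop = [[rng.uniform(lo, hi) for lo, hi in bounds] for _ in range(pop_size)]
    scores = [objective(p) for p in pop]
    archive = list(zip(pop, scores))  # every evaluated candidate is stored
    best_prev = min(scores)

    for _ in range(max_iter):
        for i in range(pop_size):
            a, b, c = rng.sample([j for j in range(pop_size) if j != i], 3)
            j_rand = rng.randrange(dim)  # guarantees at least one mutated gene
            trial = []
            for j, (lo, hi) in enumerate(bounds):
                if rng.random() < cr or j == j_rand:
                    v = pop[a][j] + f_weight * (pop[b][j] - pop[c][j])
                    v = min(max(v, lo), hi)  # keep the candidate inside bounds
                else:
                    v = pop[i][j]
                trial.append(v)
            s = objective(trial)
            archive.append((trial, s))
            if s <= scores[i]:           # greedy selection
                pop[i], scores[i] = trial, s
        best_now = min(scores)
        if abs(best_prev - best_now) < tol:  # convergence condition from the text
            break
        best_prev = best_now

    bi = min(range(pop_size), key=scores.__getitem__)
    return pop[bi], scores[bi], archive

# Hypothetical objective: a simple quadratic bowl in three parameters.
best, best_score, archive = differential_evolution(
    lambda p: sum(v * v for v in p), [(-5.0, 5.0)] * 3)
```

Read literally, the convergence condition also triggers when a generation brings no improvement at all, so a practical implementation might require several stagnant generations before stopping.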
Step 4, creating a plurality of HFSS threads to process the VB scripts in parallel and obtain a group of radiation and scattering results xt corresponding to the random particles.
In steps 5 and 6, because the number of variants of f(x) is very large, traversing and solving every generated fitness function would take a great deal of time. A modified solving procedure is therefore used to generate the training set: one fitness function is selected at random and solved, and all intermediate results xt are stored; the optimal solutions corresponding to all fitness functions are then generated from the stored results. Finally, the $k^n$ optimal solutions are used as the training set for the subsequent reinforcement learning.
Step 7, designing a Q-Learning algorithm and training it on the training set to obtain the agent's optimization strategy Q table. Specifically, the state s, the action a and the reward r are determined; when setting the reward r, a function is used to generate the target beam, shown as the dotted line in Fig. 4, and the difference between the actual result and this target gives the cache reward tr.
Step 8, applying the Q table to the simulated design of the antenna array, so that the agent optimizes the antenna automatically according to the Q table values and the rewards actually obtained. In particular, the Q table obtained by training on the 2 × 4 array can be deployed and optimized on similar arrays such as 2 × 6 and 2 × 8.
Fig. 4 shows the simulation result (solid line) obtained by applying the Q table trained on the 2 × 4 array to the 2 × 6 microstrip array shown in Fig. 3; the dotted line is the target beam of the design. As Fig. 4 shows, the solid and dotted lines coincide closely, so the design requirements are substantially met. The design times in Fig. 5 show that optimizing the antenna array with the reinforcement learning method of the invention improves the overall optimization speed by 50%. In addition, the overall optimization speed is improved by 70% when the Q table trained on the 2 × 4 array is applied to the 2 × 8 array.

Claims (3)

1. An antenna array design method based on reinforcement learning and random optimization algorithm is characterized by comprising the following steps:
step 1, selecting a random optimization algorithm, establishing an initial antenna array model to be optimized in HFSS (High Frequency Structure Simulator), and exporting the corresponding radiation and scattering results x from simulation;
step 2, designing a fitness function $f(x) = a_1 f_1 + a_2 f_2 + \dots + a_n f_n$ and, so that the magnitudes of $f_1$ to $f_n$ are matched, determining the values of $a_1$ to $a_n$ as $[a_{1,1}, \dots, a_{1,k}]$, $[a_{2,1}, \dots, a_{2,k}]$, …, $[a_{n,1}, \dots, a_{n,k}]$ respectively;
where x comprises the radiation pattern $x_1$ and the scattering pattern $x_2$ of the array; $f_1$ to $f_n$ are different evaluation functions of x; $a_1$ to $a_n$ are the coefficients of the respective evaluation functions, giving $k^n$ fitness functions in total;
step 3, writing the selected random optimization algorithm in MATLAB, and writing a plurality of VB scripts according to the initial random particles of the random optimization algorithm; then executing step 4.2;
step 4, creating a plurality of HFSS threads to process the VB scripts in parallel and obtain a group of radiation and scattering results xt corresponding to the random particles, specifically comprising the following sub-steps:
step 4.1, writing a plurality of VB scripts according to the random particles of the random optimization algorithm;
step 4.2, creating a plurality of HFSS threads using the parallel computing function of MATLAB, processing the VB scripts with the HFSS threads, and obtaining the radiation and scattering results xt exported by each HFSS thread, where each xt corresponds to one of the random particles, so that a group of xt in one-to-one correspondence with the current random particles is obtained;
step 5, randomly selecting one fitness function f(x) from the $k^n$ fitness functions, substituting each xt obtained in step 4 into the selected f(x) to obtain a new group of xt and hence the group of random particles corresponding to it, taking these new random particles as the random particles of the random optimization algorithm, and returning to step 4 iteratively until the current generation of random particles meets the convergence condition, finally obtaining the multiple groups of xt generated during the iteration;
step 6, substituting the multiple groups of new xt, as x, into every fitness function obtained in step 2, and taking the minimum value of each fitness function as the optimal solution of that fitness function, thereby obtaining $k^n$ optimal solutions in total, which are used as the training set;
step 7, designing a Q-Learning algorithm and training it on the training set to obtain the agent's optimization strategy Q table;
step 7 specifically comprises the following sub-steps:
step 7.1, determining the state s, the action a and the reward r in the Q-Learning algorithm; where the state s is one of the $k^n$ optimal solutions obtained in step 6, the action a is a transition of the state s, and the action a comprises 2 (k-1) n different actions;
generating the cache reward tr: taking the difference between each item in the set of states s and the target beam to obtain all cache reward values tr, and normalizing all the cache reward values tr;
setting the reward r: finding the minimum of the normalized tr and setting the reward r of every action a that transitions to the state with the minimum tr to 100; comparing the tr of the states before and after every action a, and setting the reward r to +1 if the tr after the transition is smaller and to -1 otherwise; at the same time, setting the reward r of every action a that points to a state differing greatly from the target beam to -50;
a state differing greatly from the target beam means a state whose generated cache reward tr is greater than 0.7;
the generating function of the target beam is as follows:
[Equation image FDA0003960913230000021 not reproduced in the text: the generating function of the target beam.]
in the formula, target is the pointing angle required by the design, and any angle between 45 degrees and 135 degrees may be substituted into the function; deg is the angle in degrees, taking values from 0 to 180 with a step of 1, and substituting all of these values into the function yields the target beam;
step 7.2, starting from the initial position, the agent iteratively searches for the optimal solution path using the state s, the action a and the reward r determined in step 7.1, yielding the agent's optimization strategy Q table from the Q-Learning algorithm;
step 8, taking the fitness function corresponding to a state s at a random position in the Q table and using it in the random optimization algorithm; optimizing the array to be designed with the random optimization algorithm to obtain the optimized radiation and scattering results x′ exported from HFSS; using the difference between x′ and the target beam to obtain a new cache reward tr′ for state s; and inputting the new cache reward tr′ and the Q table into the Q-Learning algorithm to obtain the optimal result for the antenna array to be designed.
2. The method for designing an antenna array based on reinforcement learning and random optimization algorithm according to claim 1, wherein in the step 1, the random optimization algorithm is a particle swarm algorithm, a simulated annealing algorithm or a differential evolution algorithm.
3. The method according to claim 1, wherein in step 8, the antenna array to be designed has an arrangement similar to that of the initial antenna array in step 1;
the similar arrangement means one of the following: (1) the number of antennas in the transverse or longitudinal direction is unchanged, and only the number in the other dimension is modified; (2) different array elements are used while the arrangement of the antenna array is unchanged.
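The reward rule of step 7.1 and the table update of step 7.2 can be illustrated with a minimal Python sketch. It is not the patented implementation. The one-dimensional chain of states, the two actions (move down or up by one state), the learning parameters alpha, gamma and epsilon, the random start state of each episode, and the order in which the three reward rules are applied (100 first, then -50, then +1/-1) are all assumptions made only for illustration; the claims do not fix them.

```python
import random

def reward(tr, s, s_next, bad_threshold=0.7):
    """Reward for moving from state s to s_next (rule of step 7.1).

    tr holds the normalized cache reward of every state. The precedence
    of the three rules below is an assumption of this sketch.
    """
    if tr[s_next] == min(tr):          # action reaches the minimum-tr state
        return 100
    if tr[s_next] > bad_threshold:     # state differs greatly from the target beam
        return -50
    return 1 if tr[s_next] < tr[s] else -1

def step(s, a, n_states):
    """Hypothetical transition: action 0 moves down, action 1 moves up (clipped)."""
    return max(0, min(n_states - 1, s + (1 if a == 1 else -1)))

def greedy_action(q, s):
    """Action with the largest Q value in state s (ties go to action 0)."""
    return 0 if q[s][0] >= q[s][1] else 1

def q_learning(tr, episodes=2000, alpha=0.1, gamma=0.9, epsilon=0.2,
               max_steps=50, seed=0):
    """Tabular Q-Learning over the toy chain; returns the Q table."""
    rng = random.Random(seed)
    n = len(tr)
    goal = tr.index(min(tr))
    q = [[0.0, 0.0] for _ in range(n)]
    for _ in range(episodes):
        s = rng.randrange(n)           # assumed: random start for state coverage
        for _ in range(max_steps):
            if s == goal:
                break
            # epsilon-greedy action selection
            a = rng.randrange(2) if rng.random() < epsilon else greedy_action(q, s)
            s_next = step(s, a, n)
            r = reward(tr, s, s_next)
            # standard Q-Learning update
            q[s][a] += alpha * (r + gamma * max(q[s_next]) - q[s][a])
            s = s_next
    return q

if __name__ == "__main__":
    tr = [0.9, 0.6, 0.4, 0.1, 0.0, 0.3, 0.5]   # made-up normalized tr values
    q = q_learning(tr)
    print([greedy_action(q, s) for s in range(len(tr))])
```

In this toy setting the learned greedy policy should steer every state toward the state with the smallest tr. In the method itself the states, actions and tr values come from the earlier steps and the HFSS results rather than from a hand-written list.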
CN202110284035.2A 2021-03-17 2021-03-17 Antenna array design method based on reinforcement learning and random optimization algorithm Active CN113010986B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202110284035.2A CN113010986B (en) 2021-03-17 2021-03-17 Antenna array design method based on reinforcement learning and random optimization algorithm

Publications (2)

Publication Number Publication Date
CN113010986A (en) 2021-06-22
CN113010986B (en) 2023-02-14

Family

ID=76408828

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202110284035.2A Active CN113010986B (en) 2021-03-17 2021-03-17 Antenna array design method based on reinforcement learning and random optimization algorithm

Country Status (1)

Country Link
CN (1) CN113010986B (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114266199B (en) * 2021-12-24 2023-06-16 北部湾大学 Solving method and system for internal variables in steady-state operation of magnetic saturation synchronous generator

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104992000A (en) * 2015-06-18 2015-10-21 哈尔滨工业大学 Method for beam forming and beam pattern optimization based on L-shaped array antenna
CN110147590A (en) * 2019-04-30 2019-08-20 中国地质大学(武汉) A kind of helical antenna design method based on Adaptive evolution optimization algorithm
CN111199126A (en) * 2019-11-08 2020-05-26 江苏科技大学 Rapid optimization design method of patch antenna
CN111783301A (en) * 2020-07-02 2020-10-16 西北大学 Design method of anti-interference scattering communication system antenna array

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8776002B2 (en) * 2011-09-06 2014-07-08 Variable Z0, Ltd. Variable Z0 antenna device design system and method
US8942659B2 (en) * 2011-09-08 2015-01-27 Drexel University Method for selecting state of a reconfigurable antenna in a communication system via machine learning
CN109635343A (en) * 2018-11-14 2019-04-16 江苏科技大学 A kind of antenna fast optimal design method
CN112100811A (en) * 2020-08-13 2020-12-18 西北工业大学 Antenna array directional diagram synthesis method based on adaptive wind-driven optimization algorithm

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
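Paulo Victor Rodrigues Ferreira et al.; Multiobjective reinforcement learning for cognitive satellite communications using deep neural network ensembles; IEEE Journal on Selected Areas in Communications; 2018; Vol. 36, No. 5 *
Array antenna optimization based on two intelligent algorithms; Liu Jiayin; China Master's Theses Full-text Database, Information Science and Technology; 2020-02-15; Vol. 2020, No. 2; pp. I136-115 *
Fast multi-objective antenna design based on an improved back-propagation neural network surrogate model; Dong Jian et al.; Journal of Electronics & Information Technology; 2018-12-14; Vol. 2018, No. 11; pp. 2712-2719 *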

Also Published As

Publication number Publication date
CN113010986A (en) 2021-06-22

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant