CN114154383A - Multi-robot-source search scheme generation method and system based on cognitive search strategy - Google Patents

Multi-robot-source search scheme generation method and system based on cognitive search strategy Download PDF

Info

Publication number
CN114154383A
CN114154383A CN202111457545.1A CN202111457545A CN114154383A CN 114154383 A CN114154383 A CN 114154383A CN 202111457545 A CN202111457545 A CN 202111457545A CN 114154383 A CN114154383 A CN 114154383A
Authority
CN
China
Prior art keywords
source
robot
search
particle
sensor
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202111457545.1A
Other languages
Chinese (zh)
Inventor
陈彬
季雅泰
吕欣
赵勇
刘忠
王锐
王昊冉
何华
肖军浩
卢惠民
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
National University of Defense Technology
Original Assignee
National University of Defense Technology
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by National University of Defense Technology filed Critical National University of Defense Technology
Priority to CN202111457545.1A priority Critical patent/CN114154383A/en
Publication of CN114154383A publication Critical patent/CN114154383A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F30/00Computer-aided design [CAD]
    • G06F30/20Design optimisation, verification or simulation
    • G06F30/25Design optimisation, verification or simulation using particle-based methods
    • BPERFORMING OPERATIONS; TRANSPORTING
    • B25HAND TOOLS; PORTABLE POWER-DRIVEN TOOLS; MANIPULATORS
    • B25JMANIPULATORS; CHAMBERS PROVIDED WITH MANIPULATION DEVICES
    • B25J9/00Programme-controlled manipulators
    • B25J9/16Programme controls
    • BPERFORMING OPERATIONS; TRANSPORTING
    • B25HAND TOOLS; PORTABLE POWER-DRIVEN TOOLS; MANIPULATORS
    • B25JMANIPULATORS; CHAMBERS PROVIDED WITH MANIPULATION DEVICES
    • B25J9/00Programme-controlled manipulators
    • B25J9/16Programme controls
    • B25J9/1679Programme controls characterised by the tasks executed

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • Robotics (AREA)
  • Mechanical Engineering (AREA)
  • Computer Hardware Design (AREA)
  • Evolutionary Computation (AREA)
  • Geometry (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Manipulator (AREA)

Abstract

The invention discloses a multi-robot-source search scheme generation method and a multi-robot-source search scheme generation system based on a cognitive search strategy, wherein the method comprises the steps of establishing a gas leakage model and a detection model of a sensor; modeling a leakage source searching process based on a gas leakage model and a detection model of a sensor; representing iterative updating of source item parameter estimation in a leakage source searching process by using a particle filtering method, and establishing a source searching action scheme based on a cognitive search strategy; improving the obstacle avoidance function of the source searching action scheme aiming at the actual obstacle scene; aiming at the combination of the improved source searching action scheme and the multi-robot control technology, a multi-robot source searching action scheme based on a cognitive search strategy is generated. The invention can realize the gas leakage source search of multiple robots, takes the uncertainty in the source searching process into consideration, and controls multiple robots to cooperatively search the source in a scene with large spatial scale by utilizing the multi-robot cooperative algorithm, thereby effectively improving the source searching efficiency and the source searching accuracy.

Description

Multi-robot-source search scheme generation method and system based on cognitive search strategy
Technical Field
The invention relates to the field of robot control, in particular to a multi-robot-source search scheme generation method and system based on a cognitive search strategy, which are suitable for various gas source search scenes including the search of sources of hazardous chemical leakage accidents in chemical industrial parks, the search of gas leakage sources in residential buildings and the like.
Background
The occurrence of various dangerous gas leakage accidents can cause serious casualties and economic property loss. In order to maintain personnel safety and avoid property loss, the method for quickly searching the position of a leakage source and acquiring source item parameters becomes important work. Autonomous source search using a mobile device such as a robot equipped with various sensors is an effective method, and the robot moves to a leakage source step by step according to information acquired by the sensors until the leakage source is found. In the process, how to generate a specific action scheme to control the movement of the robot is an urgent target to be explored. For the problem of sensing and searching of gas leakage sources, there are mainly four methods that can support the generation of action schemes, namely gradient-based algorithms, bionics algorithms, probability and map-based algorithms, and information theory principle-based algorithms.
Pure gas concentration gradient-based algorithms are the most primitive implementation in this field. The method guides the robot to the position of the leakage source by using the concentration gradient, but because the distribution uncertainty of the gas in the diffusion process under the real environment is greatly influenced by factors such as turbulence and the like, the method is difficult to realize under the real scene. The bionics algorithm is realized by a method for researching animals to find odor sources and planning paths, and the method also needs stable concentration gradients and is mostly used for source searching experiments in small scenes. The probability and map-based algorithm models the position of the odor source into probability distribution, and through continuous observation, the probability distribution is changed into a Dirac function, so that the position of the source is determined. Such algorithms are relatively late and complex in mathematical calculations, but are widely studied for their effectiveness, such as searching for a source of gas diffusion indoors using particle filtering.
In order to solve the problem of source positioning in a turbulent environment, a cognitive search strategy based on the information theory principle is widely researched and applied. This strategy uses probabilistic estimation to locate scent sources while using information state based reward functions for sequential decisions in uncertain cases. In particular, the source search process may be described as a Partially Observable Markov Decision Process (POMDP) consisting of three elements: information state, action set, and reward function. Vergasola et al proposed the earliest cognitive search strategy, Infotaxis, that employs a grid-based approach to maintaining information state. Then, researchers have replaced the grid-based method in the Infotaxis algorithm with a sequential monte carlo framework based on particle filtering, so that the algorithm can estimate the source intensity and the source position. Different cognitive search strategies may employ different reward functions. The original Infotaxi algorithm designed a reward function containing two terms to achieve the trade-off between Exploration (Exploration) and development (Exploitation). Another cognitive search algorithm named Entrotaxi, proposed by Hakinson et al, designs a reward function based on the principle of maximum entropy sampling. This reward function measures the uncertainty of the future expected probe event rather than the uncertainty of the source information considered in the Infotaxis algorithm.
One feature of the gas leakage source search scenario is the large spatial scale. In this scenario, searching for diffuse sources using a single robot may take too much time, or may not be able to perform efficient iterative calculations due to being far from the source, resulting in no sources being found. By using a multi-robot system, multiple robots complete tasks through coordinated actions, many complex tasks that are difficult to complete by a single robot can be effectively completed. The problem of multi-robot cooperative control has attracted a wide range of attention. Soares et al propose a graph-based formation control algorithm that can organize robots into arbitrary, evolving shapes, and achieve tracing to gas diffusion sources. The odor source positioning algorithm based on the particle swarm optimization proposed by Jatmiko combines chemotaxis and wind tendency methods in the improved particle swarm optimization algorithm, can position a source in an environment with obstacles, and solves the problem of dynamic convective diffusion. In addition, Hajieghrary and Ani propose a multi-robot cooperation "information leasing" strategy to deal with large-scale robots. However, in consideration of uncertainty in the sourcing process, there is still a wide research space on how to control multiple robots to perform collaborative sourcing in a large-space-scale scene to improve the sourcing efficiency and the sourcing accuracy.
Disclosure of Invention
The technical problems to be solved by the invention are as follows: the invention can realize the gas leakage source search of multiple robots, and the multi-robot cooperative algorithm is utilized to control a plurality of robots to cooperatively search the source in a scene with large spatial scale by considering the uncertainty existing in the source searching process, so that the source searching efficiency and the source searching accuracy can be effectively improved, and the method and the system are suitable for searching scenes of various gas sources, including the source searching of dangerous chemical leakage accidents in chemical industrial parks, the gas leakage source searching in residential buildings and the like.
In order to solve the technical problems, the invention adopts the technical scheme that:
a multi-robot-source search scheme generation method based on a cognitive search strategy comprises the following steps:
1) establishing a gas leakage model and a detection model of a sensor;
2) modeling a leakage source searching process based on a gas leakage model and a detection model of a sensor;
3) representing iterative updating of source item parameter estimation in a leakage source searching process by using a particle filtering method, and establishing a source searching action scheme based on a cognitive search strategy;
4) improving the obstacle avoidance function of the source searching action scheme aiming at the actual obstacle scene;
5) aiming at the combination of the improved source searching action scheme and the multi-robot control technology, a multi-robot source searching action scheme based on a cognitive search strategy is generated.
Optionally, the functional expression of the gas leakage model established in step 1) is:
Figure BDA0003387066330000021
in the above formula, V is the average wind speed,
Figure BDA0003387066330000022
gradient in the y-axis direction, c (r | θ)0) Is the concentration of gas at position r ═ { x, y }, θ0={r0Q is a leakage source r0Position r of0={x0,y0The source parameter of Q is the diffusion intensity, D is the effective diffusion coefficient of the gas, Δ c (r | θ }0) For the variation of concentration, τ is the gas molecular lifetime and δ is the dirac function.
Optionally, the functional expression of the detection model of the sensor established in step 1) is:
Figure BDA0003387066330000031
in the above formula, P (d (r) | θ0) Is the probability that the sensor will contact d gas molecules per unit time at position R ═ x, y ═ d is the number of times the sensor will contact gas molecules per unit time at position R ═ x, y ═ R (R | θ!), R0) Is the average number of contacts of the sensor with gas molecules per unit time, and has:
R(r|θ0)=4πDac(r|θ0)
in the above formula, D is the effective diffusion coefficient of the gas, a is the radius of the spherical sensor, and c (r | theta [ ])0) Is the concentration of the gas at position r ═ { x, y }, and has:
Figure BDA0003387066330000032
in the above formula, c (r | θ)0) Is the concentration of gas at position r ═ { x, y }, θ0={r0Q is a leakage source r0Position r of0={x0,y0The source parameter of Q is diffusion intensity, V is average wind speed, and the functional expression of the intermediate variable lambda is:
Figure BDA0003387066330000033
optionally, step 2) comprises: the source term parameter to be estimated is θ0={r0Q, the estimation of the source term parameters is represented by a probability density function, which will represent the post probability density function P (theta) at the kth step of the information state obtained after any kth stepk|Dk) Determined by all the information that has been collected in the first k steps, where θkSource term parameters estimated for the k-th step, Dk={d1(r1),d2(r2),…,dk(rk) Denotes the kth step at position rk={xk,ykAll the information collected on, d1(r1)~dk(rk) Respectively reading the sensor obtained in the steps 1-k; and an initial post probability density function P (theta)0) The posterior probability density function P (theta) of any k step is preset by prior knowledgek|Dk) Updating by using a Bayesian formula according to the following formula:
Figure BDA0003387066330000034
in the above formula, P (θ)k|Dk-1) Posterior probability Density of step k-1, P (d)k(rk)|θk) As density weight, P (d)k(rk)|Dk-1) Is a normalization factor; considering the four-connectivity condition, the selectable action set U is U { ×), ↓, → }, four elements { ({), ↓), (±), → } in the selectable action set U respectively represent four directions of the action, the sourcing robot calculates an information gain i (U) that can be obtained at each candidate position in the selectable action set U by using a reward function in each step, and selects a position U with the largest information gain i (U) according to the following formula*As the next position, the model function expression of the leakage source search process is obtained as follows:
Figure BDA0003387066330000035
in the above formula, u*For the next position to be selected, U is the candidate position corresponding to the element in the selectable action set U, i (U) is the information gain that can be obtained at the candidate position, and argmax represents the position at which the information gain i (U) is selected to be the maximum.
Optionally, step 3) comprises: for any k-th step, the post probability density function P (theta)k|Dk) Sampling to generate N random samples with weights
Figure BDA0003387066330000041
Wherein
Figure BDA0003387066330000042
Represents the estimation of the source term parameters by the ith particle of the kth step,
Figure BDA0003387066330000043
are particles
Figure BDA0003387066330000044
Corresponding weight, N weights
Figure BDA0003387066330000045
With a total of 1, using N weighted random samples
Figure BDA0003387066330000046
The N particles as the particle filtering method approximately represent the post probability density function P (theta) of the k stepk|Dk) Comprises the following steps:
Figure BDA0003387066330000047
in the above formula, δ is a dirac function; thereby obtaining the post probability density function P (theta) of any k step in the leakage source searching processk|Dk) Is expressed as a particle update of the particle filter method, and at the time of the particle update of the cash particle filter method, the effective sampling scale N is calculatedeffTo define the degradation degree of particle filter if the effective sampling scale NeffAnd when the particle size is reduced to be smaller than a set threshold eta, a new particle set is obtained by sampling the original particle set in an integrated manner by adopting a residual error resampling method.
Optionally, the obtaining a new particle set from the original particle lumped sample by the method of residual resampling includes: calculating the cumulative probability of the k step according to the following formula:
Figure BDA0003387066330000048
in the above formula, the first and second carbon atoms are,
Figure BDA0003387066330000049
for the first j cumulative probabilities of step k,
Figure BDA00033870663300000410
the first j-1 cumulative probabilities for step k,
Figure BDA00033870663300000411
indicating the original set of particles
Figure BDA00033870663300000412
The jth weight in (1), N is the number of particles in the particle set; generating N at [0,1 ]]Set of random numbers evenly distributed within an interval
Figure BDA00033870663300000413
For random number sets
Figure BDA00033870663300000414
Each of which is random
Figure BDA00033870663300000415
Find the minimum j value so as to satisfy
Figure BDA00033870663300000416
Example of having new particles concentrated if a satisfied j value is found
Figure BDA00033870663300000417
And is
Figure BDA00033870663300000418
Thereby obtaining a resampled particle set
Figure BDA00033870663300000419
Then on the resampled particle set
Figure BDA00033870663300000420
And sampling a new particle set by using a Metropolis-Hastings algorithm, and taking a normal distribution probability function as the transition probability of sampling by using the Metropolis-Hastings algorithm.
Optionally, step 4) comprises:
4.1) initializing source item parameters and determining a parameter estimation range; initializing particle filter parameters and setting the value of step k to be 1;
4.2) obtaining the sensor degree d at the k-th step position r ═ { x, y }k(rk);
4.3) executing the k step updating particle filtering;
4.4) calculating the passable action set Uj of the k stepkOf each candidate positionInformation entropy, calculating action set U of k stepkThe information entropy of each candidate position in the k step is calculated according to the following formula to obtain the optimal passable action direction uj* kAnd optimal direction of action u* k
Figure BDA00033870663300000421
In the above formula, I (uj)k) Is the optional action set Uj of the k stepkInformation entropy of each candidate position in, I (u)k) Is the optional action set Uj of the k stepkThe information entropy of each candidate position;
4.5) judging uj of the k step* kAnd u* kIf both are equal, if uj of the k-th step* kAnd u* kIf both are equal, uj is selected* kAs next position and move, jump to step 4.7);
4.6) traversal from 1 to the recent memory step number m satisfies
Figure BDA0003387066330000051
&
Figure BDA0003387066330000052
The number of (1), where uj* k-mOptimal passable direction of motion, u, without k-m steps* k-mThe optimal action direction of the k-m steps; if the number count is in a preset threshold n, then uj is selected* kThe direction is moved to turn to the next intersection u* kDirection; otherwise, select uj* kAs next position and move, jump to step 4.7);
4.7) judging whether a preset stopping condition is met, if so, ending the source searching, and skipping to the step 5); otherwise, jumping to step 4.2) and continuing iteration.
Optionally, in step 5), when the multi-robot-source searching action scheme based on the cognitive search strategy is generated by combining the improved source searching action scheme with the multi-robot control technologyIn the multi-robot, each robot i can determine the position of the robot at each step
Figure BDA0003387066330000053
And the degree of the sensor
Figure BDA0003387066330000054
Transmitting to other robots nearby, receiving the sensor degrees collected by the robots nearby by each robot i, independently executing the same searching source and maintaining the post probability density function P of the k stepik|Dk) And the post probability density function P of the k step of each robotik|Dk) Are the same size.
In addition, the invention also provides a multi-robot-source search scheme generation system based on the cognitive search strategy, which comprises a microprocessor and a memory which are connected with each other, wherein the microprocessor is programmed or configured to execute the steps of the multi-robot-source search scheme generation method based on the cognitive search strategy.
In addition, the invention also provides a computer readable storage medium, wherein a computer program programmed or configured to execute the cognitive search strategy-based multi-robot-source search scheme generation method is stored in the computer readable storage medium.
Compared with the prior art, the invention has the following advantages: the method comprises the steps of establishing a gas leakage model and a detection model of a sensor; modeling a leakage source searching process based on a gas leakage model and a detection model of a sensor; representing iterative updating of source item parameter estimation in a leakage source searching process by using a particle filtering method, and establishing a source searching action scheme based on a cognitive search strategy; improving the obstacle avoidance function of the source searching action scheme aiming at the actual obstacle scene; aiming at the combination of the improved source searching action scheme and the multi-robot control technology, a multi-robot source searching action scheme based on a cognitive search strategy is generated. The invention can realize the search of the gas leakage source of a plurality of robots, takes the uncertainty in the source searching process into consideration, utilizes a multi-robot cooperative algorithm to control a plurality of robots to cooperatively search the source in a scene with large spatial scale, can effectively improve the source searching efficiency and the source searching accuracy, and is suitable for the search scenes of various gas sources, including the source searching of the hazardous chemical leakage accidents in chemical industrial parks, the search of the gas leakage source in residential buildings and the like.
Drawings
FIG. 1 is a schematic diagram of a basic flow of a method according to an embodiment of the present invention.
Fig. 2 is a schematic diagram of a detection model of a sensor in an embodiment of the invention.
Fig. 3 is a schematic diagram of an obstacle avoidance action scheme according to an embodiment of the present invention.
FIG. 4 is a schematic flow chart of step 4) in the embodiment of the present invention.
FIG. 5 is a schematic diagram of an ideal multi-robot cooperative control action scenario in the embodiment of the present invention.
FIG. 6 is a schematic diagram illustrating a multi-robot cooperative control action scenario under the condition of blocked information communication according to an embodiment of the present invention.
FIG. 7 is a schematic flow chart of step 5) in the embodiment of the present invention.
Detailed Description
The following describes in detail a method for generating an action plan for controlling a plurality of robots to search for a gas leakage source based on a cognitive search strategy, taking a large obstacle scene as an example.
As shown in fig. 1, the method for generating a multi-robot-source search scheme based on a cognitive search policy in this embodiment includes:
1) establishing a gas leakage model and a detection model of a sensor;
2) modeling a leakage source searching process based on a gas leakage model and a detection model of a sensor;
3) representing iterative updating of source item parameter estimation in a leakage source searching process by using a particle filtering method, and establishing a source searching action scheme based on a cognitive search strategy;
4) improving the obstacle avoidance function of the source searching action scheme aiming at the actual obstacle scene;
5) aiming at the combination of the improved source searching action scheme and the multi-robot control technology, a multi-robot source searching action scheme based on a cognitive search strategy is generated.
In the embodiment, step 1) establishes a gas leakage model based on an atmospheric transmission diffusion model, and establishes a detection model of a sensor to process observed data; step 2) modeling a leakage source searching process, converting the leakage source searching process into a partial observable Markov decision process, and determining an information state, an available action set and a reward function; step 3) representing iterative updating of source item estimation by using a particle filtering method, adding a resampling step to avoid the problem of particle degradation, sampling new particles by using a Metropolis-Hastings algorithm, and then redesigning a reward function according to the distribution and weight of the particles to control the action of the robot; and 4) taking a complex obstacle scene as a background, considering the influence of the road network constraint from the algorithm level, and introducing an intermittent search strategy to improve the robot action scheme based on the cognitive search strategy. And 5) aiming at the characteristic of large space scale of a complex obstacle scene, considering uncertainty existing in the source searching process, and controlling a plurality of robots to perform collaborative source searching in the scene with large space scale by using a multi-robot collaborative algorithm. Through the steps, the gas leakage source search of multiple robots can be realized, uncertainty in the source searching process is considered, the multiple robots are controlled to search the source in a large-space-scale scene in a collaborative mode through the multi-robot collaborative algorithm, the source searching efficiency and the source searching accuracy can be effectively improved, and the method is suitable for the requirements of searching multiple gas source scenes including chemical industry park hazardous chemical leakage accident source searching, gas leakage source search in residential buildings and the like.
In this embodiment, the functional expression of the gas leakage model established in step 1) is:
Figure BDA0003387066330000061
in the above formula, V is the average wind speed,
Figure BDA0003387066330000062
gradient in the y-axis direction, c (r | θ)0) Is the concentration of the gas at the position r ═ { x, y },θ0={r0Q is a leakage source r0Position r of0={x0,y0The source parameter of Q is the diffusion intensity, D is the effective diffusion coefficient of the gas, Δ c (r | θ }0) For the variation of concentration, τ is the gas molecular lifetime and δ is the dirac function. The gas leakage model is derived from a steady-state convection diffusion Equation (addition-diffusion Equation), wherein V is the average wind speed, the average wind direction is the same as the negative direction of the Y axis, a specific coordinate system is shown in figure 2, and the horizontal and vertical coordinates in the figure represent the size of a scene. In this embodiment, the functional expression of the detection model of the sensor established in step 1) is:
Figure BDA0003387066330000071
in the above formula, P (d (r) | θ0) Is the probability that the sensor will contact d gas molecules per unit time at position R ═ x, y ═ d is the number of times the sensor will contact gas molecules per unit time at position R ═ x, y ═ R (R | θ!), R0) Is the average number of contacts of the sensor with gas molecules per unit time, and has:
R(r|θ0)=4πDac(r|θ0), (3)
in the above formula, D is the effective diffusion coefficient of the gas, a is the radius of the spherical sensor, and c (r | theta [ ])0) Is the concentration of the gas at position r ═ { x, y }, and has:
Figure BDA0003387066330000072
in the above formula, c (r | θ)0) Is the gas concentration at position r ═ { x, y }, which is also a three-dimensional analytical solution of equation (1); theta0={r0Q is a leakage source r0Position r of0={x0,y0The source parameter of Q is diffusion intensity, V is average wind speed, and the functional expression of the intermediate variable lambda is:
Figure BDA0003387066330000073
in the actual source searching process, the accuracy problem of the existing gas sensor is considered, and the detected concentration and the actual concentration have large errors. Therefore, a detection model of the sensor needs to be established to process observed data, the contact process of the sensor and gas molecules can be compared with the electromagnetic phenomenon, a Schumohofsky formula is introduced, concentration data of any position are converted into the average contact times of the sensor and gas chemical warfare agent molecules in unit time, and finally an average contact time model of the sensor and the gas molecules in unit time in the formula (3) is obtained. Where the gas concentration is higher, the sensor can contact more gas molecules per unit time.
At the same time, turbulence effects disturb the concentration field, resulting in the sensor only obtaining sporadic, intermittent valid readings, the effect of which has to be taken into account. We introduce a poisson process in the detection model of the sensor to approximate the effect of turbulence effects on gas diffusion. The Poisson process is one of random processes and is defined by the occurrence time of events, if a random process N (t) is a one-dimensional Poisson process with homogeneous time, the Poisson process meets the following two conditions, namely, the number of events occurring in two mutually exclusive intervals is mutually independent random variables; second, the probability distribution of the number of events occurring within the interval [ t, t + τ ] is:
Figure BDA0003387066330000081
in the above equation, μ is a positive number and is generally referred to as an arrival rate.
In this embodiment, the average number of times the sensor contacts the gas molecule per unit time is used as the arrival rate, i.e., μ ═ R (R | θ)0) Then, the probability that the sensor contacts d molecules per unit time at the r ═ x, y position is shown as equation (2), and fig. 2 shows the detection values of the sensor in one case scenario before and after considering the turbulence effect, respectively. Figure 2 is a schematic diagram of a detection model of the sensor in the embodiment,fig. 2 (a) shows the detection value of the sensor before the turbulent effect is considered, and fig. 2 (b) shows the detection value of the sensor after the turbulent effect is considered.
In this embodiment, step 2) includes: the source term parameter to be estimated is θ0={r0Q, the estimation of the source term parameters is represented by a probability density function, which will represent the post probability density function P (theta) at the kth step of the information state obtained after any kth stepk|Dk) Determined by all the information that has been collected in the first k steps, where θkSource term parameters estimated for the k-th step, Dk={d1(r1),d2(r2),…,dk(rk) Denotes the kth step at position rk={xk,ykAll the information collected on, d1(r1)~dk(rk) Respectively reading the sensor obtained in the steps 1-k; and an initial post probability density function P (theta)0) The posterior probability density function P (theta) of any k step is preset by prior knowledgek|Dk) Updating by using a Bayesian formula according to the following formula:
Figure BDA0003387066330000082
in the above formula, P (θ)k|Dk-1) Posterior probability Density of step k-1, P (d)k(rk)|θk) As density weight, P (d)k(rk)|Dk-1) Is a normalization factor; considering the four-connectivity condition, the selectable action set U is U { ×), ↓, → }, four elements { ({), ↓), (±), → } in the selectable action set U respectively represent four directions of the action, the sourcing robot calculates an information gain i (U) that can be obtained at each candidate position in the selectable action set U by using a reward function in each step, and selects a position U with the largest information gain i (U) according to the following formula*As the next position, the model function expression of the leakage source search process is obtained as follows:
Figure BDA0003387066330000083
in the above formula, u*For the next position to be selected, U is the candidate position corresponding to the element in the selectable action set U, i (U) is the information gain that can be obtained at the candidate position, and argmax represents the position at which the information gain i (U) is selected to be the maximum. When the source searching starts, because the source item information is not collected, the initial probability density function P (theta) of the source item parameters can be obtained according to the prior knowledge0). It is assumed here that the parameter estimation ranges can be determined using a priori knowledge and that the initial probability density function is represented by a uniform distribution. And the cognitive search algorithm guides the searching robot to move by adopting a fixed step length in each step. In this embodiment, only the case of four connections is considered, and the selectable action set is U { ↓, ←, → }. The source searching robot adopts a reward function to calculate the information gain I (u) which can be obtained from each candidate position in the selectable action set in each step, and selects the position u with the maximum information gain*As the next position.
In this embodiment, step 3) includes: for any k-th step, the post probability density function P (theta)k|Dk) Sampling to generate N random samples with weights
Figure BDA0003387066330000084
Wherein
Figure BDA0003387066330000085
Represents the estimation of the source term parameters by the ith particle of the kth step,
Figure BDA0003387066330000086
are particles
Figure BDA0003387066330000091
Corresponding weight, N weights
Figure BDA0003387066330000092
With a total of 1, using N weighted random samples
Figure BDA0003387066330000093
The N particles as the particle filtering method approximately represent the post probability density function P (theta) of the k stepk|Dk) Comprises the following steps:
Figure BDA0003387066330000094
in the above formula, δ is a dirac function; thereby obtaining the post probability density function P (theta) of any k step in the leakage source searching processk|Dk) Is expressed as a particle update of the particle filter method, and at the time of the particle update of the cash particle filter method, the effective sampling scale N is calculatedeffTo define the degradation degree of particle filter if the effective sampling scale NeffAnd when the particle size is reduced to be smaller than a set threshold eta, a new particle set is obtained by sampling the original particle set in an integrated manner by adopting a residual error resampling method.
In this embodiment, obtaining a new particle set from an original particle set by using a residual resampling method includes: calculating the cumulative probability of the k step according to the following formula:
Figure BDA0003387066330000095
in the above formula, the first and second carbon atoms are,
Figure BDA0003387066330000096
for the first j cumulative probabilities of step k,
Figure BDA0003387066330000097
the first j-1 cumulative probabilities for step k,
Figure BDA0003387066330000098
indicating the original set of particles
Figure BDA0003387066330000099
The jth weight in (1), N is the number of particles in the particle set; generating N at [0,1 ]]Set of random numbers evenly distributed within an interval
Figure BDA00033870663300000910
For random number sets
Figure BDA00033870663300000911
Each of which is random
Figure BDA00033870663300000912
Find the minimum j value so as to satisfy
Figure BDA00033870663300000913
Example of having new particles concentrated if a satisfied j value is found
Figure BDA00033870663300000914
And is
Figure BDA00033870663300000915
Thereby obtaining a resampled particle set
Figure BDA00033870663300000916
Then on the resampled particle set
Figure BDA00033870663300000917
And sampling a new particle set by using a Metropolis-Hastings algorithm, and taking a normal distribution probability function as the transition probability of sampling by using the Metropolis-Hastings algorithm.
Step 3) comprises particle filtering and resampling.
When particle filtering is carried out, iterative updating of source item estimation is expressed by using a particle filtering method, and the posterior probability density function P (theta) of the k step is subjected tok|Dk) Sampling to generate N weighted random samples
Figure BDA00033870663300000918
Wherein
Figure BDA00033870663300000919
Represents the estimation of the source term parameters by the ith particle of the kth step,
Figure BDA00033870663300000920
the weight value corresponding to the particle is
Figure BDA00033870663300000921
Approximating P (theta) with the N particlesk|Dk) As shown in formula (9). In the resampling step, when the kth step of the sensor is at the position rkOn obtained sensor reading dkThe particle weight is updated as per the above equation. It can be seen that if a particle estimates the source parameter, a reading d can be generatedkThe larger the probability of (c), the larger the updated weight of the particle. After several iterations, some particles that may be close to the true source parameter may get heavier and heavier, which leads to particle degradation problems. Typically using an effective sampling dimension NeffTo define the degradation level of the particle filter:
Figure BDA00033870663300000922
in the above formula, N is the number of weighted random samples,
Figure BDA00033870663300000923
the weight value corresponding to the particle. When N is presenteffWhen the particle size is reduced to a set threshold eta, a residual error resampling method is adopted to solve the problem of particle degradation. From existing collections of particles
Figure BDA00033870663300000924
Intermediate sampling to obtain new particle set
Figure BDA0003387066330000101
The method mainly comprises the following steps: calculating cumulative probability Ck
Figure BDA0003387066330000102
Generating N at [0,1 ]]Random numbers uniformly distributed in intervals
Figure BDA0003387066330000103
For each
Figure BDA0003387066330000104
Find the minimum j such that
Figure BDA0003387066330000105
Then order
Figure BDA0003387066330000106
According to the method, the particles with larger weight values are copied for a plurality of times, and the particles with small weight values and small contribution to the calculation of the posterior probability function are eliminated. The resampling step, however, results in a loss of particle diversity, and after several iterations, all particles may become duplicates of some few particles. Furthermore, updating the particle filter does not reduce or eliminate the error of the initial particle to point estimation of the source term parameters. Therefore, there is a need to increase the diversity of particles to solve this problem, and after the resampling step, a new set of particles is sampled from the original set of particles using the Metropolis-Hastings algorithm. And adopting a normal distribution probability function as the transition probability.
Under the constraint of a road network, the k-th step of the source searching robot is selected as the set
Figure BDA0003387066330000107
The robot can only follow UjkIs selected as the next step
Figure BDA0003387066330000108
The position of (a). However, this may result in the homing device wandering in one area all the time and not being able to move in the direction of origin. At this time, combining the intermittent search principle, if the movement in the same direction cannot be performed for n times due to the road network constraint in the previous m steps, the purpose of moving to the source direction is achieved by adopting long-distance steering movement to leave the area and steering the direction which cannot be performed due to the road network constraint at the nearest intersection. The process is shown in fig. 3, where the orientation is calibrated, the solid black line represents the robot trajectory, the blue rectangle represents the obstacle, and the red dot represents the robot working at the position sensor. The blue circle in the left picture represents the robot's placeAt the position, the dotted line with the arrow points to the moving direction of the robot without the road network constraint. It can be seen that due to the blocking of the obstacle, the robot can not move forward for many times, and only can move left and right, so that the robot adopts long-distance steering movement to bypass the obstacle, and the sensor does not work in the period. As shown in fig. 4, step 4) in this embodiment includes:
4.1) initializing source item parameters and determining a parameter estimation range; initializing particle filter parameters and setting the value of step k to be 1;
4.2) obtaining the sensor degree d at the k-th step position r ═ { x, y }k(rk);
4.3) executing the k step updating particle filtering;
4.4) calculating the passable action set Uj of the k stepkCalculating the action set U of the k step according to the information entropy of each candidate positionkThe information entropy of each candidate position in the k step is calculated according to the following formula to obtain the optimal passable action direction uj* kAnd optimal direction of action u* k
Figure BDA0003387066330000109
In the above formula, I (uj)k) Is the optional action set Uj of the k stepkInformation entropy of each candidate position in, I (u)k) Is the optional action set Uj of the k stepkThe information entropy of each candidate position;
4.5) judging uj of the k step* kAnd u* kIf both are equal, if uj of the k-th step* kAnd u* kIf both are equal, uj is selected* kAs next position and move, jump to step 4.7);
4.6) traversal from 1 to the recent memory step number m satisfies
Figure BDA00033870663300001010
&
Figure BDA00033870663300001011
The number of (1), where uj* k-mOptimal passable direction of motion, u, without k-m steps* k-mThe optimal action direction of the k-m steps; if the number count is in a preset threshold n, then uj is selected* kThe direction is moved to turn to the next intersection u* kDirection; otherwise, select uj* kAs next position and move, jump to step 4.7);
4.7) judging whether a preset stopping condition is met, if so, ending the source searching, and skipping to the step 5); otherwise, jumping to step 4.2) and continuing iteration.
As shown in FIGS. 5 and 6, for the multi-robot based collaborative search algorithm, it is assumed that each robot i in the team (composed of N robots) individually executes the aforementioned Entrotaxi-Turn sourcing algorithm and maintains the respective source item estimates Pik|Dk) The estimate describes the location where the source may be present. When each robot carries out source searching, the robot can find the position of the robot
Figure BDA0003387066330000115
And the obtained sensor readings
Figure BDA0003387066330000114
Each robot may receive the information collected by the nearby robots and update its estimate of the source item. Each robot passing a respective probability estimation function Pik|Dk) To decide where to move next. Without communication delay or loss of information, the probabilistic estimate for each robot about the source location is the same, i.e., P1k|Dk)=P2k|Dk)=...=PNk|Dk). This means that, in the algorithm level, all robots share the same particle filter to estimate the source position, and the particle filter can estimate the positions where all robots will move. Where FIG. 5 is an ideal case, P1k|Dk)=P2k|Dk)=P3k|Dk) (ii) a FIG. 6 shows a situation where communication of information is blocked under non-ideal conditions, P1k|Dk)≠P2k|Dk)=P3k|Dk)。
In this embodiment, when the multi-robot source search action scheme based on the cognitive search policy is generated by combining the improved source search action scheme with the multi-robot control technology in step 5), each robot i in the multiple robots uses its own position at each step
Figure BDA0003387066330000116
And the degree of the sensor
Figure BDA0003387066330000117
Transmitting to other robots nearby, receiving the sensor degrees collected by the robots nearby by each robot i, independently executing the same searching source and maintaining the post probability density function P of the k stepik|Dk) And the post probability density function P of the k step of each robotik|Dk) Are the same size.
As shown in fig. 7, step 5) in this embodiment includes: 5.1) initializing source item parameters and determining a parameter estimation range; initializing particle filter parameters; initializing a value for representing step k to 1; 5.2) obtaining the sensor degree d at the k-th step position r ═ { x, y }k(rk) (ii) a 5.3) executing the k step updating particle filtering; 5.4) calculating the passable action set Uj of the k stepkCalculating the action set U of the k step according to the information entropy of each candidate positionkThe information entropy of each candidate position in the k step, and the optimal passable action direction uj of the k step is calculated according to the following formula* kAnd optimal direction of action u* k
Figure BDA0003387066330000111
In the above formula, the first and second carbon atoms are,I(ujk) Is the optional action set Uj of the k stepkInformation entropy of each candidate position in, I (u)k) Is the optional action set Uj of the k stepkThe information entropy of each candidate position; 5.5) if uj of step k* kAnd u* kIf both are equal, uj is selected* kAs next position and move, jump to step 5.7); 5.6) traversal from 1 to the recent memory step number m satisfies
Figure BDA0003387066330000112
&
Figure BDA0003387066330000113
The number of (1), where uj* k-mOptimal passable direction of motion, u, without k-m steps* k-mThe optimal action direction of the k-m steps; if the number count is in a preset threshold value n, selecting a direction away from other robots to move, and turning to the next intersection u* kDirection; otherwise, select uj* kAs next position and move, jump to step 5.7); 5.7) judging whether a preset stopping condition is met, if so, ending the source searching, and skipping to the step 5); otherwise, jump to step 5.2). In this embodiment, by increasing the number of robots, the time for finding the original position can be reduced, and the variance of the source position estimation can also be reduced, because the increased number of robots effectively improves the team detection rate. Therefore, a source searching robot team consisting of N robots can complete source searching work by using fewer particle filtering iteration times, so that the source searching time is shortened, and the source searching success rate is improved.
In summary, the method of the present embodiment includes constructing a gas leakage model and a sensor detection model according to an atmospheric transmission diffusion model; modeling a leakage source searching process; establishing a source searching action scheme based on a cognitive search strategy; improving the obstacle avoidance function of the action scheme aiming at the actual obstacle scene; aiming at a large-scale obstacle scene, an action scheme is combined with a multi-robot control technology to generate a multi-robot source search action scheme based on a cognitive search strategy. The method has the advantages of wide applicable scenes, high searching efficiency and success rate and strong robustness in a complex environment. The key method for realizing the embodiment is a multi-robot-source search action scheme generation method based on a cognitive search strategy. According to the method, an action scheme can be provided for the gas source search problem in a large obstacle scene. The method establishes a gas leakage model and establishes a detection model according to the accuracy problem of the robot sensor. The whole source searching process is modeled based on a cognitive searching strategy, an obstacle avoidance algorithm is designed, and meanwhile, a multi-robot control action scheme corresponding to a large-scale station under the scene is provided. Compared with the existing autonomous source searching action scheme, the method has the advantages that the adaptive scenes are wider, and the searching efficiency and the success rate in large obstacle scenes are obviously improved. In practical application, the method of the embodiment has stronger practicability.
In addition, the embodiment also provides a system for generating the multi-robot-source search scheme based on the cognitive search strategy, which comprises a microprocessor and a memory which are connected with each other, wherein the microprocessor is programmed or configured to execute the steps of the method for generating the multi-robot-source search scheme based on the cognitive search strategy.
In addition, the present embodiment also provides a computer-readable storage medium, in which a computer program is stored, which is programmed or configured to execute the aforementioned multi-robot-source search scheme generation method based on the cognitive search policy.
As will be appreciated by one skilled in the art, embodiments of the present application may be provided as a method, system, or computer program product. Accordingly, the present application may take the form of an entirely hardware embodiment, an entirely software embodiment or an embodiment combining software and hardware aspects. Furthermore, the present application may take the form of a computer program product embodied on one or more computer-readable storage media (including, but not limited to, disk storage, CD-ROM, optical storage, and the like) having computer-usable program code embodied therein. The present application is directed to methods, apparatus (systems), and computer program products according to embodiments of the application, wherein the instructions that execute via the flowcharts and/or processor of the computer program product create means for implementing the functions specified in the flowchart flow or flows and/or block diagram block or blocks. These computer program instructions may also be stored in a computer-readable memory that can direct a computer or other programmable data processing apparatus to function in a particular manner, such that the instructions stored in the computer-readable memory produce an article of manufacture including instruction means which implement the function specified in the flowchart flow or flows and/or block diagram block or blocks. These computer program instructions may also be loaded onto a computer or other programmable data processing apparatus to cause a series of operational steps to be performed on the computer or other programmable apparatus to produce a computer implemented process such that the instructions which execute on the computer or other programmable apparatus provide steps for implementing the functions specified in the flowchart flow or flows and/or block diagram block or blocks.
The above description is only a preferred embodiment of the present invention, and the protection scope of the present invention is not limited to the above embodiments, and all technical solutions belonging to the idea of the present invention belong to the protection scope of the present invention. It should be noted that modifications and embellishments within the scope of the invention may occur to those skilled in the art without departing from the principle of the invention, and are considered to be within the scope of the invention.

Claims (10)

1. A multi-robot-source search scheme generation method based on a cognitive search strategy is characterized by comprising the following steps:
1) establishing a gas leakage model and a detection model of a sensor;
2) modeling a leakage source searching process based on a gas leakage model and a detection model of a sensor;
3) representing iterative updating of source item parameter estimation in a leakage source searching process by using a particle filtering method, and establishing a source searching action scheme based on a cognitive search strategy;
4) improving the obstacle avoidance function of the source searching action scheme aiming at the actual obstacle scene;
5) aiming at the combination of the improved source searching action scheme and the multi-robot control technology, a multi-robot source searching action scheme based on a cognitive search strategy is generated.
2. The method for generating the multi-robot-source search scheme based on the cognitive search strategy as claimed in claim 1, wherein the functional expression of the gas leakage model established in the step 1) is as follows:
Figure FDA0003387066320000011
in the above formula, V is the average wind speed,
Figure FDA0003387066320000012
gradient in the y-axis direction, c (r | θ)0) Is the concentration of gas at position r ═ { x, y }, θ0={r0Q is a leakage source r0Position r of0={x0,y0The source parameter of Q is the diffusion intensity, D is the effective diffusion coefficient of the gas, Δ c (r | θ }0) For the variation of concentration, τ is the gas molecular lifetime and δ is the dirac function.
3. The method for generating the multi-robot-source search scheme based on the cognitive search strategy as claimed in claim 2, wherein the functional expression of the detection model of the sensor established in the step 1) is as follows:
Figure FDA0003387066320000013
in the above formula, P (d (r) | θ0) Is the probability that the sensor will contact d gas molecules per unit time at position R ═ x, y ═ d is the number of times the sensor will contact gas molecules per unit time at position R ═ x, y ═ R (R | θ!), R0) Is the average number of contacts of the sensor with gas molecules per unit time, and has:
R(r|θ0)=4πDac(r|θ0)
in the above formula, D is the effective diffusion coefficient of the gas, a is the radius of the spherical sensor, and c (r | theta [ ])0) Is the concentration of the gas at position r ═ { x, y }, and has:
Figure FDA0003387066320000014
in the above formula, c (r | θ)0) Is the concentration of gas at position r ═ { x, y }, θ0={r0Q is a leakage source r0Position r of0={x0,y0The source parameter of Q is diffusion intensity, V is average wind speed, and the functional expression of the intermediate variable lambda is:
Figure FDA0003387066320000015
4. the multi-robot-source search scheme generation method based on the cognitive search strategy as claimed in claim 3, wherein the step 2) comprises: the source term parameter to be estimated is θ0={r0Q, the estimation of the source term parameters is represented by a probability density function, which will represent the post probability density function P (theta) at the kth step of the information state obtained after any kth stepk|Dk) Determined by all the information that has been collected in the first k steps, where θkSource term parameters estimated for the k-th step, Dk={d1(r1),d2(r2),…,dk(rk) Denotes the kth step at position rk={xk,ykAll the information collected on, d1(r1)~dk(rk) Respectively reading the sensor obtained in the steps 1-k; and an initial post probability density function P (theta)0) The posterior probability density function P (theta) of any k step is preset by prior knowledgek|Dk) Updating by using a Bayesian formula according to the following formula:
Figure FDA0003387066320000021
in the above formula, P (θ)k|Dk-1) Posterior probability Density of step k-1, P (d)k(rk)|θk) As density weight, P (d)k(rk)|Dk-1) Is a normalization factor; considering the four-connectivity condition, the selectable action set U is U { ×), ↓, → }, four elements { ({), ↓), (±), → } in the selectable action set U respectively represent four directions of the action, the sourcing robot calculates an information gain i (U) that can be obtained at each candidate position in the selectable action set U by using a reward function in each step, and selects a position U with the largest information gain i (U) according to the following formula*As the next position, the model function expression of the leakage source search process is obtained as follows:
Figure FDA0003387066320000022
in the above formula, u*For the next position to be selected, U is the candidate position corresponding to the element in the selectable action set U, i (U) is the information gain that can be obtained at the candidate position, and argmax represents the position at which the information gain i (U) is selected to be the maximum.
5. The multi-robot-source search scheme generation method based on the cognitive search strategy as claimed in claim 4, wherein the step 3) comprises: for any k-th step, the post probability density function P (theta)k|Dk) Sampling to generate N weighted random samples { xk i,wk i}i=1:NWherein x isk iRepresents the estimate of the source term parameter for the ith particle of the kth step, wk iIs a particle xk iCorresponding weight, N weights wk iWith a total of 1, using N weighted random samples { x }k i,wk i}i=1:NAs particle filtering methodsThe N particles approximate the post probability density function P (theta) of the k stepk|Dk) Comprises the following steps:
Figure FDA0003387066320000023
in the above formula, δ is a dirac function; thereby obtaining the post probability density function P (theta) of any k step in the leakage source searching processk|Dk) Is expressed as a particle update of the particle filter method, and at the time of the particle update of the cash particle filter method, the effective sampling scale N is calculatedeffTo define the degradation degree of particle filter if the effective sampling scale NeffAnd when the particle size is reduced to be smaller than a set threshold eta, a new particle set is obtained by sampling the original particle set in an integrated manner by adopting a residual error resampling method.
6. The method for generating the multi-robot-source search scheme based on the cognitive search strategy according to claim 5, wherein the obtaining of the new particle set from the original particle set by using the residual resampling method comprises: calculating the cumulative probability of the k step according to the following formula:
Figure FDA0003387066320000031
in the above formula, the first and second carbon atoms are,
Figure FDA0003387066320000032
is the first j cumulative probabilities, C, of step kk i-1The first j-1 cumulative probabilities for step k,
Figure FDA0003387066320000033
indicating the original set of particles
Figure FDA0003387066320000034
The jth weight in (1), N is the number of particles in the particle set; generating N at [0,1 ]]Uniformly distributed within the intervalSet of machine numbers [ mu ]k i}i=1:N(ii) a For a set of random numbers muk i}i=1:NEach random number mu ink iFind the minimum value of j such that
Figure FDA0003387066320000035
Example of having new particles concentrated if a satisfied j value is found
Figure FDA0003387066320000036
And wk i1/N, resulting in a resampled set of particles) { (θ)k i,wk i)}i=1:NThen for the resampled set of particles) { (θ)k i,wk i)}i=1:NAnd sampling a new particle set by using a Metropolis-Hastings algorithm, and taking a normal distribution probability function as the transition probability of sampling by using the Metropolis-Hastings algorithm.
7. The multi-robot-source search scheme generation method based on the cognitive search strategy as claimed in claim 6, wherein the step 4) comprises:
4.1) initializing source item parameters and determining a parameter estimation range; initializing particle filter parameters and setting the value of step k to be 1;
4.2) obtaining the sensor degree d at the k-th step position r ═ { x, y }k(rk);
4.3) executing the k step updating particle filtering;
4.4) calculating the passable action set Uj of the k stepkCalculating the action set U of the k step according to the information entropy of each candidate positionkThe information entropy of each candidate position in the k step is calculated according to the following formula to obtain the optimal passable action direction uj* kAnd optimal direction of action u* k
Figure FDA0003387066320000037
In the above formula, I (uj)k) Is the optional action set Uj of the k stepkInformation entropy of each candidate position in, I (u)k) Is the optional action set Uj of the k stepkThe information entropy of each candidate position;
4.5) judging uj of the k step* kAnd u* kIf both are equal, if uj of the k-th step* kAnd u* kIf both are equal, uj is selected* kAs next position and move, jump to step 4.7);
4.6) traversal from 1 to the recent memory step number m satisfies
Figure FDA0003387066320000038
The number of (1), where uj* k-mOptimal passable direction of motion, u, without k-m steps* k-mThe optimal action direction of the k-m steps; if the number count is in a preset threshold n, then uj is selected* kThe direction is moved to turn to the next intersection u* kDirection; otherwise, select uj* kAs next position and move, jump to step 4.7);
4.7) judging whether a preset stopping condition is met, if so, ending the source searching, and skipping to the step 5); otherwise, jumping to step 4.2) and continuing iteration.
8. The method as claimed in claim 7, wherein the step 5) combines the improved sourcing behavior scheme with the multi-robot control technology to generate the multi-robot sourcing search behavior scheme based on the cognitive search strategy, and each robot i in the multi-robot combines its own position r at each stepk iAnd the sensor degree dk iTransmitting to other robots nearby, receiving the sensor degrees collected by the robots nearby by each robot i, independently executing the same searching source and maintaining the post probability density function P of the k stepik|Dk) And each ofThe k-th step of the robot is followed by the probability density function Pik|Dk) Are the same size.
9. A multi-robot-source search scenario generation system based on cognitive search strategy, comprising a microprocessor and a memory connected to each other, wherein the microprocessor is programmed or configured to perform the steps of the multi-robot-source search scenario generation method based on cognitive search strategy according to any one of claims 1 to 8.
10. A computer-readable storage medium having stored thereon a computer program programmed or configured to perform the cognitive search policy-based multi-robot-source search scenario generation method of any of claims 1-8.
CN202111457545.1A 2021-12-01 2021-12-01 Multi-robot-source search scheme generation method and system based on cognitive search strategy Pending CN114154383A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202111457545.1A CN114154383A (en) 2021-12-01 2021-12-01 Multi-robot-source search scheme generation method and system based on cognitive search strategy

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202111457545.1A CN114154383A (en) 2021-12-01 2021-12-01 Multi-robot-source search scheme generation method and system based on cognitive search strategy

Publications (1)

Publication Number Publication Date
CN114154383A true CN114154383A (en) 2022-03-08

Family

ID=80455826

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202111457545.1A Pending CN114154383A (en) 2021-12-01 2021-12-01 Multi-robot-source search scheme generation method and system based on cognitive search strategy

Country Status (1)

Country Link
CN (1) CN114154383A (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114326755A (en) * 2022-03-15 2022-04-12 中国人民解放军国防科技大学 Robot gas source searching method based on local path planning
CN117148872A (en) * 2023-11-01 2023-12-01 中国人民解放军国防科技大学 Robot collaborative source searching method, device and equipment under multi-gas diffusion source scene

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110514567A (en) * 2019-08-28 2019-11-29 哈尔滨工业大学 Gas source searching method based on comentropy
CN111505205A (en) * 2020-01-20 2020-08-07 中国安全生产科学研究院 Improved search algorithm for strong back calculation of gas leakage source
CN112231964A (en) * 2020-11-06 2021-01-15 中国人民解放军国防科技大学 Gas leakage source autonomous searching and positioning method based on deep reinforcement learning
CN112488153A (en) * 2020-11-06 2021-03-12 中国人民解放军国防科技大学 Autonomous searching and positioning method for odor sources based on clustering tendency principle
CN113052373A (en) * 2021-03-17 2021-06-29 南京工业职业技术大学 Monthly runoff change trend prediction method based on improved ELM model

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110514567A (en) * 2019-08-28 2019-11-29 哈尔滨工业大学 Gas source searching method based on comentropy
CN111505205A (en) * 2020-01-20 2020-08-07 中国安全生产科学研究院 Improved search algorithm for strong back calculation of gas leakage source
CN112231964A (en) * 2020-11-06 2021-01-15 中国人民解放军国防科技大学 Gas leakage source autonomous searching and positioning method based on deep reinforcement learning
CN112488153A (en) * 2020-11-06 2021-03-12 中国人民解放军国防科技大学 Autonomous searching and positioning method for odor sources based on clustering tendency principle
CN113052373A (en) * 2021-03-17 2021-06-29 南京工业职业技术大学 Monthly runoff change trend prediction method based on improved ELM model

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
宋程;贺昱曜;雷小康;杨盼盼;: "基于认知差异的多机器人协同信息趋向烟羽源搜索方法", 控制与决策, no. 01, 31 December 2018 (2018-12-31) *

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114326755A (en) * 2022-03-15 2022-04-12 中国人民解放军国防科技大学 Robot gas source searching method based on local path planning
CN117148872A (en) * 2023-11-01 2023-12-01 中国人民解放军国防科技大学 Robot collaborative source searching method, device and equipment under multi-gas diffusion source scene
CN117148872B (en) * 2023-11-01 2024-01-26 中国人民解放军国防科技大学 Robot collaborative source searching method, device and equipment under multi-gas diffusion source scene

Similar Documents

Publication Publication Date Title
Hutchinson et al. A review of source term estimation methods for atmospheric dispersion events using static or mobile sensors
Jatmiko et al. A pso-based mobile robot for odor source localization in dynamic advection-diffusion with obstacles environment: theory, simulation and measurement
Song et al. Online inspection path planning for autonomous 3D modeling using a micro-aerial vehicle
Zhou et al. Optimal motion strategies for range-only constrained multisensor target tracking
CN114154383A (en) Multi-robot-source search scheme generation method and system based on cognitive search strategy
Deepak et al. PSO based path planner of an autonomous mobile robot
MahmoudZadeh et al. Uninterrupted path planning system for Multi-USV sampling mission in a cluttered ocean environment
Song et al. Collaborative infotaxis: Searching for a signal-emitting source based on particle filter and Gaussian fitting
CN109902374B (en) Sudden pollution tracing method based on flight sensor patrol track optimization
Ojeda et al. Information-driven gas source localization exploiting gas and wind local measurements for autonomous mobile robots
Jatmiko et al. Robots implementation for odor source localization using PSO algorithm
CN112612290B (en) Underwater vehicle three-dimensional multi-task path planning method considering ocean currents
Charrow et al. Active control strategies for discovering and localizing devices with range-only sensors
Liu et al. Evasion of a team of dubins vehicles from a hidden pursuer
Miller et al. Optimal planning for target localization and coverage using range sensing
Deepak et al. Target seeking behaviour of an intelligent mobile robot using advanced particle swarm optimization
CN112231964A (en) Gas leakage source autonomous searching and positioning method based on deep reinforcement learning
Zhang et al. Safe and efficient robot manipulation: Task-oriented environment modeling and object pose estimation
Sahyoun et al. Dynamic plume tracking using mobile sensors
Ahmad et al. End-to-end probabilistic depth perception and 3d obstacle avoidance using pomdp
Liang et al. Global path planning for mobile robot based genetic algorithm and modified simulated annealing algorithm
Keshavarz-Mohammadiyan et al. PSO-PF target tracking in range-based Wireless Sensor Networks with distance-dependent measurement noise
CN116520852A (en) Method, device and equipment for capturing multiple targets by group robots under local information
Wang et al. Research on SLAM road sign observation based on particle filter
Li et al. Multi-mode filter target tracking method for mobile robot using multi-agent reinforcement learning

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination