WO2024120187A1 - Method for estimating dynamic target of unmanned aerial vehicle in information rejection environment - Google Patents

Method for estimating dynamic target of unmanned aerial vehicle in information rejection environment Download PDF

Info

Publication number
WO2024120187A1
WO2024120187A1 PCT/CN2023/133085 CN2023133085W WO2024120187A1 WO 2024120187 A1 WO2024120187 A1 WO 2024120187A1 CN 2023133085 W CN2023133085 W CN 2023133085W WO 2024120187 A1 WO2024120187 A1 WO 2024120187A1
Authority
WO
WIPO (PCT)
Prior art keywords
drone
target
information
axis
state
Prior art date
Application number
PCT/CN2023/133085
Other languages
French (fr)
Chinese (zh)
Inventor
徐升
朱兵
侯睿明
徐蓉蓉
徐天添
吴新宇
Original Assignee
中国科学院深圳先进技术研究院
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 中国科学院深圳先进技术研究院 filed Critical 中国科学院深圳先进技术研究院
Publication of WO2024120187A1 publication Critical patent/WO2024120187A1/en

Links

Classifications

    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02TCLIMATE CHANGE MITIGATION TECHNOLOGIES RELATED TO TRANSPORTATION
    • Y02T10/00Road transport of goods or passengers
    • Y02T10/10Internal combustion engine [ICE] based vehicles
    • Y02T10/40Engine management systems

Definitions

  • the present invention relates to the technical field of unmanned aerial vehicles, and more specifically, to a method for estimating dynamic targets of unmanned aerial vehicles in an information denial environment.
  • Target tracking technology is widely used in various applications, such as aerial photography, public safety, and humanitarian search and rescue.
  • Estimating the position and velocity of a target from noisy measurements obtained from different sensors is a widely studied direction. Due to the excellent maneuverability of unmanned aerial vehicles (UAVs), they can be used for target tracking after being equipped with angle of arrival (AOA) sensors.
  • AOA angle of arrival
  • UAVs unmanned aerial vehicles
  • the position and velocity of the self and the target can be estimated from noisy AOA measurements.
  • researchers have proposed different estimation algorithms to estimate the target state from nonlinear AOA measurements. For example, the maximum likelihood estimator (MLE) and pseudo linear estimator (PLE) as classic batch filters are used to estimate the target position and velocity from AOA target measurements with Gaussian noise.
  • MLE maximum likelihood estimator
  • PLE pseudo linear estimator
  • Kalman-based methods which are regarded as recursive filters, are more commonly used in the field of target tracking.
  • the extended Kalman filter (EKF), unscented Kalman filter (UKF), pseudo linear Kalman filter (PLKF) and cubature Kalman filter (CKF) are all used to varying degrees.
  • a cost function should be designed for UAV path optimization.
  • Common cost functions include the estimated covariance matrix and the Fisher Information Matrix (FIM).
  • FIM Fisher Information Matrix
  • Various algorithms can be used to optimize the cost function, such as gradient-based, exhaustive search, and learning-based methods.
  • the location of the drone is assumed to be accurately obtained using external information, such as the Global Positioning System (GPS).
  • GPS Global Positioning System
  • the drone may operate in an environment where external signals are missing (or called an information denial environment), such as indoor spaces and interference areas. In these areas, the drone's external signals are missing and its location cannot be obtained.
  • SLAM Simultaneous Localization and Mapping
  • additional information about surrounding anchor points is added to obtain the absolute target location.
  • target positioning through multiple anchor points (such as base station positioning) is already relatively mature, but since it is basically performed through the time difference of arrival (TDOA), the number of anchor points required must be greater than or equal to 3 or other prior knowledge must be introduced to uniquely determine the target location.
  • TDOA time difference of arrival
  • the purpose of the present invention is to overcome the defects of the above-mentioned prior art and provide a method for estimating dynamic targets of unmanned aerial vehicles in an information denial environment, the method comprising:
  • Acquire three-dimensional angle information where the three-dimensional angle information includes an arrival angle from the drone to the first anchor point, an arrival angle from the drone to the second anchor point, and an arrival angle from the drone to the target;
  • the three-dimensional angle information is filtered by an extended Kalman filter to obtain estimated drone state information and corresponding state covariance information, wherein the drone state information is used to characterize the position and speed of the drone and the position and speed of the target;
  • the loss function is constructed using the drone state information and the state covariance information to optimize the flight path of the drone at subsequent moments and achieve observation of the target.
  • the advantage of the present invention is that it provides a method for autonomous target tracking of drones based on azimuth (angle, which can be obtained by PTZ camera) and anchor points.
  • azimuth angle
  • anchor points with known absolute positions
  • the drone can provide the drone with the absolute position information of itself and the target in an environment where external signals (GPS, RTK, etc.) are lost, and track the target, providing a more robust solution for positioning and tracking of the drone itself and the target.
  • the present invention can be applied to the positioning systems of various unmanned aerial vehicles such as multi-rotor drones and fixed-wing drones. It can be used as a redundant means of self-positioning, and can also provide the absolute geographic coordinates of the target according to the azimuth, which improves the positioning speed and the positioning accuracy.
  • FIG1 is a geometric diagram of target tracking according to an embodiment of the present invention.
  • FIG2 is a flow chart of a method for estimating a dynamic target of a UAV in an information denial environment according to an embodiment of the present invention
  • FIG3 is a flow chart of an extended Kalman filter according to an embodiment of the present invention.
  • FIG4 is a schematic diagram of constructing a loss function according to an embodiment of the present invention.
  • FIG. 5 is a process diagram of a path optimization method according to an embodiment of the present invention.
  • the present invention provides a method for autonomous target positioning of unmanned aerial vehicles in an environment lacking external signals, which mainly includes two parts: target filtering estimation and unmanned aerial vehicle path optimization.
  • the method introduces two stationary markers with known absolute positions, and uses the three-dimensional angle information collected by the angle of arrival (AOA) sensor (including the angle of arrival ⁇ ub1 between the unmanned aerial vehicle and anchor point 1, the angle of arrival ⁇ ub2 between the unmanned aerial vehicle and anchor point 2, and the angle of arrival ⁇ up between the unmanned aerial vehicle and the mobile target) as input.
  • AOA angle of arrival
  • the position and velocity errors of the mobile target and the unmanned aerial vehicle itself can be converged to a lower range, and finally the position and velocity of itself and the target are output.
  • the method is introduced after filtering.
  • the path optimization algorithm improves the efficiency and accuracy of target positioning.
  • different sizes of no-fly zones are set around the target and landmarks for the drone during path planning.
  • anchor 1 anchor 1
  • anchor 2 anchor 2
  • anchor 1 and anchor 2 are markers with known absolute positions.
  • T represents transposition
  • the coordinates are unknown
  • the position of the drone at time k is marked as uk (unknown).
  • k is a discrete time identifier, and the discrete time interval is set to M.
  • the provided method for estimating dynamic targets of unmanned aerial vehicles in an information denial environment mainly includes: step S110, obtaining AOA sensor measurements; step S120, extending the Kalman filter; step S130, a path optimization algorithm based on gradient descent; step S140, determining whether it is within the no-fly distance range; if the determination is yes, performing path replanning (step S150); and then moving in a specified direction at a fixed speed (step S160).
  • the filter can use a recursive Kalman filter. Since the sensor measurement and the model are in a nonlinear relationship, nonlinear filtering is ultimately performed based on the extended Kalman filter.
  • the process mainly includes the following steps:
  • Step S210 predicting the prior state and covariance according to the state transition model and the initial state.
  • k-1 FX k-1
  • k-1 FX k-1
  • k-1 is the prior estimate based on the state X k-1
  • F is the state transfer matrix;
  • k-1 is the covariance matrix of the state X k
  • m k is the process noise, which is used to quantify the system error that has not been fully considered.
  • m k is an independent zero-mean additive white Gaussian noise, that is, m k ⁇ N(0,Q k ), and Q k is the covariance matrix of the system error m k ;
  • the state matrix X consists of the position and velocity of the drone itself and the position and velocity of the target, for example, in, represents the derivative (speed) of xuk with respect to time.
  • xuk is the position of the drone on the x-axis. is the velocity component of the drone on the x-axis
  • yuk is the position of the drone on the y-axis
  • x pk is the position of the target on the x-axis
  • y pk is the position of the target on the y-axis
  • k- 1 at time K-1 is transferred to the prior state X k
  • k-1 through the state transfer matrix F, where the state transfer is assumed to be a constant speed model. Therefore, the state transfer matrix is Where Fi is That is, P k+1 P k +V k M.
  • Step S220 calculating the Jacobian matrix according to the current state and measurement.
  • H k Jacobian(X k
  • H k is the measurement matrix at time k, which is a Jacobi matrix with 3 rows and 8 columns, expressed as:
  • dub1
  • dub2
  • dup
  • Step S230 calculating the estimated state residual according to the measurement model.
  • n k represents the sensor measurement noise, for example, is an independent zero-mean additive Gaussian white noise, that is, n k ⁇ N(0,R k ).
  • Step S240 calculating the Kalman gain.
  • Rk is the measurement noise, which is used to characterize the noise of the sensor measurement data.
  • Sk is an intermediate variable for simplified writing
  • Kk is the Kalman gain at time k.
  • Step S250 updating the posterior estimate and covariance.
  • k (IK k H k )P k
  • k is the posterior estimation state
  • k is the posterior covariance matrix
  • I is the 8*8 identity matrix.
  • the path optimization algorithm it takes the current state as input and predicts the direction of the drone that can effectively reduce the loss function (or cost function) through gradient descent, so that the drone can obtain the best observation path when moving in this direction.
  • FIG4 is a schematic diagram of constructing a loss function, which takes the current state X k
  • constructing the loss function specifically includes: X k
  • k-1 FX k-1
  • k-1 FX k-1
  • k, ⁇ X k+1
  • is a small displacement vector
  • k ⁇ is the state of state X k+1
  • H k+1 is calculated according to X k+1
  • tr( ⁇ ) is the matrix trace
  • J( ⁇ ) is the cost value after a small shift.
  • the movement of the drone has two degrees of freedom, so it can be decomposed into two components along the x-axis and the y-axis, which can be divided into positive and negative directions.
  • four movable directions [(d,0), (-d,0), (0,d), (0,-d)] are obtained.
  • FIG5 is a schematic diagram of the path optimization process, in which one branch is used to calculate the cost value of the UAV flying along the x-axis, and the other branch is used to calculate the cost value of the UAV flying along the y-axis, which specifically includes the following steps:
  • the present invention introduces markers (one or more) to cooperate with the AOA sensor to complete the acquisition of the drone's own coordinates, and introduces markers (one or more) to cooperate with the AOA sensor to complete the acquisition of the target coordinates.
  • the gradient descent of the cost function is completed by random perturbation to optimize the flight path of the drone and obtain the best observation of the moving target.
  • the present invention only requires, for example, a PTZ camera to measure the azimuth between the drone and the anchor point and the target.
  • the absolute position information of the drone and the target can be obtained without any other information. At least two anchor point information with known absolute positions can be used to complete the position tracking of the drone and the target. It has been verified that the present invention improves the efficiency and accuracy of target tracking based on AOA.
  • the present invention may be a system, a method and/or a computer program product.
  • the computer program product may include a computer-readable storage medium carrying computer-readable program instructions for causing a processor to implement various aspects of the present invention.
  • Computer readable storage medium can be a tangible device that can hold and store instructions used by an instruction execution device.
  • Computer readable storage medium can be, for example, but not limited to, an electrical storage device, a magnetic storage device, an optical storage device, an electromagnetic storage device, a semiconductor storage device, or any suitable combination thereof.
  • Non-exhaustive list of computer readable storage medium include: a portable computer disk, a hard disk, a random access memory (RAM), a read-only memory (ROM), an erasable programmable read-only memory (EPROM or flash memory), a static random access memory (SRAM), a portable compact disk read-only memory (CD-ROM), a digital versatile disk (DVD), a memory stick, a floppy disk, a mechanical encoding device, for example, a punch card or a convex structure in a groove on which instructions are stored, and any suitable combination thereof.
  • RAM random access memory
  • ROM read-only memory
  • EPROM or flash memory erasable programmable read-only memory
  • SRAM static random access memory
  • CD-ROM compact disk read-only memory
  • DVD digital versatile disk
  • memory stick a floppy disk
  • mechanical encoding device for example, a punch card or a convex structure in a groove on which instructions are stored, and any suitable combination thereof.
  • the computer readable storage medium used here is not interpreted as a transient signal itself, such as a radio wave or other freely propagating electromagnetic wave, an electromagnetic wave propagated by a waveguide or other transmission medium (for example, a light pulse by an optical fiber cable), or an electrical signal transmitted by a wire.
  • the computer-readable program instructions described herein can be downloaded from a computer-readable storage medium to each computing/processing device, or downloaded to an external computer or external storage device via a network, such as the Internet, a local area network, a wide area network, and/or a wireless network.
  • the network can include copper transmission cables, optical fiber transmissions, wireless transmissions, routers, firewalls, switches, gateway computers, and/or edge servers.
  • the network adapter card or network interface in each computing/processing device receives the computer-readable program instructions from the network and forwards the computer-readable program instructions for storage in the computer-readable storage medium in each computing/processing device.
  • the computer program instructions for performing the operations of the present invention may be assembler instructions, instruction set architecture (ISA) instructions, machine instructions, machine dependent instructions, microcode, firmware instructions, state setting data, or source code or object code written in any combination of one or more programming languages, including object-oriented programming languages such as Smalltalk, C++, Python, etc., and conventional procedural programming languages such as "C" language or similar programming languages.
  • the computer readable program instructions may be executed entirely on the user's computer, partially on the user's computer, as a separate software package, partially on the user's computer and partially on a remote computer, or entirely on a remote computer or server.
  • the remote computer may be connected to the user's computer through any type of network, including a local area network (LAN) or a wide area network (WAN), or may be connected to an external computer (e.g., using an Internet service provider to access the user's computer).
  • the electronic circuit such as a programmable logic circuit, a field programmable gate array (FPGA), or a programmable logic array (PLA), can execute computer-readable program instructions by utilizing the state information of the computer-readable program instructions to implement various aspects of the present invention.
  • These computer-readable program instructions can be provided to a processor of a general-purpose computer, a special-purpose computer, or other programmable data processing device, thereby producing a machine, so that when these instructions are executed by the processor of the computer or other programmable data processing device, a device that implements the functions/actions specified in one or more boxes in the flowchart and/or block diagram is generated.
  • These computer-readable program instructions can also be stored in a computer-readable storage medium, and these instructions cause the computer, programmable data processing device, and/or other equipment to work in a specific manner, so that the computer-readable medium storing the instructions includes a manufactured product, which includes instructions for implementing various aspects of the functions/actions specified in one or more boxes in the flowchart and/or block diagram.
  • Computer-readable program instructions may also be loaded onto a computer, other programmable data processing apparatus, or other device so that a series of operating steps are performed on the computer, other programmable data processing apparatus, or other device to produce a computer-implemented process, thereby causing the instructions executed on the computer, other programmable data processing apparatus, or other device to implement the functions/actions specified in one or more boxes in the flowchart and/or block diagram.
  • each box in the flowchart or block diagram may represent a module, a program segment, or a portion of an instruction, and the module, program segment, or a portion of an instruction contains one or more executable instructions for implementing a specified logical function.
  • the functions marked in the boxes may also occur in an order different from that marked in the accompanying drawings. For example, two consecutive boxes may actually be executed substantially in parallel, and they may sometimes be executed in the opposite order, depending on the functions involved.
  • each box in the block diagram and/or flowchart, and combinations of boxes in the block diagram and/or flowchart may be It can be implemented by a dedicated hardware-based system that performs the specified functions or actions, or it can be implemented by a combination of dedicated hardware and computer instructions. It is well known to those skilled in the art that implementation by hardware, implementation by software, and implementation by a combination of software and hardware are all equivalent.

Landscapes

  • Control Of Position, Course, Altitude, Or Attitude Of Moving Bodies (AREA)

Abstract

A method for estimating a dynamic target of an unmanned aerial vehicle in an information rejection environment. The method comprises: obtaining three-dimensional angle information, the three-dimensional angle information comprising the angle of arrival of an unmanned aerial vehicle to a first anchor point, the angle of arrival of the unmanned aerial vehicle to a second anchor point, and the angle of arrival of the unmanned aerial vehicle to a target; filtering the three-dimensional angle information by using an extended Kalman filter, and obtaining estimated unmanned aerial vehicle state information and corresponding state covariance information, the unmanned aerial vehicle state information being used for characterizing the location and velocity of the unmanned aerial vehicle and the location and velocity of the target; and constructing a loss function by using the unmanned aerial vehicle state information and the state covariance information, so as to optimize the flight path of the unmanned aerial vehicle, implementing observation of the target. According to the described method, tracking the locations of an unmanned aerial vehicle and a target can be implemented by using information of two anchor points for which absolute locations are known.

Description

一种用于信息拒止环境下的无人机动态目标估计方法A method for dynamic target estimation of unmanned aerial vehicles in information denial environment 技术领域Technical Field
本发明涉及无人机技术领域,更具体地,涉及一种用于信息拒止环境下的无人机动态目标估计方法。The present invention relates to the technical field of unmanned aerial vehicles, and more specifically, to a method for estimating dynamic targets of unmanned aerial vehicles in an information denial environment.
背景技术Background technique
目标跟踪技术在各种应用中被广泛应用,如空中摄影、公共安全和人道主义搜索和救援。从不同传感器获得的噪声测量值中估计出目标的位置和速度是一个研究较为广泛的方向。由于无人机(UAV)出色的可操作性,在配备了到达角(AOA)传感器后,可被用于目标跟踪。通过解算三角几何问题配合滤波估计器,自身和目标的位置及速度可以从嘈杂的AOA测量数据中估计出来。此外,研究人员还提出了不同的估计算法,以便用非线性AOA测量值估计目标状态。例如,最大似然估计器(MLE)和伪线性估计器(PLE)作为经典的批处理滤波器,被用于从带有高斯噪声的AOA目标测量中估计目标位置及速度。基于卡尔曼的方法,被视为递归滤波器,则被更普遍地应用于目标跟踪领域。对于AOA目标跟踪,扩展卡尔曼滤波(EKF)、无迹卡尔曼滤波(UKF)、伪线性卡尔曼滤波(PLKF)和容积卡尔曼滤波器(CKF)均有不同程度的应用。Target tracking technology is widely used in various applications, such as aerial photography, public safety, and humanitarian search and rescue. Estimating the position and velocity of a target from noisy measurements obtained from different sensors is a widely studied direction. Due to the excellent maneuverability of unmanned aerial vehicles (UAVs), they can be used for target tracking after being equipped with angle of arrival (AOA) sensors. By solving trigonometric geometry problems and filter estimators, the position and velocity of the self and the target can be estimated from noisy AOA measurements. In addition, researchers have proposed different estimation algorithms to estimate the target state from nonlinear AOA measurements. For example, the maximum likelihood estimator (MLE) and pseudo linear estimator (PLE) as classic batch filters are used to estimate the target position and velocity from AOA target measurements with Gaussian noise. Kalman-based methods, which are regarded as recursive filters, are more commonly used in the field of target tracking. For AOA target tracking, the extended Kalman filter (EKF), unscented Kalman filter (UKF), pseudo linear Kalman filter (PLKF) and cubature Kalman filter (CKF) are all used to varying degrees.
现有研究表明,通过优化无人机飞行路径,使从测量中收集到的信息更加有效,可以显著提高目标跟踪性能。为提高目标估计精度,应设计一个成本函数用于无人机路径优化。常见的成本函数包括估计协方差矩阵和费雪信息矩阵(FIM)等。可采用各种算法优化成本函数,如基于梯度、穷举搜索和基于学习的方法。Existing studies have shown that the target tracking performance can be significantly improved by optimizing the UAV flight path to make the information collected from the measurements more effective. To improve the target estimation accuracy, a cost function should be designed for UAV path optimization. Common cost functions include the estimated covariance matrix and the Fisher Information Matrix (FIM). Various algorithms can be used to optimize the cost function, such as gradient-based, exhaustive search, and learning-based methods.
在现有技术中,无人机的位置被假定为使用外部信息并且可以准确获取,例如全球定位系统(GPS)。然而,无人机可能在外部信号缺失的环境中工作(或称为信息拒止环境),例如,室内空间和干扰区域。在这些区域中,无人机的外部信号缺失,无法获得其位置。目前,在未知自身位置的情况下进行目标定位引起了很多人的兴趣。例如,在同步定位和测绘(SLAM) 应用中,增加周围锚点的额外信息(如附近建筑物的位置)来获得绝对目标位置。在无线通信中,通过多锚点进行目标定位(如基站定位)已经较为成熟,但由于其基本通过到达时间差(TDOA)的方式进行,所需的锚点数必须大于等于3或者引入其它先验知识,才能唯一确定目标位置。In the prior art, the location of the drone is assumed to be accurately obtained using external information, such as the Global Positioning System (GPS). However, the drone may operate in an environment where external signals are missing (or called an information denial environment), such as indoor spaces and interference areas. In these areas, the drone's external signals are missing and its location cannot be obtained. Currently, target positioning without knowing its own location has attracted a lot of interest. For example, in Simultaneous Localization and Mapping (SLAM) In the application, additional information about surrounding anchor points (such as the location of nearby buildings) is added to obtain the absolute target location. In wireless communications, target positioning through multiple anchor points (such as base station positioning) is already relatively mature, but since it is basically performed through the time difference of arrival (TDOA), the number of anchor points required must be greater than or equal to 3 or other prior knowledge must be introduced to uniquely determine the target location.
经分析,在现有技术中,无论是单纯的基于到达时间差(TDOA)还是到达角(AOA)的目标估计方案,都只能获取到目标相对于无人机的距离及角度,不能获得其绝对位置。After analysis, it is found that in the existing technology, whether it is a target estimation scheme based solely on time difference of arrival (TDOA) or angle of arrival (AOA), it can only obtain the distance and angle of the target relative to the drone, but cannot obtain its absolute position.
发明内容Summary of the invention
本发明的目的是克服上述现有技术的缺陷,提供一种用于信息拒止环境下的无人机动态目标估计方法,该方法包括:The purpose of the present invention is to overcome the defects of the above-mentioned prior art and provide a method for estimating dynamic targets of unmanned aerial vehicles in an information denial environment, the method comprising:
获取三维角度信息,所述三维角度信息包括无人机到第一锚点的到达角、无人机到第二锚点的到达角以及无人机到目标的到达角;Acquire three-dimensional angle information, where the three-dimensional angle information includes an arrival angle from the drone to the first anchor point, an arrival angle from the drone to the second anchor point, and an arrival angle from the drone to the target;
对于所述三维角度信息,利于拓展卡尔曼滤波器进行滤波,获得估计的无人机状态信息以及对应的状态协方差信息,所述无人机状态信息用于表征无人机的位置、速度以及目标的位置和速度;The three-dimensional angle information is filtered by an extended Kalman filter to obtain estimated drone state information and corresponding state covariance information, wherein the drone state information is used to characterize the position and speed of the drone and the position and speed of the target;
利用所述无人机状态信息和所述状态协方差信息构建损失函数,以优化无人机后续时刻的飞行路径,实现对目标的观测。The loss function is constructed using the drone state information and the state covariance information to optimize the flight path of the drone at subsequent moments and achieve observation of the target.
与现有技术相比,本发明的优点在于,提供一种基于方位角(角度,可由PTZ相机获取)与锚点的无人机自主目标跟踪方法,通过引入已知绝对位置的锚点,可为无人机在外部信号(GPS、RTK等)丢失环境下提供自身及目标的绝对位置信息,并对其进行跟踪,为无人机自身与目标的定位跟踪提供了更健壮的解决方案。本发明可应用于多旋翼无人机、固定翼无人机等多种无人飞行器的定位系统,既可作为自身定位的冗余手段,又可根据方位角提供目标的绝对地理坐标,在提升定位速度的同时,也提高了定位精度。Compared with the prior art, the advantage of the present invention is that it provides a method for autonomous target tracking of drones based on azimuth (angle, which can be obtained by PTZ camera) and anchor points. By introducing anchor points with known absolute positions, the drone can provide the drone with the absolute position information of itself and the target in an environment where external signals (GPS, RTK, etc.) are lost, and track the target, providing a more robust solution for positioning and tracking of the drone itself and the target. The present invention can be applied to the positioning systems of various unmanned aerial vehicles such as multi-rotor drones and fixed-wing drones. It can be used as a redundant means of self-positioning, and can also provide the absolute geographic coordinates of the target according to the azimuth, which improves the positioning speed and the positioning accuracy.
通过以下参照附图对本发明的示例性实施例的详细描述,本发明的其它特征及其优点将会变得清楚。Further features and advantages of the present invention will become apparent from the following detailed description of exemplary embodiments of the present invention with reference to the attached drawings.
附图说明 BRIEF DESCRIPTION OF THE DRAWINGS
被结合在说明书中并构成说明书的一部分的附图示出了本发明的实施例,并且连同其说明一起用于解释本发明的原理。The accompanying drawings, which are incorporated in and constitute a part of the specification, illustrate embodiments of the invention and, together with the description, serve to explain the principles of the invention.
图1是根据本发明一个实施例的目标跟踪的几何示意图;FIG1 is a geometric diagram of target tracking according to an embodiment of the present invention;
图2是根据本发明一个实施例的用于信息拒止环境下的无人机动态目标估计方法的流程图;FIG2 is a flow chart of a method for estimating a dynamic target of a UAV in an information denial environment according to an embodiment of the present invention;
图3是根据本发明一个实施例的拓展卡尔曼滤波的流程图;FIG3 is a flow chart of an extended Kalman filter according to an embodiment of the present invention;
图4是根据本发明一个实施例的构建损失函数的示意图;FIG4 is a schematic diagram of constructing a loss function according to an embodiment of the present invention;
图5是根据本发明一个实施例的路径优化方法的过程示意图。FIG. 5 is a process diagram of a path optimization method according to an embodiment of the present invention.
具体实施方式Detailed ways
现在将参照附图来详细描述本发明的各种示例性实施例。应注意到:除非另外具体说明,否则在这些实施例中阐述的部件和步骤的相对布置、数字表达式和数值不限制本发明的范围。Various exemplary embodiments of the present invention will now be described in detail with reference to the accompanying drawings. It should be noted that the relative arrangement of components and steps, numerical expressions and numerical values set forth in these embodiments do not limit the scope of the present invention unless otherwise specifically stated.
以下对至少一个示例性实施例的描述实际上仅仅是说明性的,决不作为对本发明及其应用或使用的任何限制。The following description of at least one exemplary embodiment is merely illustrative in nature and is in no way intended to limit the invention, its application, or uses.
对于相关领域普通技术人员已知的技术、方法和设备可能不作详细讨论,但在适当情况下,所述技术、方法和设备应当被视为说明书的一部分。Technologies, methods, and equipment known to ordinary technicians in the relevant art may not be discussed in detail, but where appropriate, the technologies, methods, and equipment should be considered as part of the specification.
在这里示出和讨论的所有例子中,任何具体值应被解释为仅仅是示例性的,而不是作为限制。因此,示例性实施例的其它例子可以具有不同的值。In all examples shown and discussed herein, any specific values should be interpreted as merely exemplary and not limiting. Therefore, other examples of the exemplary embodiments may have different values.
应注意到:相似的标号和字母在下面的附图中表示类似项,因此,一旦某一项在一个附图中被定义,则在随后的附图中不需要对其进行进一步讨论。It should be noted that like reference numerals and letters refer to similar items in the following figures, and therefore, once an item is defined in one figure, it need not be further discussed in subsequent figures.
本发明提供一套外部信号缺失环境下的无人机自主目标定位方法,主要包括目标滤波估计、无人机路径优化两部分。该方法引入两个已知绝对位置的静止标志物,以到达角(AOA)传感器采集的三维角度信息(包括无人机与锚点1的到达角θub1,无人机与锚点2的到达角θub2,无人机与移动目标的到达角θup)为输入,在连续的观测一段时间后,可以将移动目标和无人机自身的位置与速度误差收敛到较低的范围内,最终输出自身和目标的位置及速度。由于传感器位置会影响滤波估计器对目标的观测,在滤波后引入 路径优化算法改善目标定位的效率及准确度。此外,为了无人机的安全,在路径规划时为无人机在目标及地标周围设置了不同大小的禁飞区。The present invention provides a method for autonomous target positioning of unmanned aerial vehicles in an environment lacking external signals, which mainly includes two parts: target filtering estimation and unmanned aerial vehicle path optimization. The method introduces two stationary markers with known absolute positions, and uses the three-dimensional angle information collected by the angle of arrival (AOA) sensor (including the angle of arrival θ ub1 between the unmanned aerial vehicle and anchor point 1, the angle of arrival θ ub2 between the unmanned aerial vehicle and anchor point 2, and the angle of arrival θ up between the unmanned aerial vehicle and the mobile target) as input. After continuous observation for a period of time, the position and velocity errors of the mobile target and the unmanned aerial vehicle itself can be converged to a lower range, and finally the position and velocity of itself and the target are output. Since the sensor position will affect the filter estimator's observation of the target, the method is introduced after filtering. The path optimization algorithm improves the efficiency and accuracy of target positioning. In addition, for the safety of the drone, different sizes of no-fly zones are set around the target and landmarks for the drone during path planning.
参见图1的目标跟踪几何示意图,示出了k时刻移动目标、无人机、锚点1(anchor1)和锚点2(anchor2)之间的位置关系,其中锚点1和锚点2是已知绝对位置的标志物。例如,锚点1的坐标为b1=[xb1,yb1]T,锚点2的坐标为b2=[xb2,yb2]T,T表示转置;移动目标在k时刻的坐标为pk=[xpk,ypk]T,坐标未知,无人机在k时刻的位置标记为uk(未知)。k是离散时间标识,设定离散的时间间隔为M。Referring to the target tracking geometry diagram in FIG1 , the positional relationship between the mobile target, the drone, anchor 1 (anchor1) and anchor 2 (anchor2) at time k is shown, where anchor 1 and anchor 2 are markers with known absolute positions. For example, the coordinates of anchor 1 are b 1 = [x b1 , y b1 ] T , the coordinates of anchor 2 are b 2 = [x b2 , y b2 ] T , and T represents transposition; the coordinates of the mobile target at time k are p k = [x pk , y pk ] T , the coordinates are unknown, and the position of the drone at time k is marked as uk (unknown). k is a discrete time identifier, and the discrete time interval is set to M.
参见图2所示,所提供的用于信息拒止环境下的无人机动态目标估计方法主要包括:步骤S110,获取AOA传感器量测;步骤S120,拓展卡尔曼滤波器;步骤S130,基于梯度下降的路径优化算法;步骤S140,判断是否在禁飞距离范围内;如判断为是,则执行路径重规划(步骤S150);进而以固定速度向指定方向移动(步骤S160)。As shown in FIG. 2 , the provided method for estimating dynamic targets of unmanned aerial vehicles in an information denial environment mainly includes: step S110, obtaining AOA sensor measurements; step S120, extending the Kalman filter; step S130, a path optimization algorithm based on gradient descent; step S140, determining whether it is within the no-fly distance range; if the determination is yes, performing path replanning (step S150); and then moving in a specified direction at a fixed speed (step S160).
在下文中,将具体介绍拓展卡尔曼滤波器及梯度下降优化算法。滤波器可采用递归式的卡尔曼滤波器,由于传感器量测与模型成非线性关系,因此最终以拓展卡尔曼滤波器为基础进行非线性滤波。In the following, the extended Kalman filter and the gradient descent optimization algorithm will be specifically introduced. The filter can use a recursive Kalman filter. Since the sensor measurement and the model are in a nonlinear relationship, nonlinear filtering is ultimately performed based on the extended Kalman filter.
参见图3所示的拓展卡尔曼滤波的过程,主要包括以下步骤:Referring to the process of extended Kalman filtering shown in FIG3 , the process mainly includes the following steps:
步骤S210,根据状态转移模型和初始状态,预测先验状态和协方差。
Xk|k-1=FXk-1|k-1+mk (1)
Pk|k-1=FXk-1|k-1FT+Qk (2)
Step S210, predicting the prior state and covariance according to the state transition model and the initial state.
X k|k-1 = FX k-1|k-1 + m k (1)
P k|k-1 =FX k-1|k-1 F T +Q k (2)
其中,Xk|k-1为基于k-1时刻状态Xk-1|k-1的先验估计;F为状态转移矩阵;Pk|k-1为状态Xk|k-1的协方差矩阵,用于表征状态Xk|k-1的不确定程度;mk为过程噪声,用于量化未能充分考虑的系统误差,例如,mk是独立的零均值加性高斯白噪声,即mk~N(0,Qk),Qk为系统误差mk的协方差矩阵;Wherein, X k|k-1 is the prior estimate based on the state X k-1|k-1 at time k-1; F is the state transfer matrix; P k|k-1 is the covariance matrix of the state X k|k-1 , which is used to characterize the uncertainty of the state X k|k-1 ; m k is the process noise, which is used to quantify the system error that has not been fully considered. For example, m k is an independent zero-mean additive white Gaussian noise, that is, m k ~N(0,Q k ), and Q k is the covariance matrix of the system error m k ;
状态矩阵X由无人机自身的位置、速度以及目标的位置、速度构成,例如,其中,表示xuk对时间的导数(速度),具体第,xuk是无人机在x轴的位置,是无人机在x轴的速度分量,yuk是无人机在y轴的位置,是无人机在y轴的速度分量,xpk是目标在x轴的位置,是目标在x轴的速度分量,ypk是目标在y轴的位置, 是目标在y轴的速度分量。The state matrix X consists of the position and velocity of the drone itself and the position and velocity of the target, for example, in, represents the derivative (speed) of xuk with respect to time. Specifically, xuk is the position of the drone on the x-axis. is the velocity component of the drone on the x-axis, yuk is the position of the drone on the y-axis, is the velocity component of the drone on the y-axis, x pk is the position of the target on the x-axis, is the velocity component of the target on the x-axis, y pk is the position of the target on the y-axis, is the velocity component of the target on the y-axis.
K-1时刻的状态Xk-1|k-1通过状态转移矩阵F转移到先验状态Xk|k-1,其中状态转移假设为速度恒定模型,因此,状态转移矩阵为其中Fi即Pk+1=Pk+VkM。The state X k-1|k- 1 at time K-1 is transferred to the prior state X k|k-1 through the state transfer matrix F, where the state transfer is assumed to be a constant speed model. Therefore, the state transfer matrix is Where Fi is That is, P k+1 =P k +V k M.
步骤S220,根据当前状态和量测计算雅各比矩阵。
Hk=Jacobian(Xk|k-1,Zk) (2)
Step S220, calculating the Jacobian matrix according to the current state and measurement.
H k = Jacobian(X k|k-1 ,Z k ) (2)
其中,Zk为k时刻通过AOA传感器观测到的三维角度信息,即Zk=[θb1kb2kpk]T,Hk为k时刻的量测矩阵,该矩阵为3行8列的雅各比矩阵,表示为:
Wherein, Z k is the three-dimensional angle information observed by the AOA sensor at time k, that is, Z k = [θ b1k , θ b2k , θ pk ] T , H k is the measurement matrix at time k, which is a Jacobi matrix with 3 rows and 8 columns, expressed as:
其中,dub1=||uk-b1||2,dub2=||uk-b2||2,dup=||uk-pk||2Among them, dub1 =|| uk - b1 || 2 , dub2 =||uk - b2 || 2 , dup =||uk - pk || 2 .
步骤S230,根据量测模型计算估计状态残差。
Step S230, calculating the estimated state residual according to the measurement model.
其中,为残差,表示量测与估计之间的误差;函数h(·)为非线性的量测函数,将数据从状态空间转换到量测空间,具体展开为:
in, is the residual, which represents the error between measurement and estimation; function h(·) is a nonlinear measurement function, which transforms data from state space to measurement space, which is specifically expanded as follows:
其中,nk表示传感器量测噪声。例如,是独立的零均值加性高斯白噪声,即nk~N(0,Rk)。Wherein, n k represents the sensor measurement noise, for example, is an independent zero-mean additive Gaussian white noise, that is, n k ~N(0,R k ).
步骤S240,计算卡尔曼增益。

Step S240, calculating the Kalman gain.

其中,Rk为测量噪声,用于表征传感器量测数据的噪声情况,例如, 可以从传感器的规格书中获取;Sk为简化书写的中间变量;Kk为k时刻的卡尔曼增益。Where Rk is the measurement noise, which is used to characterize the noise of the sensor measurement data. For example, It can be obtained from the specification sheet of the sensor; Sk is an intermediate variable for simplified writing; Kk is the Kalman gain at time k.
步骤S250,更新后验估计和协方差。

Pk|k=(I-KkHk)Pk|k-1 (9)
Step S250, updating the posterior estimate and covariance.

P k|k =(IK k H k )P k|k-1 (9)
其中,Xk|k为后验估计状态;Pk|k为后验协方差矩阵;I为8*8的单位矩阵。Among them, X k|k is the posterior estimation state; P k|k is the posterior covariance matrix; I is the 8*8 identity matrix.
对于路径优化算法,其以当前状态作为输入,通过梯度下降的方式预测能有效降低损失函数(或称代价函数)的无人机的方向,从而使无人机在沿该方向移动时能获得最佳的观测路径。For the path optimization algorithm, it takes the current state as input and predicts the direction of the drone that can effectively reduce the loss function (or cost function) through gradient descent, so that the drone can obtain the best observation path when moving in this direction.
图4是构建损失函数的示意,其以当前状态Xk|k和小位移向量δ为输入,将其传入原滤波器的拷贝,完成一步预测,从而得到以小位移δ移动得到的状态协方差矩阵,最终以协方差矩阵的迹为输出。FIG4 is a schematic diagram of constructing a loss function, which takes the current state X k|k and a small displacement vector δ as input, passes it to a copy of the original filter, completes a one-step prediction, and obtains the state covariance matrix obtained by moving with a small displacement δ, and finally outputs the trace of the covariance matrix.
结合图4所示,构建损失函数具体包括:
Xk|k-1=FXk-1|k-1+mk (10)
Pk|k-1=FXk-1|k-1FT+Qk (11)
Xk+1|k,δ=Xk+1|k+δ (12)
As shown in Figure 4, constructing the loss function specifically includes:
X k|k-1 = FX k-1|k-1 + m k (10)
P k|k-1 =FX k-1|k-1 F T +Q k (11)
X k+1|k,δ =X k+1|k +δ (12)
其中,δ为小位移向量;Xk+1|k,δ为状态Xk+1|k经过小位移之后的状态。Among them, δ is a small displacement vector; X k+1|k,δ is the state of state X k+1|k after a small displacement.
然后,根据Xk+1|k,δ计算Hk+1Then, H k+1 is calculated according to X k+1|k,δ .
计算: calculate:
计算: calculate:
计算:Pk+1|k+1=(I-Kk+1Hk+1)Pk+1|k (15)Calculation: P k+1|k+1 =(IK k+1 H k+1 )P k+1|k (15)
计算:J(X`k+1|k+1)=tr(Pk+1|k+1) (16)Calculation: J(X` k+1|k+1 )=tr(P k+1|k+1 ) (16)
其中,tr(·)为取矩阵迹;J(·)为经过小位移之后的代价值。Among them, tr(·) is the matrix trace; J(·) is the cost value after a small shift.
在一个实施例中,假设无人机在2D平面进行移动,无人机的移动具有两个自由度,因此可分解为沿x轴与y轴共两个分量,又可分正负两个方向,组合后得到4个可移动的方向[(d,0),(-d,0),(0,d),(0,-d)]。以上述四个方向进行移动,得到各个方向移动的代价值,从而得到各个方向移动进行观测好坏程度的度量。 In one embodiment, assuming that the drone moves in a 2D plane, the movement of the drone has two degrees of freedom, so it can be decomposed into two components along the x-axis and the y-axis, which can be divided into positive and negative directions. After combination, four movable directions [(d,0), (-d,0), (0,d), (0,-d)] are obtained. By moving in the above four directions, the cost value of moving in each direction is obtained, and thus the measurement of the degree of observation quality of moving in each direction is obtained.
图5是路径优化过程的示意图,其中一个分支用于计算无人机沿x轴飞行的代价值,另一个分支用于计算无人机沿y轴飞行的代价值,具体包括以下步骤:FIG5 is a schematic diagram of the path optimization process, in which one branch is used to calculate the cost value of the UAV flying along the x-axis, and the other branch is used to calculate the cost value of the UAV flying along the y-axis, which specifically includes the following steps:
以δ=[d,0]为参数计算沿x轴正方向移动的代价值Jxp,其中,d为较小的标量,表示步长;The cost value J xp of moving along the positive direction of the x-axis is calculated using δ = [d, 0] as a parameter, where d is a smaller scalar, indicating the step size;
以δ=[-d,0]为参数计算沿x轴负方向移动的代价值JxnCalculate the cost J xn of moving along the negative x-axis direction using δ = [-d, 0] as a parameter;
计算无人机沿x轴方向飞行的总代价 Calculate the total cost of the drone flying along the x-axis
以δ=[0,d]为参数计算沿y轴正方向移动的代价值JypCalculate the cost J yp of moving along the positive direction of the y-axis using δ = [0, d] as a parameter;
以δ=[0,-d]为参数计算沿y轴负方向移动的代价值JynCalculate the cost J yn of moving along the negative direction of the y-axis using δ = [0, -d] as a parameter;
计算无人机沿y轴方向飞行的总代价 Calculate the total cost of the drone flying along the y-axis
计算总代价值:J=[Jx,Jy];Calculate the total cost: J = [J x ,J y ];
计算k时刻经过优化后的方向,表示为:
Calculate the optimized direction at time k, expressed as:
其中,||.||2为L2正则,v表示无人机恒定的飞行速度。Among them, ||.|| 2 is the L2 regularization, and v represents the constant flight speed of the drone.
在完成路径规划后,由于越靠近标志物或目标,获得的信息会更加准确,这个现象会导致无人机会向标志物或目标不断靠近,直至重合发生撞击。因此,在路径规划完成后,需要根据无人机与标志物或目标的位置设置禁飞区,以避免无人机飞行过近,发生危险。After completing the path planning, the closer the drone is to the marker or target, the more accurate the information it obtains. This phenomenon will cause the drone to keep getting closer to the marker or target until they overlap and collide. Therefore, after completing the path planning, it is necessary to set a no-fly zone based on the position of the drone and the marker or target to prevent the drone from flying too close and causing danger.
综上,本发明引入标志物(一个或多个)配合AOA传感器完成无人机自身坐标获取,并通过引入标志物(一个或多个)配合AOA传感器完成目标坐标获取。此外,通过随机扰动的方式完成代价函数梯度下降进而优化无人机飞行路径,获得针对移动目标的最佳观测。本发明仅需要例如PTZ相机量测无人机与锚点、目标之间的方位角,无需其它任何信息即可获得无人机与目标的绝对位置信息,至少采用两个已知绝对位置的锚点信息即可完成无人机和目标的位置跟踪。经验证,本发明提升了基于AOA实现目标跟踪的效率及精度。In summary, the present invention introduces markers (one or more) to cooperate with the AOA sensor to complete the acquisition of the drone's own coordinates, and introduces markers (one or more) to cooperate with the AOA sensor to complete the acquisition of the target coordinates. In addition, the gradient descent of the cost function is completed by random perturbation to optimize the flight path of the drone and obtain the best observation of the moving target. The present invention only requires, for example, a PTZ camera to measure the azimuth between the drone and the anchor point and the target. The absolute position information of the drone and the target can be obtained without any other information. At least two anchor point information with known absolute positions can be used to complete the position tracking of the drone and the target. It has been verified that the present invention improves the efficiency and accuracy of target tracking based on AOA.
本发明可以是系统、方法和/或计算机程序产品。计算机程序产品可以包括计算机可读存储介质,其上载有用于使处理器实现本发明的各个方面的计算机可读程序指令。 The present invention may be a system, a method and/or a computer program product. The computer program product may include a computer-readable storage medium carrying computer-readable program instructions for causing a processor to implement various aspects of the present invention.
计算机可读存储介质可以是可以保持和存储由指令执行设备使用的指令的有形设备。计算机可读存储介质例如可以是但不限于电存储设备、磁存储设备、光存储设备、电磁存储设备、半导体存储设备或者上述的任意合适的组合。计算机可读存储介质的更具体的例子(非穷举的列表)包括:便携式计算机盘、硬盘、随机存取存储器(RAM)、只读存储器(ROM)、可擦式可编程只读存储器(EPROM或闪存)、静态随机存取存储器(SRAM)、便携式压缩盘只读存储器(CD-ROM)、数字多功能盘(DVD)、记忆棒、软盘、机械编码设备、例如其上存储有指令的打孔卡或凹槽内凸起结构、以及上述的任意合适的组合。这里所使用的计算机可读存储介质不被解释为瞬时信号本身,诸如无线电波或者其他自由传播的电磁波、通过波导或其他传输媒介传播的电磁波(例如,通过光纤电缆的光脉冲)、或者通过电线传输的电信号。Computer readable storage medium can be a tangible device that can hold and store instructions used by an instruction execution device. Computer readable storage medium can be, for example, but not limited to, an electrical storage device, a magnetic storage device, an optical storage device, an electromagnetic storage device, a semiconductor storage device, or any suitable combination thereof. More specific examples (non-exhaustive list) of computer readable storage medium include: a portable computer disk, a hard disk, a random access memory (RAM), a read-only memory (ROM), an erasable programmable read-only memory (EPROM or flash memory), a static random access memory (SRAM), a portable compact disk read-only memory (CD-ROM), a digital versatile disk (DVD), a memory stick, a floppy disk, a mechanical encoding device, for example, a punch card or a convex structure in a groove on which instructions are stored, and any suitable combination thereof. The computer readable storage medium used here is not interpreted as a transient signal itself, such as a radio wave or other freely propagating electromagnetic wave, an electromagnetic wave propagated by a waveguide or other transmission medium (for example, a light pulse by an optical fiber cable), or an electrical signal transmitted by a wire.
这里所描述的计算机可读程序指令可以从计算机可读存储介质下载到各个计算/处理设备,或者通过网络、例如因特网、局域网、广域网和/或无线网下载到外部计算机或外部存储设备。网络可以包括铜传输电缆、光纤传输、无线传输、路由器、防火墙、交换机、网关计算机和/或边缘服务器。每个计算/处理设备中的网络适配卡或者网络接口从网络接收计算机可读程序指令,并转发该计算机可读程序指令,以供存储在各个计算/处理设备中的计算机可读存储介质中。The computer-readable program instructions described herein can be downloaded from a computer-readable storage medium to each computing/processing device, or downloaded to an external computer or external storage device via a network, such as the Internet, a local area network, a wide area network, and/or a wireless network. The network can include copper transmission cables, optical fiber transmissions, wireless transmissions, routers, firewalls, switches, gateway computers, and/or edge servers. The network adapter card or network interface in each computing/processing device receives the computer-readable program instructions from the network and forwards the computer-readable program instructions for storage in the computer-readable storage medium in each computing/processing device.
用于执行本发明操作的计算机程序指令可以是汇编指令、指令集架构(ISA)指令、机器指令、机器相关指令、微代码、固件指令、状态设置数据、或者以一种或多种编程语言的任意组合编写的源代码或目标代码,所述编程语言包括面向对象的编程语言—诸如Smalltalk、C++、Python等,以及常规的过程式编程语言—诸如“C”语言或类似的编程语言。计算机可读程序指令可以完全地在用户计算机上执行、部分地在用户计算机上执行、作为一个独立的软件包执行、部分在用户计算机上部分在远程计算机上执行、或者完全在远程计算机或服务器上执行。在涉及远程计算机的情形中,远程计算机可以通过任意种类的网络—包括局域网(LAN)或广域网(WAN)—连接到用户计算机,或者,可以连接到外部计算机(例如利用因特网服务提供商来 通过因特网连接)。在一些实施例中,通过利用计算机可读程序指令的状态信息来个性化定制电子电路,例如可编程逻辑电路、现场可编程门阵列(FPGA)或可编程逻辑阵列(PLA),该电子电路可以执行计算机可读程序指令,从而实现本发明的各个方面。The computer program instructions for performing the operations of the present invention may be assembler instructions, instruction set architecture (ISA) instructions, machine instructions, machine dependent instructions, microcode, firmware instructions, state setting data, or source code or object code written in any combination of one or more programming languages, including object-oriented programming languages such as Smalltalk, C++, Python, etc., and conventional procedural programming languages such as "C" language or similar programming languages. The computer readable program instructions may be executed entirely on the user's computer, partially on the user's computer, as a separate software package, partially on the user's computer and partially on a remote computer, or entirely on a remote computer or server. In the case of a remote computer, the remote computer may be connected to the user's computer through any type of network, including a local area network (LAN) or a wide area network (WAN), or may be connected to an external computer (e.g., using an Internet service provider to access the user's computer). In some embodiments, the electronic circuit, such as a programmable logic circuit, a field programmable gate array (FPGA), or a programmable logic array (PLA), can execute computer-readable program instructions by utilizing the state information of the computer-readable program instructions to implement various aspects of the present invention.
这里参照根据本发明实施例的方法、装置(系统)和计算机程序产品的流程图和/或框图描述了本发明的各个方面。应当理解,流程图和/或框图的每个方框以及流程图和/或框图中各方框的组合,都可以由计算机可读程序指令实现。Various aspects of the present invention are described herein with reference to the flow charts and/or block diagrams of the methods, devices (systems) and computer program products according to embodiments of the present invention. It should be understood that each box of the flow chart and/or block diagram and the combination of each box in the flow chart and/or block diagram can be implemented by computer-readable program instructions.
这些计算机可读程序指令可以提供给通用计算机、专用计算机或其它可编程数据处理装置的处理器,从而生产出一种机器,使得这些指令在通过计算机或其它可编程数据处理装置的处理器执行时,产生了实现流程图和/或框图中的一个或多个方框中规定的功能/动作的装置。也可以把这些计算机可读程序指令存储在计算机可读存储介质中,这些指令使得计算机、可编程数据处理装置和/或其他设备以特定方式工作,从而,存储有指令的计算机可读介质则包括一个制造品,其包括实现流程图和/或框图中的一个或多个方框中规定的功能/动作的各个方面的指令。These computer-readable program instructions can be provided to a processor of a general-purpose computer, a special-purpose computer, or other programmable data processing device, thereby producing a machine, so that when these instructions are executed by the processor of the computer or other programmable data processing device, a device that implements the functions/actions specified in one or more boxes in the flowchart and/or block diagram is generated. These computer-readable program instructions can also be stored in a computer-readable storage medium, and these instructions cause the computer, programmable data processing device, and/or other equipment to work in a specific manner, so that the computer-readable medium storing the instructions includes a manufactured product, which includes instructions for implementing various aspects of the functions/actions specified in one or more boxes in the flowchart and/or block diagram.
也可以把计算机可读程序指令加载到计算机、其它可编程数据处理装置、或其它设备上,使得在计算机、其它可编程数据处理装置或其它设备上执行一系列操作步骤,以产生计算机实现的过程,从而使得在计算机、其它可编程数据处理装置、或其它设备上执行的指令实现流程图和/或框图中的一个或多个方框中规定的功能/动作。Computer-readable program instructions may also be loaded onto a computer, other programmable data processing apparatus, or other device so that a series of operating steps are performed on the computer, other programmable data processing apparatus, or other device to produce a computer-implemented process, thereby causing the instructions executed on the computer, other programmable data processing apparatus, or other device to implement the functions/actions specified in one or more boxes in the flowchart and/or block diagram.
附图中的流程图和框图显示了根据本发明的多个实施例的系统、方法和计算机程序产品的可能实现的体系架构、功能和操作。在这点上,流程图或框图中的每个方框可以代表一个模块、程序段或指令的一部分,所述模块、程序段或指令的一部分包含一个或多个用于实现规定的逻辑功能的可执行指令。在有些作为替换的实现中,方框中所标注的功能也可以以不同于附图中所标注的顺序发生。例如,两个连续的方框实际上可以基本并行地执行,它们有时也可以按相反的顺序执行,这依所涉及的功能而定。也要注意的是,框图和/或流程图中的每个方框、以及框图和/或流程图中的方框的组合,可 以用执行规定的功能或动作的专用的基于硬件的系统来实现,或者可以用专用硬件与计算机指令的组合来实现。对于本领域技术人员来说公知的是,通过硬件方式实现、通过软件方式实现以及通过软件和硬件结合的方式实现都是等价的。The flowcharts and block diagrams in the accompanying drawings show possible architectures, functions, and operations of systems, methods, and computer program products according to multiple embodiments of the present invention. In this regard, each box in the flowchart or block diagram may represent a module, a program segment, or a portion of an instruction, and the module, program segment, or a portion of an instruction contains one or more executable instructions for implementing a specified logical function. In some alternative implementations, the functions marked in the boxes may also occur in an order different from that marked in the accompanying drawings. For example, two consecutive boxes may actually be executed substantially in parallel, and they may sometimes be executed in the opposite order, depending on the functions involved. It should also be noted that each box in the block diagram and/or flowchart, and combinations of boxes in the block diagram and/or flowchart, may be It can be implemented by a dedicated hardware-based system that performs the specified functions or actions, or it can be implemented by a combination of dedicated hardware and computer instructions. It is well known to those skilled in the art that implementation by hardware, implementation by software, and implementation by a combination of software and hardware are all equivalent.
以上已经描述了本发明的各实施例,上述说明是示例性的,并非穷尽性的,并且也不限于所披露的各实施例。在不偏离所说明的各实施例的范围和精神的情况下,对于本技术领域的普通技术人员来说许多修改和变更都是显而易见的。本文中所用术语的选择,旨在最好地解释各实施例的原理、实际应用或对市场中的技术改进,或者使本技术领域的其它普通技术人员能理解本文披露的各实施例。本发明的范围由所附权利要求来限定。 Embodiments of the present invention have been described above, and the above description is exemplary, not exhaustive, and is not limited to the disclosed embodiments. Many modifications and variations will be apparent to those of ordinary skill in the art without departing from the scope and spirit of the described embodiments. The selection of terms used herein is intended to best explain the principles of the embodiments, practical applications, or technical improvements in the marketplace, or to enable other persons of ordinary skill in the art to understand the embodiments disclosed herein. The scope of the present invention is defined by the appended claims.

Claims (10)

  1. 一种用于信息拒止环境下的无人机动态目标估计方法,包括以下步骤:A method for estimating a dynamic target of an unmanned aerial vehicle in an information denial environment comprises the following steps:
    获取三维角度信息,所述三维角度信息包括无人机到第一锚点的到达角、无人机到第二锚点的到达角以及无人机到目标的到达角;Acquire three-dimensional angle information, where the three-dimensional angle information includes an arrival angle from the drone to the first anchor point, an arrival angle from the drone to the second anchor point, and an arrival angle from the drone to the target;
    对于所述三维角度信息,利于拓展卡尔曼滤波器进行滤波,获得估计的无人机状态信息以及对应的状态协方差信息,所述无人机状态信息用于表征无人机的位置、速度以及目标的位置和速度;The three-dimensional angle information is filtered by an extended Kalman filter to obtain estimated drone state information and corresponding state covariance information, wherein the drone state information is used to characterize the position and speed of the drone and the position and speed of the target;
    利用所述无人机状态信息和所述状态协方差信息构建损失函数,以优化无人机的飞行路径,实现对目标的观测。The UAV state information and the state covariance information are used to construct a loss function to optimize the flight path of the UAV and achieve observation of the target.
  2. 根据权利要求1所述的方法,其特征在于,对于所述三维角度信息,利于拓展卡尔曼滤波器进行滤波,获得估计的无人机状态信息以及对应的状态协方差信息包括:The method according to claim 1 is characterized in that, for the three-dimensional angle information, filtering is performed using an extended Kalman filter to obtain estimated drone state information and corresponding state covariance information, including:
    预测无人机的先验状态和协方差,表示为:
    Xk|k-1=FXk-1|k-1+mk
    Pk|k-1=FXk-1|k-1FT+Qk
    Predict the drone’s prior state and covariance, expressed as:
    X k|k-1 =FX k-1|k-1 +m k
    P k|k-1 =FX k-1|k-1 F T +Q k
    其中,Xk|k-1为基于k-1时刻状态Xk-1|k-1的先验估计,Pk|k-1为状态Xk|k-1的协方差矩阵,mk为过程噪声,Qk是mk的协方差矩阵, xuk是无人机在x轴的位置,是无人机在x轴的速度分量,yuk是无人机在y轴的位置,是无人机在y轴的速度分量,xpk是目标在x轴的位置,是目标在x轴的速度分量,ypk是目标在y轴的位置,是目标在y轴的速度分量,F是状态转移矩阵;Among them, X k|k-1 is the prior estimate based on the state X k-1|k-1 at time k-1, P k|k-1 is the covariance matrix of the state X k|k-1 , m k is the process noise, Q k is the covariance matrix of m k , x uk is the position of the drone on the x-axis, is the velocity component of the drone on the x-axis, yuk is the position of the drone on the y-axis, is the velocity component of the drone on the y-axis, x pk is the position of the target on the x-axis, is the velocity component of the target on the x-axis, y pk is the position of the target on the y-axis, is the velocity component of the target on the y-axis, and F is the state transfer matrix;
    计算雅各比矩阵,表示为:
    Hk=Jacobian(Xk|k-1,Zk)
    Compute the Jacobian matrix, expressed as:
    H k = Jacobian(X k|k-1 ,Z k )
    其中,Zk为k时刻观测到的三维角度信息,Hk是3行8列的雅各比矩阵 Among them, Zk is the three-dimensional angle information observed at time k, and Hk is the Jacobian matrix with 3 rows and 8 columns
    计算估计状态残差,表示为:
    Calculate the estimated state residual, expressed as:
    其中,为残差,h(Xk|k-1)表示为:
    in, is the residual, h(X k|k-1 ) is expressed as:
    计算卡尔曼增益,表示为:

    Calculate the Kalman gain, expressed as:

    其中,Rk为测量噪声,Sk是中间变量,Kk为k时刻的卡尔曼增益;Where R k is the measurement noise, S k is the intermediate variable, and K k is the Kalman gain at time k;
    更新后验估计和协方差,作为估计的无人机状态信息以及对应的状态协方差信息,表示为:

    Pk|k=(I-KkHk)Pk|k-1
    Update the posterior estimate and covariance as the estimated drone state information and the corresponding state covariance information, expressed as:

    P k|k =(IK k H k )P k|k-1
    其中,Xk|k为后验估计状态,Pk|k为后验协方差矩阵,I为8*8的单位矩阵,xb1是第一锚点的x轴坐标,yb1是第一锚点的y轴坐标,xb2是第二锚点的x轴坐标,yb2是第二锚点的y轴坐标,xpk是目标在k时刻的x轴坐标,ypk是目标在k时刻的y轴坐标,dub1=||uk-b1||2,dub2=||uk-b2||2,dup=||uk-pk||2,b1=[xb1,yb1]T,b2=[xb2,yb2]T,pk=[xpk,ypk]T,uk=[xuk,yuk]T,nk是传感器的量测噪声。Wherein, Xk |k is the posterior estimated state, Pk |k is the posterior covariance matrix, I is the 8*8 identity matrix, xb1 is the x-axis coordinate of the first anchor point, yb1 is the y-axis coordinate of the first anchor point, xb2 is the x-axis coordinate of the second anchor point, yb2 is the y-axis coordinate of the second anchor point, xpk is the x-axis coordinate of the target at time k, ypk is the y-axis coordinate of the target at time k, dub1 =|| uk - b1 || 2 , dub2 = || uk-b2|| 2 , dup =|| uk- pk || 2 , b1 =[ xb1 , yb1 ] T , b2 =[ xb2 , yb2 ] T , pk =[xpk, ypk ] T , uk =[ xuk , yuk ] T , and nk is the measurement noise of the sensor.
  3. 根据权利要求2所述的方法,其特征在于,所述损失函数根据以下步骤构建:The method according to claim 2, characterized in that the loss function is constructed according to the following steps:
    计算状态Xk+1|k经过小位移δ之后的状态,表示为:
    Xk+1|k,δ=Xk+1|k+δ;
    The state after calculating the state X k+1|k with a small displacement δ is expressed as:
    X k+1|k,δ =X k+1|k +δ;
    根据Xk+1|k,δ计算Hk+1Calculate H k+1 according to X k+1|k,δ ;
    计算 calculate
    计算 calculate
    计算Pk+1|k+1=(I-Kk+1Hk+1)Pk+1|kCalculate P k+1|k+1 =(IK k+1 H k+1 )P k+1|k ;
    计算J(X`k+1|k+1)=tr(Pk+1|k+1);Calculate J(X` k+1|k+1 )=tr(P k+1|k+1 );
    其中,tr(·)为取矩阵迹,J(·)为经过小位移之后的代价值。 Among them, tr(·) is the matrix trace, and J(·) is the cost value after a small shift.
  4. 根据权利要求3所述的方法,其特征在于,利于所述无人机状态信息和所述状态协方差信息构建损失函数,以优化无人机的飞行路径包括:The method according to claim 3, characterized in that constructing a loss function based on the drone state information and the state covariance information to optimize the flight path of the drone comprises:
    以δ=[d,0]为参数计算无人机沿x轴正方向移动的代价值JxpCalculate the cost J xp of the drone moving along the positive direction of the x-axis using δ = [d, 0] as a parameter;
    以δ=[-d,0]为参数计算无人机沿x轴负方向移动的代价值JxnCalculate the cost J xn of the drone moving along the negative x-axis using δ = [-d, 0] as a parameter;
    计算无人机沿x轴方向的总代价 Calculate the total cost of the drone along the x-axis
    以δ=[0,d]为参数计算无人机沿y轴正方向移动的代价值JypCalculate the cost J yp of the drone moving along the positive direction of the y axis using δ = [0, d] as a parameter;
    以δ=[0,-d]为参数计算无人机沿y轴负方向移动的代价值JynCalculate the cost J yn of the drone moving along the negative direction of the y axis using δ = [0, -d] as a parameter;
    计算无人机沿y轴方向的总代价 Calculate the total cost of the drone along the y-axis
    计算总代价值J=[Jx,Jy];Calculate the total cost J = [J x , J y ];
    利用计算的总代价值获得优化的k时刻的方向,表示为:
    The calculated total cost value is used to obtain the direction of the optimized k-th moment, expressed as:
    其中,||.||2为L2正则,d表示步长。Among them, ||.|| 2 is L2 regularization, and d represents the step size.
  5. 根据权利要求1所述的方法,其特征在于,所述三维角度信息利用到达角传感器测量获得。The method according to claim 1 is characterized in that the three-dimensional angle information is obtained by measuring an angle of arrival sensor.
  6. 根据权利要求1所述的方法,其特征在于,还包括:在优化无人机的飞行路径过程中,在目标及地标周围设置禁飞区。The method according to claim 1 is characterized in that it also includes: setting a no-fly zone around the target and landmark during the process of optimizing the flight path of the drone.
  7. 根据权利要求1所述的方法,其特征在于,所述无人机是无人飞行器,包括多旋翼无人机或固定翼无人机。The method according to claim 1 is characterized in that the drone is an unmanned aerial vehicle, including a multi-rotor drone or a fixed-wing drone.
  8. 根据权利要求1所述的方法,其特征在于,还包括:根据所述无人机到目标的到达角来确定目标的绝对地理坐标。The method according to claim 1 is characterized in that it also includes: determining the absolute geographic coordinates of the target based on the arrival angle of the drone to the target.
  9. 一种计算机可读存储介质,其上存储有计算机程序,其中,该计算机程序被处理器执行时实现根据权利要求1至8中任一项所述方法的步骤。A computer-readable storage medium having a computer program stored thereon, wherein the computer program, when executed by a processor, implements the steps of the method according to any one of claims 1 to 8.
  10. 一种计算机设备,包括存储器和处理器,在所述存储器上存储有能够在处理器上运行的计算机程序,其特征在于,所述处理器执行所述计算机程序时实现权利要求1至8中任一项所述的方法的步骤。 A computer device comprises a memory and a processor, wherein a computer program that can be run on the processor is stored in the memory, and wherein the processor implements the steps of any one of the methods of claims 1 to 8 when executing the computer program.
PCT/CN2023/133085 2022-12-05 2023-11-21 Method for estimating dynamic target of unmanned aerial vehicle in information rejection environment WO2024120187A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202211548332.4A CN115790603A (en) 2022-12-05 2022-12-05 Unmanned aerial vehicle dynamic target estimation method used in information rejection environment
CN202211548332.4 2022-12-05

Publications (1)

Publication Number Publication Date
WO2024120187A1 true WO2024120187A1 (en) 2024-06-13

Family

ID=85445590

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2023/133085 WO2024120187A1 (en) 2022-12-05 2023-11-21 Method for estimating dynamic target of unmanned aerial vehicle in information rejection environment

Country Status (2)

Country Link
CN (1) CN115790603A (en)
WO (1) WO2024120187A1 (en)

Also Published As

Publication number Publication date
CN115790603A (en) 2023-03-14

Similar Documents

Publication Publication Date Title
CN110554376A (en) Radar range finding method for vehicles
US20170229023A1 (en) Communication link accessibility aware navigation
CN110889808A (en) Positioning method, device, equipment and storage medium
US11353574B2 (en) System and method for tracking motion of target in indoor environment
CN114111774B (en) Vehicle positioning method, system, equipment and computer readable storage medium
JP7034353B2 (en) Positioning system and positioning method
CN113960622A (en) Real-time positioning method and device fusing laser radar and IMU sensor information
Masmitja et al. Range-only single-beacon tracking of underwater targets from an autonomous vehicle: From theory to practice
Warren et al. Towards visual teach and repeat for GPS-denied flight of a fixed-wing UAV
JP2017524932A (en) Video-assisted landing guidance system and method
Chen et al. An optimal selection of sensors in multi-sensor fusion navigation with factor graph
Ivancsits et al. Visual navigation system for small unmanned aerial vehicles
Wen et al. GNSS/LiDAR integration aided by self-adaptive Gaussian mixture models in urban scenarios: An approach robust to non-Gaussian noise
Lutz et al. Visual-inertial SLAM aided estimation of anchor poses and sensor error model parameters of UWB radio modules
Santos et al. Applying deep neural networks to improve uav navigation in satellite-less environments
Sun et al. RTS assisted Kalman filtering for robot localization using UWB measurement
JP2023549684A (en) Method and system for magnetic-based cooperative positioning
US11536797B2 (en) Mobile network localization
WO2022045982A1 (en) Unmanned aerial vehicle and localization method for unmanned aerial vehicle
WO2024120187A1 (en) Method for estimating dynamic target of unmanned aerial vehicle in information rejection environment
CN117320148A (en) Multi-source data fusion positioning method, system, electronic equipment and storage medium
Zhang et al. Adaptive cubature Kalman filter based on the variance-covariance components estimation
Shen et al. Scalable cooperative localization with minimal sensor configuration
KR101862065B1 (en) Vision-based wind estimation apparatus and method using flight vehicle
Opromolla et al. PCA‐Based Line Detection from Range Data for Mapping and Localization‐Aiding of UAVs