WO2022194158A1 - Target tracking method and apparatus, device, and medium - Google Patents

Target tracking method and apparatus, device, and medium Download PDF

Info

Publication number
WO2022194158A1
WO2022194158A1 PCT/CN2022/080985 CN2022080985W WO2022194158A1 WO 2022194158 A1 WO2022194158 A1 WO 2022194158A1 CN 2022080985 W CN2022080985 W CN 2022080985W WO 2022194158 A1 WO2022194158 A1 WO 2022194158A1
Authority
WO
WIPO (PCT)
Prior art keywords
target
position information
video frame
shooting position
coordinate system
Prior art date
Application number
PCT/CN2022/080985
Other languages
French (fr)
Chinese (zh)
Inventor
郭亨凯
杜思聪
Original Assignee
北京字跳网络技术有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 北京字跳网络技术有限公司 filed Critical 北京字跳网络技术有限公司
Publication of WO2022194158A1 publication Critical patent/WO2022194158A1/en
Priority to US18/468,647 priority Critical patent/US20240005552A1/en

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00Image analysis
    • G06T7/20Analysis of motion
    • G06T7/246Analysis of motion using feature-based methods, e.g. the tracking of corners or segments
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00Image analysis
    • G06T7/70Determining position or orientation of objects or cameras
    • G06T7/73Determining position or orientation of objects or cameras using feature-based methods
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/10Image acquisition modality
    • G06T2207/10016Video; Image sequence
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/20Special algorithmic details
    • G06T2207/20112Image segmentation details
    • G06T2207/20164Salient point detection; Corner detection

Definitions

  • Embodiments of the present disclosure also provide a target tracking device, the device comprising:
  • the embodiment of the present disclosure also provides a computer program, and the computer program is used to execute the target tracking method provided by the embodiment of the present disclosure.
  • FIG. 3 is a schematic diagram of another normal vector projection provided by an embodiment of the present disclosure.
  • the shooting position module 303 is used for:
  • the initial shooting position information is transformed from the information of the first coordinate system to the information of the second coordinate system based on the target transformation relationship, so as to obtain the target shooting position information.
  • the units involved in the embodiments of the present disclosure may be implemented in a software manner, and may also be implemented in a hardware manner. Among them, the name of the unit does not constitute a limitation of the unit itself under certain circumstances.
  • determining the target shooting position information according to the initial shooting position information and the target transformation relationship includes:
  • the present disclosure provides a target tracking device, including:

Landscapes

  • Engineering & Computer Science (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • Multimedia (AREA)
  • Studio Devices (AREA)
  • Image Analysis (AREA)

Abstract

Embodiments of the present disclosure relate to a target tracking method and apparatus, a device, and a medium. The method comprises: obtaining a target video frame; determining the position information of a target region in the target video frame by performing feature detection on the target video frame; if it is failed to determine the position information of the target region in the target video frame, determining target photographing position information; and according to the target photographing position information and a camera projection algorithm, determining the position information of the target region in the target video frame again.

Description

一种目标跟踪方法、装置、设备及介质A target tracking method, device, equipment and medium
本申请要求于2021年03月15日提交中国专利局、申请号为202110276360.4、名称为“一种目标跟踪方法、装置、设备及介质”的中国专利申请的优先权,其全部内容通过引用结合在本申请中。This application claims the priority of the Chinese patent application with the application number 202110276360.4 and titled "A Target Tracking Method, Apparatus, Equipment and Medium" filed with the China Patent Office on March 15, 2021, the entire contents of which are incorporated by reference in in this application.
技术领域technical field
本公开涉及视频处理技术领域,尤其涉及一种目标跟踪方法、装置、设备及介质。The present disclosure relates to the technical field of video processing, and in particular, to a target tracking method, apparatus, device and medium.
背景技术Background technique
随着智能终端技术的不断发展,对视频内容识别和跟踪等需求日益增加。With the continuous development of intelligent terminal technology, the demand for video content identification and tracking is increasing.
目前,通过边缘特征可以对视频中各视频帧进行目标跟踪,但是上述方式在相机运动较快时会因检测不到目标而导致跟踪失败,鲁棒性较差。At present, each video frame in the video can be tracked through edge features, but the above method will fail to detect the target when the camera moves quickly, resulting in poor robustness.
发明内容SUMMARY OF THE INVENTION
为了解决上述技术问题或者至少部分地解决上述技术问题,本公开提供了一种目标跟踪方法、装置、电子设备、存储介质、计算机程序产品及计算机程序。In order to solve the above technical problems or at least partially solve the above technical problems, the present disclosure provides a target tracking method, apparatus, electronic device, storage medium, computer program product and computer program.
本公开实施例提供了一种目标跟踪方法,所述方法包括:The embodiment of the present disclosure provides a target tracking method, the method includes:
获取目标视频帧;Get the target video frame;
通过对所述目标视频帧的特征检测确定目标区域在所述目标视频帧中的位置信息;Determine the position information of the target area in the target video frame by detecting the feature of the target video frame;
如果所述目标区域在所述目标视频帧中的位置信息确定失败,则确定目标拍摄位置信息;If the determination of the position information of the target area in the target video frame fails, determining the target shooting position information;
根据所述目标拍摄位置信息以及相机投影算法,再次确定所述目标区域在所述目标视频帧中的位置信息。According to the target shooting position information and the camera projection algorithm, the position information of the target area in the target video frame is determined again.
本公开实施例还提供了一种目标跟踪装置,所述装置包括:Embodiments of the present disclosure also provide a target tracking device, the device comprising:
视频帧模块,用于获取目标视频帧;The video frame module is used to obtain the target video frame;
第一位置模块,用于通过对所述目标视频帧的特征检测确定目标区域在所述目标视频帧中的位置信息;a first position module, for determining the position information of the target area in the target video frame by detecting the feature of the target video frame;
拍摄位置模块,用于如果所述目标区域在所述目标视频帧中的位置信息确定失败,则确定目标拍摄位置信息;a shooting position module, configured to determine target shooting position information if the position information of the target area in the target video frame fails to be determined;
第二位置模块,用于根据所述目标拍摄位置信息以及相机投影算法,再次确定所述目标区域在所述目标视频帧中的位置信息。The second position module is configured to re-determine the position information of the target area in the target video frame according to the target shooting position information and the camera projection algorithm.
本公开实施例还提供了一种电子设备,所述电子设备包括:处理器;用于存储所述处理器可执行指令的存储器;所述处理器,用于从所述存储器中读取所述可执行指令,并执行所述指令以实现如本公开实施例提供的目标跟踪方法。An embodiment of the present disclosure further provides an electronic device, the electronic device includes: a processor; a memory for storing instructions executable by the processor; the processor for reading the memory from the memory The instructions can be executed, and the instructions can be executed to implement the target tracking method provided by the embodiments of the present disclosure.
本公开实施例还提供了一种计算机可读存储介质,所述存储介质存储有计算机程序,所述计算机程序用于执行如本公开实施例提供的目标跟踪方法。An embodiment of the present disclosure further provides a computer-readable storage medium, where the storage medium stores a computer program, and the computer program is used to execute the target tracking method provided by the embodiment of the present disclosure.
本公开实施例还提供了一种计算机程序产品,包括计算机程序,所述计算机程序用于执行如本公开实施例提供的目标跟踪方法。Embodiments of the present disclosure also provide a computer program product, including a computer program, and the computer program is used to execute the target tracking method provided by the embodiments of the present disclosure.
本公开实施例还提供了一种计算机程序,所述计算机程序用于执行如本公开实施例提供的目标跟踪方法。The embodiment of the present disclosure also provides a computer program, and the computer program is used to execute the target tracking method provided by the embodiment of the present disclosure.
本公开实施例提供的技术方案与现有技术相比具有如下优点:本公开实施例提供的目标跟踪方案,获取目标视频帧;通过对目标视频帧的特征检测确定目标区域在目标视频帧中的位置信息;如果目标区域在目标视频帧中的位置信息确定失败,则确定目标拍摄位置信息;根据目标拍摄位置信息以及相机投影算法,再次确定目标区域在目标视频帧中的位置信息。采用上述技术方案,在对视频帧的目标区域跟踪失败之后,可以根据同步定位与建图算法确定的拍摄位置来重新确定该视频帧中的目标区域的位置,实现跟踪恢复,在相机运动较快时也能实现目标跟踪,提升了跟踪鲁棒性。Compared with the prior art, the technical solution provided by the embodiment of the present disclosure has the following advantages: the target tracking solution provided by the embodiment of the present disclosure obtains the target video frame; the feature detection of the target video frame determines the position of the target area in the target video frame. Position information; if the determination of the position information of the target area in the target video frame fails, determine the target shooting position information; according to the target shooting position information and the camera projection algorithm, determine the position information of the target area in the target video frame again. With the above technical solution, after the tracking of the target area of the video frame fails, the position of the target area in the video frame can be re-determined according to the shooting position determined by the synchronous positioning and mapping algorithm, so as to realize the tracking recovery, and the camera can move faster. It can also achieve target tracking and improve the tracking robustness.
附图说明Description of drawings
结合附图并参考以下具体实施方式,本公开各实施例的上述和其他特征、优点及方面将变得更加明显。贯穿附图中,相同或相似的附图标记表示相同或相似的元素。应当理解附图是示意性的,原件和元素不一定按照比例绘制。The above and other features, advantages and aspects of various embodiments of the present disclosure will become more apparent when taken in conjunction with the accompanying drawings and with reference to the following detailed description. Throughout the drawings, the same or similar reference numbers refer to the same or similar elements. It should be understood that the drawings are schematic and that the originals and elements are not necessarily drawn to scale.
图1为本公开实施例提供的一种目标跟踪方法的流程示意图;FIG. 1 is a schematic flowchart of a target tracking method according to an embodiment of the present disclosure;
图2为本公开实施例提供的一种法向量投影的示意图;2 is a schematic diagram of a normal vector projection provided by an embodiment of the present disclosure;
图3为本公开实施例提供的另一种法向量投影的示意图;3 is a schematic diagram of another normal vector projection provided by an embodiment of the present disclosure;
图4为本公开实施例提供的另一种目标跟踪方法的流程示意图;4 is a schematic flowchart of another target tracking method provided by an embodiment of the present disclosure;
图5为本公开实施例提供的再一种目标跟踪方法的流程示意图;FIG. 5 is a schematic flowchart of still another target tracking method provided by an embodiment of the present disclosure;
图6为本公开实施例提供的一种目标跟踪装置的结构示意图;6 is a schematic structural diagram of a target tracking device according to an embodiment of the present disclosure;
图7为本公开实施例提供的一种电子设备的结构示意图。FIG. 7 is a schematic structural diagram of an electronic device according to an embodiment of the present disclosure.
具体实施方式Detailed ways
下面将参照附图更详细地描述本公开的实施例。虽然附图中显示了本公开的某些实施例,然而应当理解的是,本公开可以通过各种形式来实现,而且不应该被解释为限于这里阐述的实施例,相反提供这些实施例是为了更加透彻和完整地理解本公开。应当理解的是,本公开的附图及实施例仅用于示例性作用,并非用于限制本公开的保护范围。Embodiments of the present disclosure will be described in more detail below with reference to the accompanying drawings. While certain embodiments of the present disclosure are shown in the drawings, it should be understood that the present disclosure may be embodied in various forms and should not be construed as limited to the embodiments set forth herein, but rather are provided for the purpose of A more thorough and complete understanding of the present disclosure. It should be understood that the drawings and embodiments of the present disclosure are only for exemplary purposes, and are not intended to limit the protection scope of the present disclosure.
应当理解,本公开的方法实施方式中记载的各个步骤可以按照不同的顺序执行,和/或并行执行。此外,方法实施方式可以包括附加的步骤和/或省略执行示出的步骤。本公开的范围在此方面不受限制。It should be understood that the various steps described in the method embodiments of the present disclosure may be performed in different orders and/or in parallel. Furthermore, method embodiments may include additional steps and/or omit performing the illustrated steps. The scope of the present disclosure is not limited in this regard.
本文使用的术语“包括”及其变形是开放性包括,即“包括但不限于”。术语“基于”是“至少部分地基于”。术语“一个实施例”表示“至少一个实施例”;术语“另一实施例”表示“至少一个另外的实施例”;术语“一些实施例”表示“至少一些实施例”。其他术语的相关定义将在下文描述中给出。As used herein, the term "including" and variations thereof are open-ended inclusions, ie, "including but not limited to". The term "based on" is "based at least in part on." The term "one embodiment" means "at least one embodiment"; the term "another embodiment" means "at least one additional embodiment"; the term "some embodiments" means "at least some embodiments". Relevant definitions of other terms will be given in the description below.
需要注意,本公开中提及的“第一”、“第二”等概念仅用于对不同的装置、模块或单元进行区分,并非用于限定这些装置、模块或单元所执行的功能的顺序或者相互依存关系。It should be noted that concepts such as "first" and "second" mentioned in the present disclosure are only used to distinguish different devices, modules or units, and are not used to limit the order of functions performed by these devices, modules or units or interdependence.
需要注意,本公开中提及的“一个”、“多个”的修饰是示意性而非限制性的,本领域技术人员应当理解,除非在上下文另有明确指出,否则应该理解为“一个或多个”。It should be noted that the modifications of "a" and "a plurality" mentioned in the present disclosure are illustrative rather than restrictive, and those skilled in the art should understand that unless the context clearly indicates otherwise, they should be understood as "one or a plurality of". multiple".
本公开实施方式中的多个装置之间所交互的消息或者信息的名称仅用于说明性的目的,而并不是用于对这些消息或信息的范围进行限制。The names of messages or information exchanged between multiple devices in the embodiments of the present disclosure are only for illustrative purposes, and are not intended to limit the scope of these messages or information.
图1为本公开实施例提供的一种目标跟踪方法的流程示意图,该方法可以由目标跟踪装置执行,其中该装置可以采用软件和/或硬件实现,一般可集成在电子设备中。如图1所示,该方法包括:FIG. 1 is a schematic flowchart of a target tracking method according to an embodiment of the present disclosure. The method may be executed by a target tracking apparatus, where the apparatus may be implemented by software and/or hardware, and may generally be integrated in an electronic device. As shown in Figure 1, the method includes:
步骤101、获取目标视频帧。Step 101: Acquire a target video frame.
视频帧也称图像帧,可以为组成视频的最小单位,也可以为单独的一个图像。其中,目标视频帧可以为任意一个需要进行检测和跟踪的图像帧,可以为采用具有视频采集功能的设备拍摄得到的视频或从互联网获取得到的视频中的一个视频帧,也可以为实时拍摄的一个图像,具体不限。A video frame, also called an image frame, can be the smallest unit that composes a video, or it can be a single image. The target video frame can be any image frame that needs to be detected and tracked, it can be a video frame captured by a device with a video capture function or a video frame obtained from the Internet, or it can be a video frame captured in real time. An image, no specific limit.
步骤102、通过对目标视频帧的特征检测确定目标区域在目标视频帧中的位置信息。Step 102: Determine the position information of the target area in the target video frame by detecting the feature of the target video frame.
其中,目标区域是指目标视频帧中目标形状物体所在的区域,目标形状具体不限,例如目标形状可以包括椭圆形、圆形和矩形等,本公开实施例以目标区域为目标视频帧中椭圆形区域为例进行说明。目标区域的位置信息可以为能够表征目标区域在视频帧中位置的信息,具体可以包括目标区域在视频帧中的顶点坐标、中心点坐标等信息。The target area refers to the area in the target video frame where the object with the target shape is located, and the target shape is not limited. For example, the target shape may include an ellipse, a circle, and a rectangle. shape area as an example. The location information of the target area may be information that can represent the location of the target area in the video frame, and may specifically include information such as vertex coordinates and center point coordinates of the target area in the video frame.
本公开实施例中,获取到目标视频帧之后,可以采用预设检测算法确定目标视频帧中目标区域的位置信息。上述预设检测算法可以为基于深度学习的检测算法或轮廓检测算法等,具体可以根据实际情况确定。例如目标区域为椭圆形区域时,预设检测算法可以为任意一种椭圆检测算法,确定视频帧中目标区域的位置信息的过程可以包括:采用椭圆检测算法对第一视频帧进行轮廓检测,然后对轮廓检测得到的椭圆轮廓进行拟合,得到椭圆形区域视频帧中的位置信息。In the embodiment of the present disclosure, after the target video frame is acquired, a preset detection algorithm may be used to determine the position information of the target area in the target video frame. The above-mentioned preset detection algorithm may be a deep learning-based detection algorithm or a contour detection algorithm, etc., which may be specifically determined according to the actual situation. For example, when the target area is an oval area, the preset detection algorithm may be any ellipse detection algorithm, and the process of determining the position information of the target area in the video frame may include: using the ellipse detection algorithm to perform contour detection on the first video frame, and then The ellipse contour obtained by contour detection is fitted to obtain the position information in the video frame of the ellipse region.
可选的,当目标视频帧为目标视频中的一个视频帧,确定目标视频帧中目标区域的位置信息,还可以包括:提取目标视频中的第一图像,并确定目标区域在第一图像中的第一位置信息;根据第一位置信息确定的初始特征点对目标视频帧进行光流跟踪,得到目标特征点;其中,目标视频帧为目标视频中第一图像的相邻视频帧;对目标特征点进行拟合,得到目标区域在目标视频帧中的位置信息。Optionally, when the target video frame is a video frame in the target video, determining the position information of the target area in the target video frame may also include: extracting the first image in the target video, and determining that the target area is in the first image. The first position information of the target video frame is performed according to the initial feature point determined by the first position information, and the target video frame is tracked by optical flow to obtain the target feature point; wherein, the target video frame is the adjacent video frame of the first image in the target video; The feature points are fitted to obtain the position information of the target area in the target video frame.
目标视频帧可以为目标视频中的任意一个图像帧,第一图像可以为目标视频中与目标视频帧相邻的按照时间顺序的上一个图像帧。本公开实施例中,可以采用上述预设检测算法对第一图像进行目标区域的检测,确定目标区域在第一图像中的第一位置信息。根据第一位置信息确定初始特征点,包括:根据第一位置信息对第一图像中的目标区域的边缘轮廓进行采样,确定初始特征点。可选的,根据第一位置信息对第一图像中的目标区域的边缘轮廓进行采样,确定初始特征点,包括:当目标区域为椭圆形区域,根据第一位置信息将目标区域在极坐标下进行表示得到椭圆轮廓;其中,第一位置信息包括目标区域在第一图像中的顶点坐标和/或中心点坐标;按照预设极角间隔在椭圆轮廓中进行采样,得到初始特征点。之后,对上述采样得到的初始特征点采用光流跟踪算法进行跟踪,保留跟踪成功的特征点为目标特征点,剔除跟踪失败的特征点。对目标特征点进行拟合,得到目标区域在目标视频帧中的位置信息。The target video frame may be any image frame in the target video, and the first image may be the last image frame in the target video adjacent to the target video frame in time sequence. In the embodiment of the present disclosure, the above-mentioned preset detection algorithm may be used to detect the target area on the first image, and determine the first position information of the target area in the first image. Determining the initial feature point according to the first position information includes: sampling the edge contour of the target area in the first image according to the first position information to determine the initial feature point. Optionally, sampling the edge contour of the target area in the first image according to the first position information, and determining the initial feature points, including: when the target area is an elliptical area, according to the first position information, the target area is in polar coordinates. Perform representation to obtain an ellipse outline; wherein, the first position information includes vertex coordinates and/or center point coordinates of the target area in the first image; sampling is performed in the ellipse outline according to preset polar angle intervals to obtain initial feature points. After that, the optical flow tracking algorithm is used to track the initial feature points obtained by the above sampling, and the feature points that are successfully tracked are reserved as the target feature points, and the feature points that fail to be tracked are eliminated. Fit the target feature points to obtain the position information of the target area in the target video frame.
在一些实施例中,对目标特征点进行拟合,得到目标区域在目标视频帧中的位置信息,包括:如果目标特征点在目标区域的边缘轮廓上的覆盖范围大于或等于预设范围,则对目标特征点进行拟合,得到目标区域在目标视频帧中的位置信息。其中,预设范围是指预先设置的满足目标区域形状的范围,具体可以根据实际情况设置,例如预设范围可以为边缘轮廓的整个范围的3/4。具体的,确定目标特征点之后,可以判断目标点在目标区域的边缘轮廓上的覆盖范围是否大于或等于预设范围,若是,则采用拟合算法对目标特征点进行拟合,得到目标区域在在目标视频帧中的位置信息。如果目标特征点在目标区域的边缘轮廓上的覆盖范围小于预设范围,则可以直接采用预设检测算法对目标视频帧进行检测,确定目标区域在目标视频帧中的位置信息。In some embodiments, fitting the target feature points to obtain position information of the target region in the target video frame, including: if the coverage of the target feature points on the edge contour of the target region is greater than or equal to a preset range, then Fit the target feature points to obtain the position information of the target area in the target video frame. The preset range refers to a preset range that satisfies the shape of the target area, which may be set according to actual conditions. For example, the preset range may be 3/4 of the entire range of the edge contour. Specifically, after determining the target feature points, it can be determined whether the coverage of the target points on the edge contour of the target area is greater than or equal to the preset range, and if so, use a fitting algorithm to fit the target feature points to obtain the target area in Location information in the target video frame. If the coverage area of the target feature point on the edge contour of the target area is smaller than the preset area, a preset detection algorithm can be directly used to detect the target video frame to determine the position information of the target area in the target video frame.
在一些实施例中,在确定目标区域在第一图像中的第一位置信息之后,还包括:确定目标视频帧相对于第一图像的变化参数;根据第一位置信息确定的初始特征点对目标视频帧进行光流跟踪,得到目标特征点,包括:如果基于变化参数确定目标不满足复用条件,则执行根据第一位置信息确定的初始特征点对目标视频帧进行光流跟踪,得到目标特征点。In some embodiments, after determining the first position information of the target area in the first image, the method further includes: determining a change parameter of the target video frame relative to the first image; initial feature points determined according to the first position information Perform optical flow tracking on the video frame to obtain target feature points, including: if the target does not meet the multiplexing condition based on the change parameter, performing optical flow tracking on the target video frame based on the initial feature point determined according to the first position information to obtain the target feature point.
其中,变换参数是指表征目标视频帧相对于第一图像的变化情况的参数。可选的,确定目标视频帧相对于第一图像的变化参数,可以包括:提取第一图像中的第一特征点;根据第一特征点对目标视频帧进行光流跟踪,确定第二特征点,将第二特征点与第一特征点之间的移动距离确定为变化参数。第一特征点可以为采用加速段测试的特征(Features from Accelerated Segment Test,FAST)角点检测算法对第一图像检测得到的角点。复用条件是指目标视频帧对目标区域的位置确定能否复用第一图像的具体判断条件。其中,变化阈值是指预先设置的阈值,可以根据实际情况设置,例如通过为目标视频帧中特征点相对于第一视频中对应的特征点的移动信息表征变化参数时,变换阈值可以为距离阈值设置为0.8。The transformation parameter refers to a parameter representing the change of the target video frame relative to the first image. Optionally, determining a change parameter of the target video frame relative to the first image may include: extracting a first feature point in the first image; performing optical flow tracking on the target video frame according to the first feature point, and determining a second feature point , and the moving distance between the second feature point and the first feature point is determined as a change parameter. The first feature point may be a corner point detected on the first image by using a feature from Accelerated Segment Test (FAST) corner detection algorithm. The multiplexing condition refers to a specific judgment condition for determining whether the first image can be multiplexed by the target video frame to the position of the target area. The change threshold refers to a preset threshold, which can be set according to the actual situation. For example, when the change parameter is represented by the movement information of the feature points in the target video frame relative to the corresponding feature points in the first video, the transformation threshold can be the distance threshold. Set to 0.8.
具体的,确定目标视频帧相对于第一图像的变化参数之后,可以将变化参数与变化阈值进行比对,如果确定变化参数大于变化阈值,则可以确定目标视频帧不满足复用条件,需要重新跟踪,执行根据第一位置信息确定的初始特征点对目标视频帧进行光流跟踪,得到目标特征点;否则确定目标视频帧满足复用条件,则将第一位置信息确定为目标区域在目标视频帧中的位置信息。Specifically, after determining the change parameter of the target video frame relative to the first image, the change parameter can be compared with the change threshold. If the change parameter is determined to be greater than the change threshold, it can be determined that the target video frame does not meet the multiplexing conditions and needs to be re-used. Tracking, performing optical flow tracking on the target video frame based on the initial feature point determined according to the first position information, to obtain the target feature point; otherwise, it is determined that the target video frame meets the multiplexing condition, then the first position information is determined as the target area in the target video frame. position information in the frame.
在上述方案中,在对视频的一个图像帧的目标区域检测的基础上,通过特征点跟踪和拟合即可实现更加准确地确定目标视频帧中目标区域的位置,提升了对目标视频帧中目标区域位置确定的计算效率。并且,通过对相邻两个视频帧增加复用条件的判断,当视频中相邻两个视频帧的变化较大时,采用上述特征点跟踪和拟合实现目标区域的位置的确定;当视频中相邻两个视频帧的变化或差异较小时,则说明两个视频帧相似性较高,此时下一个视频帧可以直接复用上一个视频帧的目标区域的位置信息,不用重新进行检测,节省了工作量,提高了计算效率。In the above solution, on the basis of detecting the target area of an image frame of the video, the position of the target area in the target video frame can be more accurately determined by feature point tracking and fitting, which improves the accuracy of the target area in the target video frame. Computational efficiency of target area location determination. Moreover, by adding multiplexing conditions to two adjacent video frames, when the changes of the two adjacent video frames in the video are large, the above-mentioned feature point tracking and fitting are used to determine the position of the target area; When the change or difference between two adjacent video frames is small, the similarity between the two video frames is high. At this time, the next video frame can directly reuse the position information of the target area of the previous video frame without re-detection. The workload is saved and the computing efficiency is improved.
步骤103、如果目标区域在目标视频帧中的位置信息确定失败,则确定目标拍摄位置信息。Step 103: If the determination of the position information of the target area in the target video frame fails, determine the target shooting position information.
本公开实施例中,在相机运动较快时可能在目标视频帧中未能成功确定目标区域的位置信息,则确定目标拍摄位置信息。目标拍摄位置信息是指拍摄位置在世界坐标系下的坐标。In the embodiment of the present disclosure, when the camera moves quickly, the location information of the target area may not be successfully determined in the target video frame, and then the target shooting location information is determined. The target shooting position information refers to the coordinates of the shooting position in the world coordinate system.
本公开实施例中,确定目标拍摄位置信息,可以包括:通过同步定位与建图算法确定第一坐标系下的初始拍摄位置信息;确定第一坐标系与相机投影算法对应的第二坐标系的目标变换关系;根据初始拍摄位置信息以及目标变换关系,确定目标拍摄位置信息。In the embodiment of the present disclosure, determining the target shooting position information may include: determining the initial shooting position information in the first coordinate system by synchronous positioning and mapping algorithm; determining the first coordinate system and the second coordinate system corresponding to the camera projection algorithm Target transformation relationship; according to the initial shooting position information and the target transformation relationship, determine the target shooting position information.
其中,同步定位与建图(Simultaneous Localization And Mapping,SLAM)算法,SLAM主要用于解决搭载特定传感器的设备在未知环境中运行时定位导航与地图构建的问题,如果传感器为相机时,可以确定相机的位置,也即拍摄位置。本公开实施例中通过SLAM算法来确定拍摄位置在第一坐标系下的初始拍摄位置信息。相机投影算法可以为相机的小孔投影模型的算法。Among them, Simultaneous Localization And Mapping (SLAM) algorithm, SLAM is mainly used to solve the problem of positioning, navigation and map construction when a device equipped with a specific sensor is running in an unknown environment. If the sensor is a camera, the camera can be determined. position, that is, the shooting position. In the embodiment of the present disclosure, the SLAM algorithm is used to determine the initial shooting position information of the shooting position in the first coordinate system. The camera projection algorithm may be an algorithm of a pinhole projection model of the camera.
可选的,确定第一坐标系与相机投影算法对应的第二坐标系的目标变换关系,可以包括:通过同步定位与建图算法以及相机投影算法分别确定已检视频帧的第一拍摄位置信息和第二拍摄位置信息,其中,已检视频帧为对目标区域跟踪成功的视频帧;根据已检视频帧的第一拍摄位置信息和第二拍摄位置信息,确定目标变换关系。Optionally, determining the target transformation relationship between the first coordinate system and the second coordinate system corresponding to the camera projection algorithm may include: respectively determining the first shooting position information of the inspected video frame by synchronizing the positioning and mapping algorithm and the camera projection algorithm. and second shooting position information, wherein the detected video frame is a video frame that successfully tracks the target area; the target transformation relationship is determined according to the first shooting position information and the second shooting position information of the detected video frame.
已检视频帧是指跟踪成功的视频帧,也即成功确定目标区域的位置的视频帧。其中,已检视频帧的数量为一个或多个,当已检视频帧的数量为多个时,目标变换关系为各已检视频帧对应的变换关系的平均值,目标变换关系包括变换尺度和变换位移。目标变换关系是指同步定位与建图算法对应的第一坐标系与相机投影算法对应的第二坐标系之间的变换关系。目标变换关系可以包括两个坐标系之间在旋转、尺度和位移上的变换,由于两个坐标系的旋转是相同的,因此不需要进行旋转上的变换,也即目标变换关系包括变换尺度和变换位移。由当已检视频帧的数量为多个,针对每个已检视频帧均可以计算得到一个变换关系,因此目标变换关系可以为各已检视频帧对应的变换关系的平均值。The detected video frame refers to a video frame that is successfully tracked, that is, a video frame for which the position of the target area is successfully determined. Wherein, the number of detected video frames is one or more, and when the number of detected video frames is multiple, the target transformation relationship is the average value of the transformation relationship corresponding to each detected video frame, and the target transformation relationship includes transformation scale and Transform displacement. The target transformation relationship refers to the transformation relationship between the first coordinate system corresponding to the simultaneous positioning and mapping algorithm and the second coordinate system corresponding to the camera projection algorithm. The target transformation relationship can include transformations in rotation, scale and displacement between the two coordinate systems. Since the rotations of the two coordinate systems are the same, no rotation transformation is required, that is, the target transformation relationship includes the transformation scale and Transform displacement. When the number of detected video frames is multiple, a transformation relationship can be calculated for each detected video frame, so the target transformation relationship can be the average value of the transformation relationship corresponding to each detected video frame.
具体的,对于已检视频帧可以通过同步定位与建图算法确定第一拍摄位置信息,并通过相机投影算法确定第二拍摄位置信息,第一拍摄位置信息和第二拍摄位置信息为拍摄位置在上述第一坐标系和第二坐标系的位置信息,之后可以根据上述第一拍摄位置信息和第二拍摄位置信息采用最小二乘法确定上述第一坐标系和第二坐标系的目标变换关系。示例性的,第一拍摄位置信息可以采用slam_w_T_c表示,第二拍摄位置信息可以采用ellipse_w_T_c表示,目标变换关系表示为slam_w_T_c=ellipse_w_T_c*align_scale+align_translation,其中align_scale表示两个坐标系的变换尺度,align_translation表示两个坐标系的变换位移。Specifically, for the detected video frame, the first shooting position information can be determined through the synchronous positioning and mapping algorithm, and the second shooting position information can be determined through the camera projection algorithm. The first shooting position information and the second shooting position information are the shooting positions in The position information of the first coordinate system and the second coordinate system, and then the least squares method can be used to determine the target transformation relationship between the first coordinate system and the second coordinate system according to the first shooting position information and the second shooting position information. Exemplarily, the first shooting position information may be represented by slam_w_T_c, the second shooting position information may be represented by ellipse_w_T_c, and the target transformation relationship is represented as slam_w_T_c=ellipse_w_T_c*align_scale+align_translation, where align_scale represents the transformation scale of the two coordinate systems, and align_translation represents Transform displacement of the two coordinate systems.
可以理解的是,上述第一坐标系可以为同步定位与建图算法对应的第一世界坐标系,上述第二坐标系可以为相机投影算法对应的第二世界坐标系或以目标形状物体为原点的空间坐标系,上述第一世界坐标系和第二世界坐标系的原点以及坐标轴等不同,也需要变换。It can be understood that the above-mentioned first coordinate system may be the first world coordinate system corresponding to the synchronous positioning and mapping algorithm, and the above-mentioned second coordinate system may be the second world coordinate system corresponding to the camera projection algorithm or the target shape object as the origin. The origin and coordinate axis of the first world coordinate system and the second world coordinate system are different, and they also need to be transformed.
可选的,根据初始拍摄位置信息以及目标变换关系,确定目标拍摄位置信息,包括:基于目标变换关系将初始拍摄位置信息从第一坐标系的信息变换为第二坐标系的信息,得到目标拍摄位置信息。确定上述目标变换关系之后,可以将初始拍摄位置信息通过该目标变换关系从同步定位与建图算法对应的第一坐标系的信息变换为相机投影算法对应 的第二坐标系的信息,之后可以通过变换坐标系之后的目标拍摄位置信息进行后续确定目标区域在目标视频帧的位置信息的步骤。Optionally, determining the target shooting position information according to the initial shooting position information and the target transformation relationship includes: transforming the initial shooting position information from the information of the first coordinate system to the information of the second coordinate system based on the target transformation relationship, so as to obtain the target shooting position. location information. After the above target transformation relationship is determined, the initial shooting position information can be transformed from the information of the first coordinate system corresponding to the synchronous positioning and the mapping algorithm to the information of the second coordinate system corresponding to the camera projection algorithm through the target transformation relationship, and then can be obtained by The target shooting position information after the transformation of the coordinate system is used for the subsequent step of determining the position information of the target area in the target video frame.
步骤104、根据目标拍摄位置信息以及相机投影算法,再次确定目标区域在目标视频帧中的位置信息。Step 104: Determine the position information of the target area in the target video frame again according to the target shooting position information and the camera projection algorithm.
本公开实施例中,根据目标拍摄位置信息以及相机投影算法,再次确定目标区域在目标视频帧中的位置信息,可以包括:根据目标拍摄位置信息以及目标区域对应的目标形状物体的位置信息进行位置求解,确定拍摄位置到目标形状物体的位移信息;其中,目标区域为目标视频帧中目标性形状物体所在区域;将拍摄位置到目标形状物体的位移信息输入相机投影算法的投影方程中,确定目标区域在目标视频帧中的位置信息。In the embodiment of the present disclosure, re-determining the position information of the target area in the target video frame according to the target shooting position information and the camera projection algorithm may include: performing a position measurement according to the target shooting position information and the position information of the target shape object corresponding to the target area. Solve to determine the displacement information from the shooting position to the target-shaped object; wherein, the target area is the area where the target-shaped object is located in the target video frame; input the displacement information from the shooting position to the target-shaped object into the projection equation of the camera projection algorithm to determine the target The location information of the region in the target video frame.
上述目标形状物体的位置信息可以为预先设置的一个固定值,为已知量,例如世界坐标系的原点可以设置在目标形状物体所在的位置,该位置信息的坐标为(0,0,0)。根据目标形状物体的位置、目标拍摄位置以及变换方程,即可确定目标拍摄位置到目标形状物体的位移。上述变换方程可以表示为W10=W20+W12,其中,W10表示世界坐标系下拍摄位置信息,W20表示世界坐标系下目标形状物体的位置信息,W12表示世界坐标系下目标拍摄位置到目标区域的位移信息。上述W10、W20和W12三个均为世界坐标系下的向量,具有方向和大小。在该步骤中,可以将上述变换坐标系之后的目标拍摄位置信息W10和目标形状物体的位置信息W12输入变换方程中,得到目标拍摄位置到目标形状物体的位移信息W12。The position information of the target shape object can be a preset fixed value, which is a known quantity. For example, the origin of the world coordinate system can be set at the position where the target shape object is located, and the coordinates of the position information are (0, 0, 0) . According to the position of the target shape object, the target shooting position and the transformation equation, the displacement from the target shooting position to the target shape object can be determined. The above transformation equation can be expressed as W10=W20+W12, where W10 represents the shooting position information in the world coordinate system, W20 represents the position information of the target shape object in the world coordinate system, and W12 represents the target shooting position in the world coordinate system to the target area. displacement information. The above three W10, W20 and W12 are all vectors in the world coordinate system, with direction and size. In this step, the target shooting position information W10 and the position information W12 of the target shape object after the coordinate system transformation can be input into the transformation equation to obtain the displacement information W12 from the target shooting position to the target shape object.
投影方程中目标区域的位置信息与拍摄装置的内部参数、拍摄位置所在坐标系到世界坐标系的旋转矩阵、拍摄位置所在坐标系下世界坐标系的原点的位置信息相关。投影方程可以表示为p=π[K(R12*W20+T)],其中,π表示系数,K表示拍摄装置的内部参数,具体可以包括拍摄位置焦距、畸变参数等拍摄装置内部的一些参数,R12表示拍摄位置所在坐标系到世界坐标系的旋转矩阵,p表示目标区域的位置信息,T表示拍摄位置所在坐标系下世界坐标系的原点的位置信息。由上述投影方程p=π[K(R12*W20+T)]变换得到W12=-1*ratio*R21*K-1*p,R21表示从拍摄位置所在坐标系下到世界坐标系的旋转矩阵的逆矩阵,ratio=1/π,T为三维坐标,投影方程中等号号右侧的值也为三维坐标,p为二维坐标,ratio可以为三维坐标中最后一维的值。将上述世界坐标系下目标拍摄位置到目标形状物体的位移信息W12=-1*ratio*R21*K-1*p中,可以确定目标区域在目标视频帧中的位置信息p。The location information of the target area in the projection equation is related to the internal parameters of the photographing device, the rotation matrix from the coordinate system where the photographing location is located to the world coordinate system, and the location information of the origin of the world coordinate system under the coordinate system where the photographing location is located. The projection equation can be expressed as p=π[K(R12*W20+T)], where π represents the coefficient, and K represents the internal parameters of the shooting device, which may specifically include some parameters inside the shooting device such as the focal length of the shooting position, distortion parameters, etc. R12 represents the rotation matrix from the coordinate system where the shooting position is located to the world coordinate system, p represents the position information of the target area, and T represents the position information of the origin of the world coordinate system under the coordinate system where the shooting position is located. W12=-1*ratio*R21*K-1*p is transformed from the above projection equation p=π[K(R12*W20+T)], R21 represents the rotation matrix from the coordinate system where the shooting position is located to the world coordinate system The inverse matrix of , ratio=1/π, T is the three-dimensional coordinate, the value on the right side of the equal sign in the projection equation is also the three-dimensional coordinate, p is the two-dimensional coordinate, and ratio can be the value of the last dimension in the three-dimensional coordinate. The position information p of the target area in the target video frame can be determined by taking the displacement information W12=-1*ratio*R21*K-1*p from the target shooting position to the target-shaped object in the above-mentioned world coordinate system.
本公开实施例提供的目标跟踪方案,获取目标视频帧;通过对目标视频帧的特征检测确定目标区域在目标视频帧中的位置信息;如果目标区域在目标视频帧中的位置信息确定失败,则确定目标拍摄位置信息;根据目标拍摄位置信息以及相机投影算法,再次确定目标区域在目标视频帧中的位置信息。采用上述技术方案,在对视频帧的目标区域跟踪失败之后,可以根据同步定位与建图算法确定的拍摄位置来重新确定该视频帧中的目标区域的位置,实现跟踪恢复,在相机运动较快时也能实现目标跟踪,提升了跟踪鲁棒性。The target tracking solution provided by the embodiment of the present disclosure obtains the target video frame; the position information of the target area in the target video frame is determined by the feature detection of the target video frame; if the determination of the position information of the target area in the target video frame fails, then Determine the target shooting position information; according to the target shooting position information and the camera projection algorithm, determine the position information of the target area in the target video frame again. With the above technical solution, after the tracking of the target area of the video frame fails, the position of the target area in the video frame can be re-determined according to the shooting position determined by the synchronous positioning and mapping algorithm, so as to realize the tracking recovery, and the camera can move faster. It can also achieve target tracking and improve the tracking robustness.
在一些实施例中,目标跟踪方法还可以包括:确定目标区域对应的目标形状物体所在平面的初始法向量,并将初始法向量在水平面或竖平面上的投影确定为目标法向量。In some embodiments, the target tracking method may further include: determining the initial normal vector of the plane where the target shape object corresponding to the target area is located, and determining the projection of the initial normal vector on the horizontal or vertical plane as the target normal vector.
其中,初始法向量是指空间中目标形状物体所在平面的法向量,目标法向量是指优化后的法向量。具体的,确定目标区域在目标视频帧以及相邻视频帧中的位置信息之后,通过对两个视频帧之间的单应矩阵进行分解可以确定初始法向量,上述单应矩阵通过目标区域上采样的点计算得到,分解方法可以采用奇异值分解(Singular Value Decomposition,SVD)方式。确定初始法向量之后,可以将初始法向量在水平面或竖平面上的投影确定为目标法向量。示例性的,初始法向量为(0.05,0.03,0.994),目标法向量可以为(0,0,1)。Among them, the initial normal vector refers to the normal vector of the plane where the target shape object is located in the space, and the target normal vector refers to the optimized normal vector. Specifically, after determining the location information of the target area in the target video frame and adjacent video frames, the initial normal vector can be determined by decomposing the homography matrix between the two video frames, and the above homography matrix is upsampled by the target area. The point calculation of , the decomposition method can use the singular value decomposition (Singular Value Decomposition, SVD) method. After the initial normal vector is determined, the projection of the initial normal vector on the horizontal or vertical plane can be determined as the target normal vector. Exemplarily, the initial normal vector is (0.05, 0.03, 0.994), and the target normal vector may be (0, 0, 1).
示例性的,图2为本公开实施例提供的一种法向量投影的示意图,图3为本公开实施例提供的另一种法向量投影的示意图,图2和图3分布表征了初始法向量在竖直方向和水平方向投影之后的目标法向量。Exemplarily, FIG. 2 is a schematic diagram of a normal vector projection provided by an embodiment of the present disclosure, FIG. 3 is a schematic diagram of another normal vector projection provided by an embodiment of the present disclosure, and FIG. 2 and FIG. 3 distributions represent initial normal vectors. The target normal vector after vertical and horizontal projection.
上述方案中,假设物体仅仅位于水平面或竖平面上,通过对初始法向量的投影可以得到优化后的目标法向量,避免了因计算法向量时假设相机为正交投影模型造成的误差,提高了空间中物体所在平面法向的精度,进而提高了基于法向进行的特效展示的准确性In the above scheme, assuming that the object is only located on the horizontal or vertical plane, the optimized target normal vector can be obtained by projecting the initial normal vector, which avoids the error caused by assuming that the camera is an orthogonal projection model when calculating the normal vector, and improves the The accuracy of the normal direction of the plane where the object is located in the space, thereby improving the accuracy of the display of special effects based on the normal direction
图4为本公开实施例提供的另一种目标跟踪方法的流程示意图,本实施例在上述实施例的基础上,进一步优化了上述目标跟踪方法。如图4所示,该方法包括:FIG. 4 is a schematic flowchart of another target tracking method provided by an embodiment of the present disclosure. On the basis of the foregoing embodiment, this embodiment further optimizes the foregoing target tracking method. As shown in Figure 4, the method includes:
步骤201、获取目标视频帧。Step 201: Acquire a target video frame.
步骤202、通过对目标视频帧的特征检测确定目标区域在目标视频帧中的位置信息。Step 202: Determine the position information of the target region in the target video frame by detecting the feature of the target video frame.
步骤203、如果目标区域在目标视频帧中的位置信息确定失败,则通过同步定位与建图算法确定第一坐标系下的初始拍摄位置信息。Step 203: If the determination of the position information of the target area in the target video frame fails, the initial shooting position information in the first coordinate system is determined through a synchronous positioning and mapping algorithm.
步骤204、确定第一坐标系与相机投影算法对应的第二坐标系的目标变换关系。Step 204: Determine the target transformation relationship between the first coordinate system and the second coordinate system corresponding to the camera projection algorithm.
可选的,确定第一坐标系与相机投影算法对应的第二坐标系的目标变换关系,包括:通过同步定位与建图算法以及相机投影算法分别确定已检视频帧的第一拍摄位置信息和第二拍摄位置信息,其中,已检视频帧为对目标区域跟踪成功的视频帧;根据已检视频帧的第一拍摄位置信息和第二拍摄位置信息,确定目标变换关系。Optionally, determining the target transformation relationship between the first coordinate system and the second coordinate system corresponding to the camera projection algorithm includes: determining the first shooting position information and The second shooting position information, wherein the detected video frame is a video frame that successfully tracks the target area; the target transformation relationship is determined according to the first shooting position information and the second shooting position information of the detected video frame.
其中,已检视频帧的数量为一个或多个,当已检视频帧的数量为多个时,目标变换关系为各已检视频帧对应的变换关系的平均值,目标变换关系包括变换尺度和变换位移。Wherein, the number of detected video frames is one or more, and when the number of detected video frames is multiple, the target transformation relationship is the average value of the transformation relationship corresponding to each detected video frame, and the target transformation relationship includes transformation scale and Transform displacement.
步骤205、根据初始拍摄位置信息以及目标变换关系,确定目标拍摄位置信息。Step 205: Determine the target shooting position information according to the initial shooting position information and the target transformation relationship.
可选的,根据初始拍摄位置信息以及目标变换关系,确定目标拍摄位置信息,包括:基于目标变换关系将初始拍摄位置信息从第一坐标系的信息变换为第二坐标系的信息,得到目标拍摄位置信息。Optionally, determining the target shooting position information according to the initial shooting position information and the target transformation relationship includes: transforming the initial shooting position information from the information of the first coordinate system to the information of the second coordinate system based on the target transformation relationship, so as to obtain the target shooting position. location information.
步骤206、根据目标拍摄位置信息以及相机投影算法,再次确定目标区域在目标视频帧中的位置信息。Step 206: Determine the position information of the target area in the target video frame again according to the target shooting position information and the camera projection algorithm.
可选的,根据目标拍摄位置信息以及相机投影算法,再次确定目标区域在目标视频帧中的位置信息,包括:根据目标拍摄位置信息以及目标区域对应的目标形状物体的位置信息进行位置求解,确定拍摄位置到目标形状物体的位移信息;其中,目标区域为目标视频帧中目标性形状物体所在区域;将拍摄位置到目标形状物体的位移信息输入相机投影算法的投影方程中,确定目标区域在目标视频帧中的位置信息。Optionally, according to the target shooting position information and the camera projection algorithm, the position information of the target area in the target video frame is determined again, including: performing a position solution according to the target shooting position information and the position information of the target shape object corresponding to the target area, and determining The displacement information from the shooting position to the target-shaped object; wherein, the target area is the area where the target-shaped object is located in the target video frame; the displacement information from the shooting position to the target-shaped object is input into the projection equation of the camera projection algorithm, and the target area is determined on the target. Position information in the video frame.
在一些实施例中,目标跟踪方法还可以包括:确定目标区域对应的目标形状物体所在平面的初始法向量,并将初始法向量在水平面或竖平面上的投影确定为目标法向量。In some embodiments, the target tracking method may further include: determining the initial normal vector of the plane where the target shape object corresponding to the target area is located, and determining the projection of the initial normal vector on the horizontal or vertical plane as the target normal vector.
接下来通过一个具体的示例对本公开实施例中的目标跟踪方法进行进一步说明。示例性的,图5为本公开实施例提供的再一种目标跟踪方法的流程示意图,以目标区域为椭圆形区域,目标形状物体为圆形物体为例,具体过程可以包括:1.初始化:在目标视频的的前若干视频帧,使用SLAM算法完成对拍摄位置在第一坐标系下的计算,采用slam_w_T_c表示,使用相机投影算法根据已知的椭圆形区域的位置完成在第二坐标系下对拍摄位置的计算,采用ellipse_w_T_c表示,上述第一坐标系为第一世界坐标系,上述第二坐标系为第二世界坐标系或圆形物体坐标系。2.坐标系对齐。使用最小二乘法计算第一坐标系和第二坐标系之间已知旋转的相似变换关系,相机为智能终端中设置的图像采集模块时,旋转可以通过智能终端的惯性测量单应(Inertial Measurement Unit,IMU)获得,两个坐标系的旋转相同,因此相似变换关系包括相似变换所需的位移和尺度,从而完成两个坐标系之间的坐标系对齐(alignment)。3.跟踪:在目标视频中当前帧因为相机运动较快导致图像模糊检测不到椭圆形区域时,使用SLAM算法确定的拍摄位置和两个坐标系之间的目标变换关系,确定在相机投影算法对应的第二坐标系下的拍摄位置。采用SLAM算法确定的拍摄位置可以采用slam_w_T_c表示,相机投影算法对应的拍摄位置可以采用ellipse_w_T_c表示,目标变换关系表示为slam_w_T_c=ellipse_w_T_c*align_scale+align_translation,其中align_scale表示两个坐标系的变换尺度,align_translation表示两个坐标系的变换位移。之后根据第二坐标系下的拍摄位置、相机投影算法对应的投影方程和变换方程,确定当前帧中椭圆形区域的位置。Next, the target tracking method in the embodiment of the present disclosure will be further described by using a specific example. Exemplarily, FIG. 5 is a schematic flowchart of still another target tracking method provided by an embodiment of the present disclosure. Taking the target area as an elliptical area and the target-shaped object as a circular object as an example, the specific process may include: 1. Initialization: In the first several video frames of the target video, the SLAM algorithm is used to complete the calculation of the shooting position in the first coordinate system, which is represented by slam_w_T_c, and the camera projection algorithm is used to complete the calculation in the second coordinate system according to the known position of the elliptical area. The calculation of the shooting position is represented by ellipse_w_T_c, the first coordinate system is the first world coordinate system, and the second coordinate system is the second world coordinate system or the circular object coordinate system. 2. Coordinate system alignment. The least squares method is used to calculate the similarity transformation relationship between the known rotations between the first coordinate system and the second coordinate system. When the camera is the image acquisition module set in the smart terminal, the rotation can be passed through the inertial measurement homography of the smart terminal (Inertial Measurement Unit). , IMU), the rotation of the two coordinate systems is the same, so the similarity transformation relationship includes the displacement and scale required for the similarity transformation, so as to complete the coordinate system alignment between the two coordinate systems. 3. Tracking: When the current frame in the target video is blurred and the elliptical area cannot be detected due to the fast camera movement, the shooting position determined by the SLAM algorithm and the target transformation relationship between the two coordinate systems are used to determine the camera projection algorithm. The corresponding shooting position in the second coordinate system. The shooting position determined by the SLAM algorithm can be represented by slam_w_T_c, the shooting position corresponding to the camera projection algorithm can be represented by ellipse_w_T_c, and the target transformation relationship is represented as slam_w_T_c=ellipse_w_T_c*align_scale+align_translation, where align_scale represents the transformation scale of the two coordinate systems, and align_translation represents Transform displacement of the two coordinate systems. Then, the position of the elliptical region in the current frame is determined according to the shooting position in the second coordinate system, the projection equation and the transformation equation corresponding to the camera projection algorithm.
进一步的,上述相机投影算法确定的第二坐标系下的拍摄位置在一些情况下存在不准确的问题,导致第一坐标系和第二坐标系的变换关系的误差可能较大。针对上述问题,在整个跟踪过程中,可以设置两个队列,分别保存最近的设定数量的slam_w_T_c和ellipse_w_T_c,设定数量可以为10。当前帧因为相机运动较快导致图像模糊检测不到椭圆时,基于保存的若干个slam_w_T_c和ellipse_w_T_c进行两个坐标系的变换关系的确定,将多个变换关系的平均值确定为最终的目标变换关系。从而根据SLAM算法确定的拍摄位置和目标变换关系确定在相机投影算法对应的第二坐标系下的拍摄位置,之后根据二坐标系下的拍摄位置确定当前帧中椭圆形区域的位置。Further, the shooting position in the second coordinate system determined by the above camera projection algorithm may be inaccurate in some cases, resulting in a large error in the transformation relationship between the first coordinate system and the second coordinate system. In view of the above problems, in the whole tracking process, two queues can be set up to store the latest set number of slam_w_T_c and ellipse_w_T_c respectively, and the set number can be 10. When the current frame is blurred and the ellipse cannot be detected due to the fast camera movement, the transformation relationship between the two coordinate systems is determined based on several saved slam_w_T_c and ellipse_w_T_c, and the average value of the multiple transformation relationships is determined as the final target transformation relationship . Therefore, the shooting position in the second coordinate system corresponding to the camera projection algorithm is determined according to the shooting position determined by the SLAM algorithm and the target transformation relationship, and then the position of the elliptical area in the current frame is determined according to the shooting position in the two coordinate system.
本公开实施例提供的目标跟踪方案,获取目标视频帧;通过对目标视频帧的特征检测确定目标区域在目标视频帧中的位置信息;如果目标区域在目标视频帧中的位置信息确定失败,则确定目标拍摄位置信息;根据目标拍摄位置信息以及相机投影算法,再次确定目标区域在目标视频帧中的位置信息。采用上述技术方案,在对视频帧的目标区域跟踪失败之后,可以根据同步定位与建图算法确定的拍摄位置来重新确定该视频帧中的目标区域的位置,实现跟踪恢复,在相机运动较快时也能实现目标跟踪,提升了跟踪鲁棒性。The target tracking solution provided by the embodiment of the present disclosure obtains the target video frame; the position information of the target area in the target video frame is determined by the feature detection of the target video frame; if the determination of the position information of the target area in the target video frame fails, then Determine the target shooting position information; according to the target shooting position information and the camera projection algorithm, determine the position information of the target area in the target video frame again. With the above technical solution, after the tracking of the target area of the video frame fails, the position of the target area in the video frame can be re-determined according to the shooting position determined by the synchronous positioning and mapping algorithm, so as to realize the tracking recovery, and the camera can move faster. It can also achieve target tracking and improve the tracking robustness.
图6为本公开实施例提供的一种目标跟踪装置的结构示意图,该装置可由软件和/或硬件实现,一般可集成在电子设备中。如图6所示,该装置包括:FIG. 6 is a schematic structural diagram of a target tracking apparatus provided by an embodiment of the present disclosure. The apparatus may be implemented by software and/or hardware, and may generally be integrated into an electronic device. As shown in Figure 6, the device includes:
视频帧模块301,用于获取目标视频帧;A video frame module 301, configured to obtain a target video frame;
第一位置模块302,用于通过对所述目标视频帧的特征检测确定目标区域在所述目标视频帧中的位置信息;The first position module 302 is used to determine the position information of the target area in the target video frame by detecting the feature of the target video frame;
拍摄位置模块303,用于如果所述目标区域在所述目标视频帧中的位置信息确定失败,则确定目标拍摄位置信息;A shooting position module 303, configured to determine the target shooting position information if the position information of the target area in the target video frame fails to be determined;
第二位置模块304,用于根据所述目标拍摄位置信息以及相机投影算法,再次确定所述目标区域在所述目标视频帧中的位置信息。The second position module 304 is configured to re-determine the position information of the target area in the target video frame according to the target shooting position information and the camera projection algorithm.
可选的,所述拍摄位置模块303用于:Optionally, the shooting position module 303 is used for:
通过同步定位与建图算法确定第一坐标系下的初始拍摄位置信息;Determine the initial shooting position information in the first coordinate system by synchronous positioning and mapping algorithm;
确定所述第一坐标系与所述相机投影算法对应的第二坐标系的目标变换关系;determining the target transformation relationship between the first coordinate system and the second coordinate system corresponding to the camera projection algorithm;
根据所述初始拍摄位置信息以及所述目标变换关系,确定所述目标拍摄位置信息。The target shooting position information is determined according to the initial shooting position information and the target transformation relationship.
可选的,所述拍摄位置模块303用于:Optionally, the shooting position module 303 is used for:
通过所述同步定位与建图算法以及所述相机投影算法分别确定已检视频帧的第一拍摄位置信息和第二拍摄位置信息,其中,所述已检视频帧为对所述目标区域跟踪成功的视频帧;The first shooting position information and the second shooting position information of the detected video frame are respectively determined by the synchronous positioning and mapping algorithm and the camera projection algorithm, wherein the detected video frame is the successful tracking of the target area. video frame;
根据所述已检视频帧的第一拍摄位置信息和第二拍摄位置信息,确定所述目标变换关系。The target transformation relationship is determined according to the first shooting position information and the second shooting position information of the detected video frame.
可选的,所述已检视频帧的数量为一个或多个,当所述已检视频帧的数量为多个时,所述目标变换关系为各所述已检视频帧对应的变换关系的平均值,所述目标变换关系包括变换尺度和变换位移。Optionally, the number of the detected video frames is one or more, and when the number of the detected video frames is multiple, the target transformation relationship is the transformation relationship corresponding to each of the detected video frames. The average value, the target transformation relationship includes transformation scale and transformation displacement.
可选的,所述拍摄位置模块303用于:Optionally, the shooting position module 303 is used for:
基于所述目标变换关系将所述初始拍摄位置信息从所述第一坐标系的信息变换为所述第二坐标系的信息,得到所述目标拍摄位置信息。The initial shooting position information is transformed from the information of the first coordinate system to the information of the second coordinate system based on the target transformation relationship, so as to obtain the target shooting position information.
可选的,所述第二位置模块304用于:Optionally, the second location module 304 is used for:
根据所述目标拍摄位置信息以及所述目标区域对应的目标形状物体的位置信息进行位置求解,确定拍摄位置到所述目标形状物体的位移信息;其中,目标区域为所述目标视频帧中所述目标性形状物体所在区域;According to the target shooting position information and the position information of the target shape object corresponding to the target area, the position is solved, and the displacement information from the shooting position to the target shape object is determined; wherein, the target area is the target video frame. The area where the target shape object is located;
将所述拍摄位置到所述目标形状物体的位移信息输入所述相机投影算法的投影方程中,确定所述目标区域在所述目标视频帧中的位置信息。The displacement information from the shooting position to the target-shaped object is input into the projection equation of the camera projection algorithm, and the position information of the target area in the target video frame is determined.
可选的,所述装置还包括法向模块,用于:Optionally, the device further includes a normal module for:
确定所述目标区域对应的目标形状物体所在平面的初始法向量,并将所述初始法向量在水平面或竖平面上的投影确定为目标法向量。The initial normal vector of the plane where the target shape object corresponding to the target area is located is determined, and the projection of the initial normal vector on the horizontal plane or the vertical plane is determined as the target normal vector.
本公开实施例所提供的目标跟踪装置可执行本公开任意实施例所提供的目标跟踪方法,具备执行方法相应的功能模块和有益效果。The target tracking device provided by the embodiment of the present disclosure can execute the target tracking method provided by any embodiment of the present disclosure, and has functional modules and beneficial effects corresponding to the execution method.
本公开实施例还提供了一种计算机程序产品,包括计算机程序/指令,该计算机程序/指令被处理器执行时实现本公开任意实施例所提供的目标跟踪方法。An embodiment of the present disclosure also provides a computer program product, including a computer program/instruction, which implements the target tracking method provided by any embodiment of the present disclosure when the computer program/instruction is executed by a processor.
图7为本公开实施例提供的一种电子设备的结构示意图。下面具体参考图7,其示出了适于用来实现本公开实施例中的电子设备400的结构示意图。本公开实施例中的电子设备400可以包括但不限于诸如移动电话、笔记本电脑、数字广播接收器、个人数字助理(Personal Digital Assistant,PDA)、平板电脑(Portable Android Device,PAD)、便携式多媒体播放器(Portable Media Player,PMP)、车载终端(例如车载导航终端)等等的移动终端 以及诸如数字TV、台式计算机等等的固定终端。图7示出的电子设备仅仅是一个示例,不应对本公开实施例的功能和使用范围带来任何限制。FIG. 7 is a schematic structural diagram of an electronic device according to an embodiment of the present disclosure. Referring specifically to FIG. 7 below, it shows a schematic structural diagram of an electronic device 400 suitable for implementing an embodiment of the present disclosure. The electronic device 400 in the embodiment of the present disclosure may include, but is not limited to, such as a mobile phone, a notebook computer, a digital broadcast receiver, a Personal Digital Assistant (PDA), a tablet computer (Portable Android Device, PAD), a portable multimedia player Portable Media Player (PMP), mobile terminals such as in-vehicle terminals (such as in-vehicle navigation terminals), etc., as well as fixed terminals such as digital TVs, desktop computers, and the like. The electronic device shown in FIG. 7 is only an example, and should not impose any limitation on the function and scope of use of the embodiments of the present disclosure.
如图7所示,电子设备400可以包括处理装置(例如中央处理器、图形处理器等)401,其可以根据存储在只读存储器(Read Only Memory,ROM)402中的程序或者从存储装置408加载到随机访问存储器(Random Access Memory,RAM)403中的程序而执行各种适当的动作和处理。在RAM 403中,还存储有电子设备400操作所需的各种程序和数据。处理装置401、ROM 402以及RAM 403通过总线404彼此相连。输入/输出(Input/Output,I/O)接口405也连接至总线404。As shown in FIG. 7 , the electronic device 400 may include a processing device (such as a central processing unit, a graphics processor, etc.) 401, which may be stored in a read-only memory (Read Only Memory, ROM) 402 according to a program or from a storage device 408 A program loaded into a random access memory (RAM) 403 performs various appropriate actions and processes. In the RAM 403, various programs and data required for the operation of the electronic device 400 are also stored. The processing device 401, the ROM 402, and the RAM 403 are connected to each other through a bus 404. An Input/Output (I/O) interface 405 is also connected to the bus 404 .
通常,以下装置可以连接至I/O接口405:包括例如触摸屏、触摸板、键盘、鼠标、摄像头、麦克风、加速度计、陀螺仪等的输入装置406;包括例如液晶显示器(Liquid Crystal Display,LCD)、扬声器、振动器等的输出装置407;包括例如磁带、硬盘等的存储装置408;以及通信装置409。通信装置409可以允许电子设备400与其他设备进行无线或有线通信以交换数据。虽然图7示出了具有各种装置的电子设备400,但是应理解的是,并不要求实施或具备所有示出的装置。可以替代地实施或具备更多或更少的装置。Typically, the following devices can be connected to the I/O interface 405: input devices 406 including, for example, a touch screen, touch pad, keyboard, mouse, camera, microphone, accelerometer, gyroscope, etc.; including, for example, a Liquid Crystal Display (LCD) output device 407 , a speaker, a vibrator, etc.; a storage device 408 including, for example, a magnetic tape, a hard disk, etc.; and a communication device 409 . Communication means 409 may allow electronic device 400 to communicate wirelessly or by wire with other devices to exchange data. Although FIG. 7 shows electronic device 400 having various means, it should be understood that not all of the illustrated means are required to be implemented or provided. More or fewer devices may alternatively be implemented or provided.
本公开实施例提供的技术方案与现有技术相比具有如下优点:本公开实施例提供的目标跟踪方案,获取目标视频帧;通过对目标视频帧的特征检测确定目标区域在目标视频帧中的位置信息;如果目标区域在目标视频帧中的位置信息确定失败,则确定目标拍摄位置信息;根据目标拍摄位置信息以及相机投影算法,再次确定目标区域在目标视频帧中的位置信息。采用上述技术方案,在对视频帧的目标区域跟踪失败之后,可以根据同步定位与建图算法确定的拍摄位置来重新确定该视频帧中的目标区域的位置,实现跟踪恢复,在相机运动较快时也能实现目标跟踪,提升了跟踪鲁棒性。Compared with the prior art, the technical solution provided by the embodiment of the present disclosure has the following advantages: the target tracking solution provided by the embodiment of the present disclosure obtains the target video frame; the feature detection of the target video frame determines the position of the target area in the target video frame. Position information; if the determination of the position information of the target area in the target video frame fails, determine the target shooting position information; according to the target shooting position information and the camera projection algorithm, determine the position information of the target area in the target video frame again. With the above technical solution, after the tracking of the target area of the video frame fails, the position of the target area in the video frame can be re-determined according to the shooting position determined by the synchronous positioning and mapping algorithm, so as to realize the tracking recovery, and the camera can move faster. It can also achieve target tracking and improve the tracking robustness.
特别地,根据本公开的实施例,上文参考流程图描述的过程可以被实现为计算机软件程序。例如,本公开的实施例包括一种计算机程序产品,其包括承载在非暂态计算机可读介质上的计算机程序,该计算机程序包含用于执行流程图所示的方法的程序代码。在这样的实施例中,该计算机程序可以通过通信装置409从网络上被下载和安装,或者从存储装置408被安装,或者从ROM 402被安装。在该计算机程序被处理装置401执行时,执行本公开实施例的目标跟踪方法中限定的上述功能。In particular, according to embodiments of the present disclosure, the processes described above with reference to the flowcharts may be implemented as computer software programs. For example, embodiments of the present disclosure include a computer program product comprising a computer program carried on a non-transitory computer readable medium, the computer program containing program code for performing the method illustrated in the flowchart. In such an embodiment, the computer program may be downloaded and installed from the network via the communication device 409, or from the storage device 408, or from the ROM 402. When the computer program is executed by the processing device 401, the above-mentioned functions defined in the target tracking method of the embodiment of the present disclosure are executed.
需要说明的是,本公开上述的计算机可读介质可以是计算机可读信号介质或者计算机可读存储介质或者是上述两者的任意组合。计算机可读存储介质例如可以是——但不限于——电、磁、光、电磁、红外线、或半导体的系统、装置或器件,或者任意以上的组合。计算机可读存储介质的更具体的例子可以包括但不限于:具有一个或多个导线的电连接、便携式计算机磁盘、硬盘、随机访问存储器(RAM)、只读存储器(ROM)、可擦式可编程只读存储器(Erasable Programmable Read-Only Memory,EPROM或闪存)、光纤、便携式紧凑磁盘只读存储器(Portable Compact Disk Read-Only Memory,CD-ROM)、光存储器件、磁存储器件、或者上述的任意合适的组合。在本公开中,计算机可读存储介质可以是任何包含或存储程序的有形介质,该程序可以被指令执行系统、装置或者器件使用或者与其结合使用。而在本公开中,计算机可读信号介质可以包括在基带中或者作为载波一部分传播的数据信号,其中承载了计算机可读的程序代码。这种传播的数据信号可以采用多种形式,包括但不限于电磁信号、光信号或上述的任意合适的组合。计 算机可读信号介质还可以是计算机可读存储介质以外的任何计算机可读介质,该计算机可读信号介质可以发送、传播或者传输用于由指令执行系统、装置或者器件使用或者与其结合使用的程序。计算机可读介质上包含的程序代码可以用任何适当的介质传输,包括但不限于:电线、光缆、射频(Radio Frequency,RF)等等,或者上述的任意合适的组合。It should be noted that the computer-readable medium mentioned above in the present disclosure may be a computer-readable signal medium or a computer-readable storage medium, or any combination of the above two. The computer-readable storage medium can be, for example, but not limited to, an electrical, magnetic, optical, electromagnetic, infrared, or semiconductor system, apparatus or device, or a combination of any of the above. More specific examples of computer readable storage media may include, but are not limited to, electrical connections with one or more wires, portable computer disks, hard disks, random access memory (RAM), read only memory (ROM), erasable Erasable Programmable Read-Only Memory (EPROM or flash memory), optical fiber, Portable Compact Disk Read-Only Memory (CD-ROM), optical storage device, magnetic storage device, or the above any suitable combination. In the present disclosure, a computer-readable storage medium may be any tangible medium that contains or stores a program that can be used by or in conjunction with an instruction execution system, apparatus, or device. In the present disclosure, however, a computer-readable signal medium may include a data signal propagated in baseband or as part of a carrier wave with computer-readable program code embodied thereon. Such propagated data signals may take a variety of forms, including but not limited to electromagnetic signals, optical signals, or any suitable combination of the foregoing. A computer-readable signal medium can also be any computer-readable medium other than a computer-readable storage medium that can transmit, propagate, or transport the program for use by or in connection with the instruction execution system, apparatus, or device . The program code embodied on the computer-readable medium can be transmitted by any suitable medium, including but not limited to: electric wire, optical fiber cable, radio frequency (RF), etc., or any suitable combination of the above.
在一些实施方式中,客户端、服务器可以利用诸如超文本传输协议(HyperText Transfer Protocol,HTTP)之类的任何当前已知或未来研发的网络协议进行通信,并且可以与任意形式或介质的数字数据通信(例如,通信网络)互连。通信网络的示例包括局域网(Local Area Network,LAN),广域网(Wide Area Network,WAN),网际网(例如,互联网)以及端对端网络(例如,ad hoc端对端网络),以及任何当前已知或未来研发的网络。In some embodiments, clients and servers can communicate using any currently known or future developed network protocols, such as HyperText Transfer Protocol (HTTP), and can communicate with digital data in any form or medium. Communication (eg, a communication network) interconnects. Examples of communication networks include Local Area Networks (LANs), Wide Area Networks (WANs), the Internet (eg, the Internet), and peer-to-peer networks (eg, ad hoc peer-to-peer networks), as well as any currently Known or future developed networks.
上述计算机可读介质可以是上述电子设备中所包含的;也可以是单独存在,而未装配入该电子设备中。The above-mentioned computer-readable medium may be included in the above-mentioned electronic device; or may exist alone without being assembled into the electronic device.
上述计算机可读介质承载有一个或者多个程序,当上述一个或者多个程序被该电子设备执行时,使得该电子设备:获取目标视频帧;通过对所述目标视频帧的特征检测确定目标区域在所述目标视频帧中的位置信息;如果所述目标区域在所述目标视频帧中的位置信息确定失败,则确定目标拍摄位置信息;根据所述目标拍摄位置信息以及相机投影算法,再次确定所述目标区域在所述目标视频帧中的位置信息。The above-mentioned computer-readable medium carries one or more programs, and when the above-mentioned one or more programs are executed by the electronic device, the electronic device is made to: acquire a target video frame; determine a target area by detecting the features of the target video frame position information in the target video frame; if the determination of the position information of the target area in the target video frame fails, determine the target shooting position information; according to the target shooting position information and the camera projection algorithm, determine again Position information of the target area in the target video frame.
可以以一种或多种程序设计语言或其组合来编写用于执行本公开的操作的计算机程序代码,上述程序设计语言包括但不限于面向对象的程序设计语言-诸如Java、Smalltalk、C++,还包括常规的过程式程序设计语言-诸如“C”语言或类似的程序设计语言。程序代码可以完全地在用户计算机上执行、部分地在用户计算机上执行、作为一个独立的软件包执行、部分在用户计算机上部分在远程计算机上执行、或者完全在远程计算机或服务器上执行。在涉及远程计算机的情形中,远程计算机可以通过任意种类的网络——包括局域网(LAN)或广域网(WAN)-连接到用户计算机,或者,可以连接到外部计算机(例如利用因特网服务提供商来通过因特网连接)。Computer program code for performing operations of the present disclosure may be written in one or more programming languages, including but not limited to object-oriented programming languages - such as Java, Smalltalk, C++, and This includes conventional procedural programming languages - such as the "C" language or similar programming languages. The program code may execute entirely on the user's computer, partly on the user's computer, as a stand-alone software package, partly on the user's computer and partly on a remote computer, or entirely on the remote computer or server. Where a remote computer is involved, the remote computer may be connected to the user's computer through any kind of network, including a local area network (LAN) or a wide area network (WAN), or may be connected to an external computer (eg, using an Internet service provider through Internet connection).
附图中的流程图和框图,图示了按照本公开各种实施例的系统、方法和计算机程序产品的可能实现的体系架构、功能和操作。在这点上,流程图或框图中的每个方框可以代表一个模块、程序段、或代码的一部分,该模块、程序段、或代码的一部分包含一个或多个用于实现规定的逻辑功能的可执行指令。也应当注意,在有些作为替换的实现中,方框中所标注的功能也可以以不同于附图中所标注的顺序发生。例如,两个接连地表示的方框实际上可以基本并行地执行,它们有时也可以按相反的顺序执行,这依所涉及的功能而定。也要注意的是,框图和/或流程图中的每个方框、以及框图和/或流程图中的方框的组合,可以用执行规定的功能或操作的专用的基于硬件的系统来实现,或者可以用专用硬件与计算机指令的组合来实现。The flowchart and block diagrams in the Figures illustrate the architecture, functionality, and operation of possible implementations of systems, methods and computer program products according to various embodiments of the present disclosure. In this regard, each block in the flowchart or block diagrams may represent a module, segment, or portion of code that contains one or more logical functions for implementing the specified functions executable instructions. It should also be noted that, in some alternative implementations, the functions noted in the blocks may occur out of the order noted in the figures. For example, two blocks shown in succession may, in fact, be executed substantially concurrently, or the blocks may sometimes be executed in the reverse order, depending upon the functionality involved. It is also noted that each block of the block diagrams and/or flowchart illustrations, and combinations of blocks in the block diagrams and/or flowchart illustrations, can be implemented in dedicated hardware-based systems that perform the specified functions or operations , or can be implemented in a combination of dedicated hardware and computer instructions.
描述于本公开实施例中所涉及到的单元可以通过软件的方式实现,也可以通过硬件的方式来实现。其中,单元的名称在某种情况下并不构成对该单元本身的限定。The units involved in the embodiments of the present disclosure may be implemented in a software manner, and may also be implemented in a hardware manner. Among them, the name of the unit does not constitute a limitation of the unit itself under certain circumstances.
本文中以上描述的功能可以至少部分地由一个或多个硬件逻辑部件来执行。例如,非限制性地,可以使用的示范类型的硬件逻辑部件包括:现场可编程门阵列(Field-Programmable Gate Array,FPGA)、专用集成电路(Field-Programmable Gate Array, ASIC)、专用标准产品(Application Specific Standard Product,ASSP)、片上系统(System-on-a-chip,SOC)、复杂可编程逻辑设备(Complex Programmable Logic Device,CPLD)等等。The functions described herein above may be performed, at least in part, by one or more hardware logic components. For example, without limitation, exemplary types of hardware logic components that may be used include: Field-Programmable Gate Arrays (FPGAs), Application-Specific Integrated Circuits (Field-Programmable Gate Arrays, ASICs), Application-Specific Standard Products ( Application Specific Standard Product, ASSP), System-on-a-chip (SOC), Complex Programmable Logic Device (CPLD) and so on.
在本公开的上下文中,机器可读介质可以是有形的介质,其可以包含或存储以供指令执行系统、装置或设备使用或与指令执行系统、装置或设备结合地使用的程序。机器可读介质可以是机器可读信号介质或机器可读储存介质。机器可读介质可以包括但不限于电子的、磁性的、光学的、电磁的、红外的、或半导体系统、装置或设备,或者上述内容的任何合适组合。机器可读存储介质的更具体示例会包括基于一个或多个线的电气连接、便携式计算机盘、硬盘、随机存取存储器(RAM)、只读存储器(ROM)、可擦除可编程只读存储器(EPROM或快闪存储器)、光纤、便捷式紧凑盘只读存储器(CD-ROM)、光学储存设备、磁储存设备、或上述内容的任何合适组合。In the context of the present disclosure, a machine-readable medium may be a tangible medium that may contain or store a program for use by or in connection with the instruction execution system, apparatus or device. The machine-readable medium may be a machine-readable signal medium or a machine-readable storage medium. Machine-readable media may include, but are not limited to, electronic, magnetic, optical, electromagnetic, infrared, or semiconductor systems, devices, or devices, or any suitable combination of the foregoing. More specific examples of machine-readable storage media would include one or more wire-based electrical connections, portable computer disks, hard disks, random access memory (RAM), read only memory (ROM), erasable programmable read only memory (EPROM or flash memory), fiber optics, compact disk read only memory (CD-ROM), optical storage, magnetic storage, or any suitable combination of the foregoing.
根据本公开的一个或多个实施例,本公开提供了一种目标跟踪方法,包括:According to one or more embodiments of the present disclosure, the present disclosure provides a target tracking method, including:
获取目标视频帧;Get the target video frame;
通过对所述目标视频帧的特征检测确定目标区域在所述目标视频帧中的位置信息;Determine the position information of the target area in the target video frame by detecting the feature of the target video frame;
如果所述目标区域在所述目标视频帧中的位置信息确定失败,则确定目标拍摄位置信息;If the determination of the position information of the target area in the target video frame fails, determining the target shooting position information;
根据所述目标拍摄位置信息以及相机投影算法,再次确定所述目标区域在所述目标视频帧中的位置信息。According to the target shooting position information and the camera projection algorithm, the position information of the target area in the target video frame is determined again.
根据本公开的一个或多个实施例,本公开提供的目标跟踪方法中,所述确定目标拍摄位置信息,包括:According to one or more embodiments of the present disclosure, in the target tracking method provided by the present disclosure, the determining the target shooting position information includes:
通过同步定位与建图算法确定第一坐标系下的初始拍摄位置信息;Determine the initial shooting position information in the first coordinate system by synchronous positioning and mapping algorithm;
确定所述第一坐标系与所述相机投影算法对应的第二坐标系的目标变换关系;determining the target transformation relationship between the first coordinate system and the second coordinate system corresponding to the camera projection algorithm;
根据所述初始拍摄位置信息以及所述目标变换关系,确定所述目标拍摄位置信息。The target shooting position information is determined according to the initial shooting position information and the target transformation relationship.
根据本公开的一个或多个实施例,本公开提供的目标跟踪方法中,确定所述第一坐标系与所述相机投影算法对应的第二坐标系的目标变换关系,包括:According to one or more embodiments of the present disclosure, in the target tracking method provided by the present disclosure, determining the target transformation relationship between the first coordinate system and the second coordinate system corresponding to the camera projection algorithm includes:
通过所述同步定位与建图算法以及所述相机投影算法分别确定已检视频帧的第一拍摄位置信息和第二拍摄位置信息,其中,所述已检视频帧为对所述目标区域跟踪成功的视频帧;The first shooting position information and the second shooting position information of the detected video frame are respectively determined by the synchronous positioning and mapping algorithm and the camera projection algorithm, wherein the detected video frame is the successful tracking of the target area. video frame;
根据所述已检视频帧的第一拍摄位置信息和第二拍摄位置信息,确定所述目标变换关系。The target transformation relationship is determined according to the first shooting position information and the second shooting position information of the detected video frame.
根据本公开的一个或多个实施例,本公开提供的目标跟踪方法中,所述已检视频帧的数量为一个或多个,当所述已检视频帧的数量为多个时,所述目标变换关系为各所述已检视频帧对应的变换关系的平均值,所述目标变换关系包括变换尺度和变换位移。According to one or more embodiments of the present disclosure, in the target tracking method provided by the present disclosure, the number of the detected video frames is one or more, and when the number of the detected video frames is multiple, the The target transformation relationship is the average value of the transformation relationships corresponding to the detected video frames, and the target transformation relationship includes transformation scale and transformation displacement.
根据本公开的一个或多个实施例,本公开提供的目标跟踪方法中,根据所述初始拍摄位置信息以及所述目标变换关系,确定所述目标拍摄位置信息,包括:According to one or more embodiments of the present disclosure, in the target tracking method provided by the present disclosure, determining the target shooting position information according to the initial shooting position information and the target transformation relationship includes:
基于所述目标变换关系将所述初始拍摄位置信息从所述第一坐标系的信息变换为所述第二坐标系的信息,得到所述目标拍摄位置信息。The initial shooting position information is transformed from the information of the first coordinate system to the information of the second coordinate system based on the target transformation relationship, so as to obtain the target shooting position information.
根据本公开的一个或多个实施例,本公开提供的目标跟踪方法中,根据所述目标拍摄位置信息以及相机投影算法,再次确定所述目标区域在所述目标视频帧中的位置信息,包括:According to one or more embodiments of the present disclosure, in the target tracking method provided by the present disclosure, according to the target shooting position information and the camera projection algorithm, the position information of the target area in the target video frame is determined again, including: :
根据所述目标拍摄位置信息以及所述目标区域对应的目标形状物体的位置信息进行位置求解,确定拍摄位置到所述目标形状物体的位移信息;其中,目标区域为所述目标视频帧中所述目标性形状物体所在区域;According to the target shooting position information and the position information of the target shape object corresponding to the target area, the position is solved, and the displacement information from the shooting position to the target shape object is determined; wherein, the target area is the target video frame. The area where the target shape object is located;
将所述拍摄位置到所述目标形状物体的位移信息输入所述相机投影算法的投影方程中,确定所述目标区域在所述目标视频帧中的位置信息。The displacement information from the shooting position to the target-shaped object is input into the projection equation of the camera projection algorithm, and the position information of the target area in the target video frame is determined.
根据本公开的一个或多个实施例,本公开提供的目标跟踪方法中,还包括:According to one or more embodiments of the present disclosure, the target tracking method provided by the present disclosure further includes:
确定所述目标区域对应的目标形状物体所在平面的初始法向量,并将所述初始法向量在水平面或竖平面上的投影确定为目标法向量。The initial normal vector of the plane where the target shape object corresponding to the target area is located is determined, and the projection of the initial normal vector on the horizontal plane or the vertical plane is determined as the target normal vector.
根据本公开的一个或多个实施例,本公开提供了一种目标跟踪装置,包括:According to one or more embodiments of the present disclosure, the present disclosure provides a target tracking device, including:
视频帧模块,用于获取目标视频帧;The video frame module is used to obtain the target video frame;
第一位置模块,用于通过对所述目标视频帧的特征检测确定目标区域在所述目标视频帧中的位置信息;a first position module, for determining the position information of the target area in the target video frame by detecting the feature of the target video frame;
拍摄位置模块,用于如果所述目标区域在所述目标视频帧中的位置信息确定失败,则确定目标拍摄位置信息;a shooting position module, configured to determine target shooting position information if the position information of the target area in the target video frame fails to be determined;
第二位置模块,用于根据所述目标拍摄位置信息以及相机投影算法,再次确定所述目标区域在所述目标视频帧中的位置信息。The second position module is configured to re-determine the position information of the target area in the target video frame according to the target shooting position information and the camera projection algorithm.
根据本公开的一个或多个实施例,本公开提供的目标跟踪装置中,所述拍摄位置模块用于:According to one or more embodiments of the present disclosure, in the target tracking device provided by the present disclosure, the shooting position module is used for:
通过同步定位与建图算法确定第一坐标系下的初始拍摄位置信息;Determine the initial shooting position information in the first coordinate system by synchronous positioning and mapping algorithm;
确定所述第一坐标系与所述相机投影算法对应的第二坐标系的目标变换关系;determining the target transformation relationship between the first coordinate system and the second coordinate system corresponding to the camera projection algorithm;
根据所述初始拍摄位置信息以及所述目标变换关系,确定所述目标拍摄位置信息。The target shooting position information is determined according to the initial shooting position information and the target transformation relationship.
根据本公开的一个或多个实施例,本公开提供的目标跟踪装置中,所述拍摄位置模块用于:According to one or more embodiments of the present disclosure, in the target tracking device provided by the present disclosure, the shooting position module is used for:
通过所述同步定位与建图算法以及所述相机投影算法分别确定已检视频帧的第一拍摄位置信息和第二拍摄位置信息,其中,所述已检视频帧为对所述目标区域跟踪成功的视频帧;The first shooting position information and the second shooting position information of the detected video frame are respectively determined by the synchronous positioning and mapping algorithm and the camera projection algorithm, wherein the detected video frame is the successful tracking of the target area. video frame;
根据所述已检视频帧的第一拍摄位置信息和第二拍摄位置信息,确定所述目标变换关系。The target transformation relationship is determined according to the first shooting position information and the second shooting position information of the detected video frame.
根据本公开的一个或多个实施例,本公开提供的目标跟踪装置中,所述已检视频帧的数量为一个或多个,当所述已检视频帧的数量为多个时,所述目标变换关系为各所述已检视频帧对应的变换关系的平均值,所述目标变换关系包括变换尺度和变换位移。According to one or more embodiments of the present disclosure, in the target tracking device provided by the present disclosure, the number of the detected video frames is one or more, and when the number of the detected video frames is multiple, the The target transformation relationship is the average value of the transformation relationships corresponding to the detected video frames, and the target transformation relationship includes transformation scale and transformation displacement.
根据本公开的一个或多个实施例,本公开提供的目标跟踪装置中,所述拍摄位置模块用于:According to one or more embodiments of the present disclosure, in the target tracking device provided by the present disclosure, the shooting position module is used for:
基于所述目标变换关系将所述初始拍摄位置信息从所述第一坐标系的信息变换为所述第二坐标系的信息,得到所述目标拍摄位置信息。The initial shooting position information is transformed from the information of the first coordinate system to the information of the second coordinate system based on the target transformation relationship, so as to obtain the target shooting position information.
根据本公开的一个或多个实施例,本公开提供的目标跟踪装置中,所述第二位置模块用于:According to one or more embodiments of the present disclosure, in the target tracking device provided by the present disclosure, the second location module is used for:
根据所述目标拍摄位置信息以及所述目标区域对应的目标形状物体的位置信息进行位置求解,确定拍摄位置到所述目标形状物体的位移信息;其中,目标区域为所述目标视频帧中所述目标性形状物体所在区域;According to the target shooting position information and the position information of the target shape object corresponding to the target area, the position is solved, and the displacement information from the shooting position to the target shape object is determined; wherein, the target area is the target video frame. The area where the target shape object is located;
将所述拍摄位置到所述目标形状物体的位移信息输入所述相机投影算法的投影方程中,确定所述目标区域在所述目标视频帧中的位置信息。The displacement information from the shooting position to the target-shaped object is input into the projection equation of the camera projection algorithm, and the position information of the target area in the target video frame is determined.
根据本公开的一个或多个实施例,本公开提供的目标跟踪装置中,所述装置还包括法向模块,用于:According to one or more embodiments of the present disclosure, in the target tracking device provided by the present disclosure, the device further includes a normal module for:
确定所述目标区域对应的目标形状物体所在平面的初始法向量,并将所述初始法向量在水平面或竖平面上的投影确定为目标法向量。The initial normal vector of the plane where the target shape object corresponding to the target area is located is determined, and the projection of the initial normal vector on the horizontal plane or the vertical plane is determined as the target normal vector.
根据本公开的一个或多个实施例,本公开提供了一种电子设备,包括:According to one or more embodiments of the present disclosure, the present disclosure provides an electronic device, comprising:
处理器;processor;
用于存储所述处理器可执行指令的存储器;a memory for storing the processor-executable instructions;
所述处理器,用于从所述存储器中读取所述可执行指令,并执行所述指令以实现如本公开提供的任一所述的目标跟踪方法。The processor is configured to read the executable instructions from the memory, and execute the instructions to implement any one of the target tracking methods provided in the present disclosure.
根据本公开的一个或多个实施例,本公开提供了一种计算机可读存储介质,所述存储介质存储有计算机程序,所述计算机程序用于执行如本公开提供的任一所述的目标跟踪方法。According to one or more embodiments of the present disclosure, the present disclosure provides a computer-readable storage medium storing a computer program for executing any of the objects provided by the present disclosure tracking method.
根据本公开的一个或多个实施例,本公开实施例还提供了一种计算机程序产品,包括计算机程序,所述计算机程序用于执行如本公开提供的任一所述的目标跟踪方法。According to one or more embodiments of the present disclosure, the embodiments of the present disclosure further provide a computer program product, including a computer program, for executing any one of the target tracking methods provided in the present disclosure.
根据本公开的一个或多个实施例,本公开实施例还提供了一种计算机程序,所述计算机程序用于执行如本公开提供的任一所述的目标跟踪方法。According to one or more embodiments of the present disclosure, an embodiment of the present disclosure further provides a computer program for executing the target tracking method as provided in any one of the present disclosure.
以上描述仅为本公开的较佳实施例以及对所运用技术原理的说明。本领域技术人员应当理解,本公开中所涉及的公开范围,并不限于上述技术特征的特定组合而成的技术方案,同时也应涵盖在不脱离上述公开构思的情况下,由上述技术特征或其等同特征进行任意组合而形成的其它技术方案。例如上述特征与本公开中公开的(但不限于)具有类似功能的技术特征进行互相替换而形成的技术方案。The above description is merely a preferred embodiment of the present disclosure and an illustration of the technical principles employed. Those skilled in the art should understand that the scope of the disclosure involved in the present disclosure is not limited to the technical solutions formed by the specific combination of the above-mentioned technical features, and should also cover, without departing from the above-mentioned disclosed concept, the technical solutions formed by the above-mentioned technical features or Other technical solutions formed by any combination of its equivalent features. For example, a technical solution is formed by replacing the above features with the technical features disclosed in the present disclosure (but not limited to) with similar functions.
此外,虽然采用特定次序描绘了各操作,但是这不应当理解为要求这些操作以所示出的特定次序或以顺序次序执行来执行。在一定环境下,多任务和并行处理可能是有利的。同样地,虽然在上面论述中包含了若干具体实现细节,但是这些不应当被解释为对本公开的范围的限制。在单独的实施例的上下文中描述的某些特征还可以组合地实现在单个实施例中。相反地,在单个实施例的上下文中描述的各种特征也可以单独地或以任何合适的子组合的方式实现在多个实施例中。Additionally, although operations are depicted in a particular order, this should not be construed as requiring that the operations be performed in the particular order shown or in a sequential order. Under certain circumstances, multitasking and parallel processing may be advantageous. Likewise, although the above discussion contains several implementation-specific details, these should not be construed as limitations on the scope of the present disclosure. Certain features that are described in the context of separate embodiments can also be implemented in combination in a single embodiment. Conversely, various features that are described in the context of a single embodiment can also be implemented in multiple embodiments separately or in any suitable subcombination.
尽管已经采用特定于结构特征和/或方法逻辑动作的语言描述了本主题,但是应当理解所附权利要求书中所限定的主题未必局限于上面描述的特定特征或动作。相反,上面所描述的特定特征和动作仅仅是实现权利要求书的示例形式。Although the subject matter has been described in language specific to structural features and/or logical acts of method, it is to be understood that the subject matter defined in the appended claims is not necessarily limited to the specific features or acts described above. Rather, the specific features and acts described above are merely example forms of implementing the claims.

Claims (12)

  1. 一种目标跟踪方法,其特征在于,包括:A target tracking method, comprising:
    获取目标视频帧;Get the target video frame;
    通过对所述目标视频帧的特征检测确定目标区域在所述目标视频帧中的位置信息;Determine the position information of the target area in the target video frame by detecting the feature of the target video frame;
    如果所述目标区域在所述目标视频帧中的位置信息确定失败,则确定目标拍摄位置信息;If the determination of the position information of the target area in the target video frame fails, determining the target shooting position information;
    根据所述目标拍摄位置信息以及相机投影算法,再次确定所述目标区域在所述目标视频帧中的位置信息。According to the target shooting position information and the camera projection algorithm, the position information of the target area in the target video frame is determined again.
  2. 根据权利要求1所述的方法,其特征在于,所述确定目标拍摄位置信息,包括:The method according to claim 1, wherein the determining the target shooting position information comprises:
    通过同步定位与建图算法确定第一坐标系下的初始拍摄位置信息;Determine the initial shooting position information in the first coordinate system by synchronous positioning and mapping algorithm;
    确定所述第一坐标系与所述相机投影算法对应的第二坐标系的目标变换关系;determining the target transformation relationship between the first coordinate system and the second coordinate system corresponding to the camera projection algorithm;
    根据所述初始拍摄位置信息以及所述目标变换关系,确定所述目标拍摄位置信息。The target shooting position information is determined according to the initial shooting position information and the target transformation relationship.
  3. 根据权利要求2所述的方法,其特征在于,确定所述第一坐标系与所述相机投影算法对应的第二坐标系的目标变换关系,包括:The method according to claim 2, wherein determining the target transformation relationship between the first coordinate system and the second coordinate system corresponding to the camera projection algorithm comprises:
    通过所述同步定位与建图算法以及所述相机投影算法分别确定已检视频帧的第一拍摄位置信息和第二拍摄位置信息,其中,所述已检视频帧为对所述目标区域跟踪成功的视频帧;The first shooting position information and the second shooting position information of the detected video frame are respectively determined by the synchronous positioning and mapping algorithm and the camera projection algorithm, wherein the detected video frame is the successful tracking of the target area. video frame;
    根据所述已检视频帧的第一拍摄位置信息和第二拍摄位置信息,确定所述目标变换关系。The target transformation relationship is determined according to the first shooting position information and the second shooting position information of the detected video frame.
  4. 根据权利要求3所述的方法,其特征在于,所述已检视频帧的数量为一个或多个,当所述已检视频帧的数量为多个时,所述目标变换关系为各所述已检视频帧对应的变换关系的平均值,所述目标变换关系包括变换尺度和变换位移。The method according to claim 3, wherein the number of the detected video frames is one or more, and when the number of the detected video frames is multiple, the target transformation relationship is each of the The average value of the transformation relationship corresponding to the detected video frame, the target transformation relationship includes transformation scale and transformation displacement.
  5. 根据权利要求2所述的方法,其特征在于,根据所述初始拍摄位置信息以及所述目标变换关系,确定所述目标拍摄位置信息,包括:The method according to claim 2, wherein determining the target shooting position information according to the initial shooting position information and the target transformation relationship comprises:
    基于所述目标变换关系将所述初始拍摄位置信息从所述第一坐标系的信息变换为所述第二坐标系的信息,得到所述目标拍摄位置信息。The initial shooting position information is transformed from the information of the first coordinate system to the information of the second coordinate system based on the target transformation relationship, so as to obtain the target shooting position information.
  6. 根据权利要求1所述的方法,其特征在于,根据所述目标拍摄位置信息以及相机投影算法,再次确定所述目标区域在所述目标视频帧中的位置信息,包括:The method according to claim 1, wherein determining the position information of the target area in the target video frame again according to the target shooting position information and a camera projection algorithm, comprising:
    根据所述目标拍摄位置信息以及所述目标区域对应的目标形状物体的位置信息进行位置求解,确定拍摄位置到所述目标形状物体的位移信息;其中,目标区域为所述目标视频帧中所述目标性形状物体所在区域;According to the target shooting position information and the position information of the target shape object corresponding to the target area, the position is solved, and the displacement information from the shooting position to the target shape object is determined; wherein, the target area is the target video frame. The area where the target shape object is located;
    将所述拍摄位置到所述目标形状物体的位移信息输入所述相机投影算法的投影方程中,确定所述目标区域在所述目标视频帧中的位置信息。The displacement information from the shooting position to the target-shaped object is input into the projection equation of the camera projection algorithm, and the position information of the target area in the target video frame is determined.
  7. 根据权利要求1、3或6中任一所述的方法,其特征在于,还包括:The method according to any one of claims 1, 3 or 6, further comprising:
    确定所述目标区域对应的目标形状物体所在平面的初始法向量,并将所述初始法向量在水平面或竖平面上的投影确定为目标法向量。The initial normal vector of the plane where the target shape object corresponding to the target area is located is determined, and the projection of the initial normal vector on the horizontal plane or the vertical plane is determined as the target normal vector.
  8. 一种目标跟踪装置,其特征在于,包括:A target tracking device, comprising:
    视频帧模块,用于获取目标视频帧;The video frame module is used to obtain the target video frame;
    第一位置模块,用于通过对所述目标视频帧的特征检测确定目标区域在所述目标视 频帧中的位置信息;The first position module, for determining the position information of target area in the described target video frame by the feature detection to described target video frame;
    拍摄位置模块,用于如果所述目标区域在所述目标视频帧中的位置信息确定失败,则确定目标拍摄位置信息;a shooting position module, configured to determine target shooting position information if the position information of the target area in the target video frame fails to be determined;
    第二位置模块,用于根据所述目标拍摄位置信息以及相机投影算法,再次确定所述目标区域在所述目标视频帧中的位置信息。The second position module is configured to re-determine the position information of the target area in the target video frame according to the target shooting position information and the camera projection algorithm.
  9. 一种电子设备,其特征在于,所述电子设备包括:An electronic device, characterized in that the electronic device comprises:
    处理器;processor;
    用于存储所述处理器可执行指令的存储器;a memory for storing the processor-executable instructions;
    所述处理器,用于从所述存储器中读取所述可执行指令,并执行所述指令以实现上述权利要求1-7中任一所述的目标跟踪方法。The processor is configured to read the executable instructions from the memory and execute the instructions to implement the target tracking method according to any one of the preceding claims 1-7.
  10. 一种计算机可读存储介质,其特征在于,所述存储介质存储有计算机程序,所述计算机程序用于执行上述权利要求1-7中任一所述的目标跟踪方法。A computer-readable storage medium, characterized in that the storage medium stores a computer program, and the computer program is used to execute the target tracking method according to any one of the preceding claims 1-7.
  11. 一种计算机程序产品,其特征在于,包括计算机程序,所述计算机程序用于执行上述权利要求1-7中任一所述的目标跟踪方法。A computer program product, characterized in that it includes a computer program, and the computer program is used to execute the target tracking method according to any one of the preceding claims 1-7.
  12. 一种计算机程序,其特征在于,所述计算机程序用于执行上述权利要求1-7中任一所述的目标跟踪方法。A computer program, characterized in that the computer program is used to execute the target tracking method according to any one of the above claims 1-7.
PCT/CN2022/080985 2021-03-15 2022-03-15 Target tracking method and apparatus, device, and medium WO2022194158A1 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
US18/468,647 US20240005552A1 (en) 2021-03-15 2023-09-15 Target tracking method and apparatus, device, and medium

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202110276360.4A CN115082516A (en) 2021-03-15 2021-03-15 Target tracking method, device, equipment and medium
CN202110276360.4 2021-03-15

Related Parent Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2022/080916 Continuation-In-Part WO2022194145A1 (en) 2021-03-15 2022-03-15 Photographing position determination method and apparatus, device, and medium

Related Child Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2022/080977 Continuation-In-Part WO2022194157A1 (en) 2021-03-15 2022-03-15 Target tracking method and apparatus, device and medium

Publications (1)

Publication Number Publication Date
WO2022194158A1 true WO2022194158A1 (en) 2022-09-22

Family

ID=83240411

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2022/080985 WO2022194158A1 (en) 2021-03-15 2022-03-15 Target tracking method and apparatus, device, and medium

Country Status (2)

Country Link
CN (1) CN115082516A (en)
WO (1) WO2022194158A1 (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN117218212A (en) * 2023-11-09 2023-12-12 杭州巨岩欣成科技有限公司 Camera calibration self-adaptive adjustment method and device, computer equipment and storage medium

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20170178355A1 (en) * 2015-12-17 2017-06-22 Stmicroelectronics Sa Determination of an ego-motion of a video apparatus in a slam type algorithm
CN107404615A (en) * 2017-06-29 2017-11-28 联想(北京)有限公司 Picture recording method and electronic equipment
CN109211241A (en) * 2018-09-08 2019-01-15 天津大学 The unmanned plane autonomic positioning method of view-based access control model SLAM
CN110335292A (en) * 2019-07-09 2019-10-15 北京猫眼视觉科技有限公司 It is a kind of to track the method and system for realizing simulated scenario tracking based on picture
CN111612827A (en) * 2020-05-21 2020-09-01 广州海格通信集团股份有限公司 Target position determining method and device based on multiple cameras and computer equipment

Family Cites Families (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR100457302B1 (en) * 2002-10-31 2004-11-16 전자부품연구원 Auto tracking and auto zooming method of multi channel by digital image processing
CN106713747A (en) * 2016-11-29 2017-05-24 维沃移动通信有限公司 Focusing method and mobile terminal
CN107993256A (en) * 2017-11-27 2018-05-04 广东工业大学 Dynamic target tracking method, apparatus and storage medium
CN108615247B (en) * 2018-04-27 2021-09-14 深圳市腾讯计算机系统有限公司 Method, device and equipment for relocating camera attitude tracking process and storage medium
CN108848304B (en) * 2018-05-30 2020-08-11 影石创新科技股份有限公司 Target tracking method and device of panoramic video and panoramic camera
CN109598744B (en) * 2018-11-29 2020-12-08 广州市百果园信息技术有限公司 Video tracking method, device, equipment and storage medium
CN109887003B (en) * 2019-01-23 2021-11-19 亮风台(上海)信息科技有限公司 Method and equipment for carrying out three-dimensional tracking initialization
CN111709973B (en) * 2020-06-16 2024-02-20 北京百度网讯科技有限公司 Target tracking method, device, equipment and storage medium

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20170178355A1 (en) * 2015-12-17 2017-06-22 Stmicroelectronics Sa Determination of an ego-motion of a video apparatus in a slam type algorithm
CN107404615A (en) * 2017-06-29 2017-11-28 联想(北京)有限公司 Picture recording method and electronic equipment
CN109211241A (en) * 2018-09-08 2019-01-15 天津大学 The unmanned plane autonomic positioning method of view-based access control model SLAM
CN110335292A (en) * 2019-07-09 2019-10-15 北京猫眼视觉科技有限公司 It is a kind of to track the method and system for realizing simulated scenario tracking based on picture
CN111612827A (en) * 2020-05-21 2020-09-01 广州海格通信集团股份有限公司 Target position determining method and device based on multiple cameras and computer equipment

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN117218212A (en) * 2023-11-09 2023-12-12 杭州巨岩欣成科技有限公司 Camera calibration self-adaptive adjustment method and device, computer equipment and storage medium
CN117218212B (en) * 2023-11-09 2024-02-13 杭州巨岩欣成科技有限公司 Camera calibration self-adaptive adjustment method and device, computer equipment and storage medium

Also Published As

Publication number Publication date
CN115082516A (en) 2022-09-20

Similar Documents

Publication Publication Date Title
CN110322500B (en) Optimization method and device for instant positioning and map construction, medium and electronic equipment
CN111325796B (en) Method and apparatus for determining pose of vision equipment
CN111127563A (en) Combined calibration method and device, electronic equipment and storage medium
EP3872764B1 (en) Method and apparatus for constructing map
WO2012148925A1 (en) Devices, methods and apparatuses for homography evaluation involving a mobile device
CN110660098B (en) Positioning method and device based on monocular vision
CN110349212B (en) Optimization method and device for instant positioning and map construction, medium and electronic equipment
CN112819860B (en) Visual inertial system initialization method and device, medium and electronic equipment
CN110728622B (en) Fisheye image processing method, device, electronic equipment and computer readable medium
CN114993328B (en) Vehicle positioning evaluation method, device, equipment and computer readable medium
CN110930438B (en) Image registration method, device, electronic equipment and storage medium
CN114399588B (en) Three-dimensional lane line generation method and device, electronic device and computer readable medium
CN112116655A (en) Method and device for determining position information of image of target object
WO2022194158A1 (en) Target tracking method and apparatus, device, and medium
WO2022194145A1 (en) Photographing position determination method and apparatus, device, and medium
CN116182878B (en) Road curved surface information generation method, device, equipment and computer readable medium
CN112270242A (en) Track display method and device, readable medium and electronic equipment
WO2023020268A1 (en) Gesture recognition method and apparatus, and device and medium
CN112880675B (en) Pose smoothing method and device for visual positioning, terminal and mobile robot
CN115086538B (en) Shooting position determining method, device, equipment and medium
US20240005552A1 (en) Target tracking method and apparatus, device, and medium
CN111383337B (en) Method and device for identifying objects
WO2022194157A1 (en) Target tracking method and apparatus, device and medium
CN116563817B (en) Obstacle information generation method, obstacle information generation device, electronic device, and computer-readable medium
WO2023284479A1 (en) Plane estimation method and apparatus, electronic device, and storage medium

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 22770511

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 22770511

Country of ref document: EP

Kind code of ref document: A1