CN112990174A - Visual tracking platform and method for multi-application scene - Google Patents

Visual tracking platform and method for multi-application scene

Info

Publication number
CN112990174A
Authority
CN
China
Prior art keywords
camera
visual tracking
module
application scene
hardware
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202110171894.0A
Other languages
Chinese (zh)
Inventor
刘婷
陈图川
刘斐
吴炘翌
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Jiaxing Jupiter Robot Technology Co ltd
Original Assignee
Jiaxing Jupiter Robot Technology Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Jiaxing Jupiter Robot Technology Co ltd
Priority to CN202110171894.0A
Publication of CN112990174A
Legal status: Pending


Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/10Image acquisition
    • G06V10/12Details of acquisition arrangements; Constructional details thereof
    • G06V10/14Optical characteristics of the device performing the acquisition or on the illumination arrangements
    • G06V10/147Details of sensors, e.g. sensor lenses

Abstract

The invention discloses a visual tracking platform and a visual tracking method for multiple application scenes. The method comprises the following steps: step S1: setting parameters according to the actual application scene; step S2: setting camera parameters; step S3: acquiring information with the camera; step S4: optimizing software and hardware; step S5: performing resource integration; step S6: adjusting the camera. In step S2, the camera parameters that are set include the frame rate and the resolution. The invention can switch and debug applications across complex scenes and perform different detection tasks on the same system platform, which reduces hardware cost and development difficulty; the user only needs to debug and design for the scene currently required, which improves working efficiency. The dual-stream asynchronous algorithm of the invention makes successive computations independent of one another and starts a new processing flow for each one, so it can fully utilize the parallel capability of the computing hardware, reduces the interdependence between earlier and later data processing in the time domain, and achieves maximum reuse of the hardware modules.

Description

Visual tracking platform and method for multi-application scene
Technical Field
The present invention relates to a visual tracking system, and more particularly, to a visual tracking platform and method for multiple application scenes.
Background
Vision is one of the important ways in which humans perceive the world; about 80% of the external information humans acquire comes from the visual system. Computer vision uses an imaging system in place of the human visual organs and a computer in place of the human brain to process and understand input images, building on what is known about human vision. With the development of information technology and intelligent science, computer vision has become one of the active topics in the field of artificial intelligence and one of the key technologies of the perception layer of the Internet of Things.
Visual tracking technology, one of the active topics in the field of computer vision, performs moving-target detection, feature extraction, classification and recognition, tracking filtering, and behavior recognition on a continuous image sequence to obtain accurate motion parameters of a target (such as position and speed), processes and analyzes those parameters accordingly, and thereby achieves behavior understanding of the target.
Visual tracking refers to detecting, extracting, identifying, and tracking a moving target in an image sequence to obtain its motion parameters, such as position, speed, acceleration, and motion trajectory, so that further processing and analysis can realize behavior understanding of the moving target and complete higher-level detection tasks.
Existing image processing technology is already mature for object recognition, face detection, face recognition, and object tracking. However, current artificial intelligence systems can perform visual recognition and tracking only for a single situation, and it is not easy to switch and debug applications across complex scenes. For example, a given setup may be designed for face recognition while the actual application scene requires detecting people who are not wearing masks; different detections then require different system platforms, which increases hardware cost and development difficulty, and the user must carry out debugging and design for each scene separately, which reduces working efficiency.
Disclosure of Invention
The invention aims to overcome the defects of the prior art and provides a visual tracking platform and a visual tracking method for a multi-application scene.
In order to solve the technical problems, the invention provides the following technical scheme:
the invention discloses a visual tracking method of a multi-application scene, which comprises the following steps:
step S1: setting parameters according to an actual application scene;
step S2: setting camera parameters;
step S3: a camera acquires information;
step S4: optimizing software and hardware;
step S5: performing resource integration;
step S6: the camera is adjusted.
As a preferred embodiment of the present invention, in step S2, the camera parameters that are set include the frame rate and the resolution.
As a preferred embodiment of the present invention, in step S5,
the resource integration content includes OpenCV and YoLo algorithms.
As a preferred embodiment of the present invention, in step S6,
the adjustments made to the camera include adjustments to a camera carrier.
As a preferred embodiment of the present invention, in steps S1 to S6, the camera includes an RGB-D camera.
As a preferred technical solution of the present invention, step S6 further includes a dual-stream asynchronous algorithm comprising the following steps:
step SS1: reading data and computing on it simultaneously;
step SS2: performing asynchronous computation;
step SS3: selecting a message node;
step SS4: performing priority matching;
step SS5: performing continuous processing.
The invention also provides a visual tracking platform for multi-application scenes, which comprises the camera together with an OpenCV module, a YOLO algorithm module, a gazebo module, an rviz module, a main control system module, a mobile chassis module, a laser radar module, and a mechanical arm.
The invention has the following beneficial effects: the invention can switch and debug applications across complex scenes and perform different detection tasks on the same system platform, reducing hardware cost and difficulty; the user only needs to debug and design for the scene currently required, which improves working efficiency; the dual-stream asynchronous algorithm of the invention makes successive computations independent of one another in the time dimension, starts a new processing flow for each one, can fully utilize the parallel capability of the computing hardware, and reduces the interdependence between earlier and later data processing in the time domain; maximum reuse of the hardware modules is realized; the development difficulty of existing visual processing can be overcome. Resources are integrated through resource packages such as OpenCV and YOLO, a main control system is carried on the system platform, physical systems such as the mobile chassis, the laser radar, and the mechanical arm are written into the virtual control of gazebo and rviz, and the user can decide whether to apply the visual tracking technology to a wider range of combinations.
Drawings
The accompanying drawings, which are included to provide a further understanding of the invention and are incorporated in and constitute a part of this specification, illustrate embodiments of the invention and together with the description serve to explain the principles of the invention and not to limit the invention. In the drawings:
FIG. 1 is one of the flow diagrams of the present invention;
FIG. 2 is a second schematic flow chart of the present invention.
Detailed Description
The preferred embodiments of the present invention will be described below in conjunction with the accompanying drawings; it should be understood that they are presented here for illustration and explanation only and are not intended to limit the invention.
Examples
As shown in fig. 1-2, the present invention provides a visual tracking method for multiple application scenes, comprising the following steps:
Step S1: setting parameters according to the actual application scene. First, the corresponding design scheme is understood from the product data, and design analysis is then carried out for the actual application scene; different libraries are called to solve problems under different conditions. Step S2: setting camera parameters; in step S2, the camera parameters that are set include the frame rate and the resolution. Step S3: the camera acquires information. Step S4: optimizing software and hardware. Step S5: performing resource integration; in step S5, the resource integration content includes the OpenCV and YOLO algorithm libraries. Step S6: adjusting the camera.
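As an illustration of step S2, the following is a minimal sketch of setting the frame rate and resolution through OpenCV's VideoCapture interface. The device index and the particular values are assumptions chosen for illustration; they are not specified in this disclosure.

```python
import cv2

def configure_camera(device_index=0, fps=30, width=1280, height=720):
    """Step S2 sketch: open a camera and request a frame rate and resolution."""
    cap = cv2.VideoCapture(device_index)
    if not cap.isOpened():
        raise RuntimeError("camera %d could not be opened" % device_index)
    # Drivers may clamp requested values to the nearest supported mode,
    # so read the properties back to verify what was actually applied.
    cap.set(cv2.CAP_PROP_FPS, fps)
    cap.set(cv2.CAP_PROP_FRAME_WIDTH, width)
    cap.set(cv2.CAP_PROP_FRAME_HEIGHT, height)
    print("fps=%s, width=%s, height=%s" % (
        cap.get(cv2.CAP_PROP_FPS),
        cap.get(cv2.CAP_PROP_FRAME_WIDTH),
        cap.get(cv2.CAP_PROP_FRAME_HEIGHT)))
    return cap
```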
In step S6, the adjustment performed on the camera includes adjusting the camera's carrying device. In this embodiment, the carrying device is a servo (steering gear), and the adjustment range is 180 degrees in the horizontal direction and 90 degrees in the pitch direction.
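A minimal sketch of this pan-tilt adjustment follows, assuming a hypothetical servo driver with a set_angle(channel, degrees) method; the disclosure does not name a driver interface, so both the class and the channel assignment below are illustrative.

```python
class PanTiltServo:
    """Hypothetical pan-tilt servo driver; replace set_angle() with the
    real bus command (PWM, serial, etc.) for the actual hardware."""
    def set_angle(self, channel, degrees):
        print("servo channel %d -> %.1f deg" % (channel, degrees))

def point_camera(servo, pan_deg, tilt_deg):
    # Clamp commands to the ranges stated in this embodiment:
    # 180 degrees horizontally and 90 degrees in pitch.
    pan = max(0.0, min(180.0, pan_deg))
    tilt = max(0.0, min(90.0, tilt_deg))
    servo.set_angle(channel=0, degrees=pan)   # channel 0: pan (assumed)
    servo.set_angle(channel=1, degrees=tilt)  # channel 1: tilt (assumed)
```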
In steps S1 through S6, the camera includes an RGB-D camera.
Step S6 further includes a dual-stream asynchronous algorithm comprising the following steps: step SS1: reading data and computing on it simultaneously; step SS2: performing asynchronous computation; step SS3: selecting a message node; step SS4: performing priority matching; step SS5: performing continuous processing.
and (3) a double-flow asynchronous real-time reasoning calculation framework. Under the condition of limited computational performance, in order to solve the problems of synchronous blocking, multi-model data redundancy and high model switching overhead possibly existing in the inference neural network operation, the multi-core parallel capability of the computation hardware is fully excavated, a double-flow asynchronous real-time inference computation architecture is designed, and a polymorphic model is supported to realize real-time inference. The designed double-flow structure separates data reading from a data calculation thread, avoids the influence of data reading delay and invalidation on subsequent processing, and also isolates data accumulation and data loss caused by data processing delay; the asynchronous calculation framework realizes the independence of continuous calculation processing in the time dimension, starts a new processing flow for each processing, can fully utilize the parallel capability of calculation hardware, and reduces the mutual restriction of data processing before and after time domain.
A multi-application-scene visual tracking platform comprises the camera described above together with an OpenCV module, a YOLO algorithm module, a gazebo module, an rviz module, a main control system module, a mobile chassis module, a lidar module, and a robotic arm.
Further, the method is simple and convenient to operate.
Specifically, the existing data parameters are first acquired by the RGB-D camera, and the image acquisition range is adjusted by the servo. The development direction is then determined from the existing scene, and the resources carried by the system platform, such as object recognition, face detection, face recognition, and object tracking, are selected and combined. The software and hardware of the whole system architecture are optimized. The image information is then acquired through the ROS system, the acquired information samples are reprocessed, and the existing resources undergo secondary development and are called according to actual requirements.
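A minimal sketch of acquiring image information through ROS is given below, assuming a ROS 1 node using cv_bridge. The topic name /camera/rgb/image_raw is a common default for RGB-D drivers and, like the detect() hook, is an assumption for illustration rather than something specified in this disclosure.

```python
import rospy
from cv_bridge import CvBridge
from sensor_msgs.msg import Image

bridge = CvBridge()

def image_callback(msg):
    # Convert the ROS image message to an OpenCV BGR array, then hand it
    # to whatever detector the current scene calls for (step S1 decides
    # this; e.g. a YOLO model for mask detection).
    frame = bridge.imgmsg_to_cv2(msg, desired_encoding="bgr8")
    # detect(frame)  # scene-specific detector, selected in step S1

rospy.init_node("visual_tracking_acquisition")
# queue_size=1 keeps only the newest image, matching the dual-stream idea.
rospy.Subscriber("/camera/rgb/image_raw", Image, image_callback, queue_size=1)
rospy.spin()
```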
The invention can switch and debug applications across complex scenes and perform different detection tasks on the same system platform, reducing hardware cost and difficulty; the user only needs to debug and design for the scene currently required, which improves working efficiency; the dual-stream asynchronous algorithm of the invention makes successive computations independent of one another in the time dimension, starts a new processing flow for each one, can fully utilize the parallel capability of the computing hardware, and reduces the interdependence between earlier and later data processing in the time domain; maximum reuse of the hardware modules is achieved.
The development difficulty of existing visual processing can thus be overcome. Resources are integrated through resource packages such as OpenCV and YOLO, a main control system is carried on the system platform, physical systems such as the mobile chassis, the laser radar, and the mechanical arm are written into the virtual control of gazebo and rviz, and the user can decide whether to apply the visual tracking technology to a wider range of combinations.
Finally, it should be noted that: although the present invention has been described in detail with reference to the foregoing embodiments, it will be apparent to those skilled in the art that changes may be made in the embodiments and/or equivalents thereof without departing from the spirit and scope of the invention. Any modification, equivalent replacement, or improvement made within the spirit and principle of the present invention should be included in the protection scope of the present invention.

Claims (7)

1. A visual tracking method for a multi-application scene is characterized by comprising the following steps:
step S1: setting parameters according to an actual application scene;
step S2: setting camera parameters;
step S3: a camera acquires information;
step S4: optimizing software and hardware;
step S5: performing resource integration;
step S6: the camera is adjusted.
2. The visual tracking method for a multi-application scene as claimed in claim 1, wherein the camera parameters set in step S2 include the frame rate and the resolution.
3. The visual tracking method for a multi-application scene as claimed in claim 1, wherein in step S5 the resource integration content includes the OpenCV and YOLO algorithms.
4. The visual tracking method for a multi-application scene as claimed in claim 1, wherein in step S6 the adjustments made to the camera include adjusting the camera's carrying device.
5. The visual tracking method for a multi-application scene as claimed in claim 1, wherein in steps S1 through S6 the camera comprises an RGB-D camera.
6. The visual tracking method for a multi-application scene as claimed in claim 1, wherein step S6 further comprises a dual-stream asynchronous algorithm comprising the following steps:
step SS1: reading data and computing on it simultaneously;
step SS2: performing asynchronous computation;
step SS3: selecting a message node;
step SS4: performing priority matching;
step SS5: performing continuous processing.
7. A visual tracking platform for the multi-application scene according to claim 1, characterized by comprising the camera as described in claims 1-6, together with an OpenCV module, a YOLO algorithm module, a gazebo module, an rviz module, a main control system module, a mobile chassis module, a lidar module, and a robotic arm.
CN202110171894.0A 2021-02-04 2021-02-04 Visual tracking platform and method for multi-application scene Pending CN112990174A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202110171894.0A 2021-02-04 2021-02-04 Visual tracking platform and method for multi-application scene

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202110171894.0A 2021-02-04 2021-02-04 Visual tracking platform and method for multi-application scene

Publications (1)

Publication Number Publication Date
CN112990174A 2021-06-18

Family

ID=76347446

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202110171894.0A Visual tracking platform and method for multi-application scene 2021-02-04 2021-02-04

Country Status (1)

Country Link
CN (1) CN112990174A (en)


Patent Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2008278515A (en) * 2008-06-02 2008-11-13 Hitachi Ltd Image processing camera system and image processing camera control method
US20130050517A1 (en) * 2011-08-26 2013-02-28 International Business Machines Corporation Visual content-aware automatic camera adjustment
US20170161579A1 (en) * 2014-09-30 2017-06-08 Qualcomm Incorporated Single-processor computer vision hardware control and application execution
CN108733368A (en) * 2017-05-16 2018-11-02 研祥智能科技股份有限公司 Machine vision general software development system
CN108274466A (en) * 2018-01-17 2018-07-13 广州威沃电子有限公司 Intelligent vision flexible wires system based on industrial robot
CN211906310U (en) * 2018-10-19 2020-11-10 成都信息工程大学 Zynq-based machine vision detection system
CN109922247A (en) * 2019-04-17 2019-06-21 浙江禾川科技股份有限公司 A kind of smart camera and a kind of image processing method
CN111698418A (en) * 2020-04-17 2020-09-22 广州市讯思视控科技有限公司 Industrial intelligent camera system based on deep learning configuration cloud platform

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
Tu Dawei et al.: "Research on the Vision Architecture of Intelligent Robots", Robot *
Jin Guiyin et al.: "Intelligent Application of Vision Recognition Robots in Logistics Operations", Manufacturing Automation *

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114911537A (en) * 2022-05-10 2022-08-16 声呐天空资讯顾问有限公司 Parameter configuration method and system, and storage medium

Similar Documents

Publication Title
WO2018065158A1 (en) Computer device for training a deep neural network
CN109740590B (en) ROI accurate extraction method and system based on target tracking assistance
US20220101640A1 (en) Method of processing information from an event-based sensor
WO2023040146A1 (en) Behavior recognition method and apparatus based on image fusion, and electronic device and medium
CN112016461A (en) Multi-target behavior identification method and system
CN112037142B (en) Image denoising method, device, computer and readable storage medium
WO2024001123A1 (en) Image recognition method and apparatus based on neural network model, and terminal device
US20210397860A1 (en) Object detection for event cameras
CN112990174A (en) Visual tracking platform and method for multi-application scene
CN113065568A (en) Target detection, attribute identification and tracking method and system
Kyrkou C³Net: end-to-end deep learning for efficient real-time visual active camera control
CN113095199B (en) High-speed pedestrian identification method and device
CN114359554A (en) Image semantic segmentation method based on multi-receptive-field context semantic information
Duan et al. A more accurate mask detection algorithm based on Nao robot platform and YOLOv7
CN111382705A (en) Reverse behavior detection method and device, electronic equipment and readable storage medium
Liu et al. Abnormal behavior analysis strategy of bus drivers based on deep learning
CN115984124A (en) Method and device for de-noising and super-resolution of neuromorphic pulse signals
Schraml et al. A real-time pedestrian classification method for event-based dynamic stereo vision
CN112019723B (en) Big data target monitoring method and system of block chain
Lyu et al. Roadnet-v2: A 10 ms road segmentation using spatial sequence layer
Li et al. Improved edge lightweight YOLOv4 and its application in on-site power system work
Xia et al. Unsupervised Optical Flow Estimation with Dynamic Timing Representation for Spike Camera
CN117079079B (en) Training method of video anomaly detection model, video anomaly detection method and system
CN117789255A (en) Pedestrian abnormal behavior video identification method based on attitude estimation
CN116721124A (en) Method and device for three-dimensional object detection by means of a road-side sensor unit

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication (application publication date: 20210618)