CN116506585A - Multi-target tracking and displaying method in panoramic video - Google Patents

Multi-target tracking and displaying method in panoramic video Download PDF

Info

Publication number
CN116506585A
CN116506585A CN202310294768.3A CN202310294768A CN116506585A CN 116506585 A CN116506585 A CN 116506585A CN 202310294768 A CN202310294768 A CN 202310294768A CN 116506585 A CN116506585 A CN 116506585A
Authority
CN
China
Prior art keywords
panoramic
target
targets
tracking
panoramic video
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202310294768.3A
Other languages
Chinese (zh)
Inventor
林树森
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Pi Technology Changzhou Co ltd
Original Assignee
Pi Technology Changzhou Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Pi Technology Changzhou Co ltd filed Critical Pi Technology Changzhou Co ltd
Priority to CN202310294768.3A priority Critical patent/CN116506585A/en
Publication of CN116506585A publication Critical patent/CN116506585A/en
Pending legal-status Critical Current

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N13/00Stereoscopic video systems; Multi-view video systems; Details thereof
    • H04N13/10Processing, recording or transmission of stereoscopic or multi-view image signals
    • H04N13/106Processing image signals
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00Image analysis
    • G06T7/20Analysis of motion
    • G06T7/246Analysis of motion using feature-based methods, e.g. the tracking of corners or segments
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00Image analysis
    • G06T7/60Analysis of geometric attributes
    • G06T7/62Analysis of geometric attributes of area, perimeter, diameter or volume
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/40Extraction of image or video features
    • G06V10/62Extraction of image or video features relating to a temporal dimension, e.g. time-based feature extraction; Pattern tracking
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V20/00Scenes; Scene-specific elements
    • G06V20/40Scenes; Scene-specific elements in video content
    • G06V20/41Higher-level, semantic clustering, classification or understanding of video scenes, e.g. detection, labelling or Markovian modelling of sport events or news items
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V20/00Scenes; Scene-specific elements
    • G06V20/40Scenes; Scene-specific elements in video content
    • G06V20/46Extracting features or characteristics from the video content, e.g. video fingerprints, representative shots or key frames
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N13/00Stereoscopic video systems; Multi-view video systems; Details thereof
    • H04N13/10Processing, recording or transmission of stereoscopic or multi-view image signals
    • H04N13/106Processing image signals
    • H04N13/111Transformation of image signals corresponding to virtual viewpoints, e.g. spatial image interpolation
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N13/00Stereoscopic video systems; Multi-view video systems; Details thereof
    • H04N13/10Processing, recording or transmission of stereoscopic or multi-view image signals
    • H04N13/106Processing image signals
    • H04N13/156Mixing image signals
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/21Server components or server architectures
    • H04N21/218Source of audio or video content, e.g. local disk arrays
    • H04N21/21805Source of audio or video content, e.g. local disk arrays enabling multiple viewpoints, e.g. using a plurality of cameras
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/21Server components or server architectures
    • H04N21/218Source of audio or video content, e.g. local disk arrays
    • H04N21/2187Live feed
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/30Subject of image; Context of image processing
    • G06T2207/30241Trajectory
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V2201/00Indexing scheme relating to image or video recognition or understanding
    • G06V2201/07Target detection

Abstract

The invention provides a multi-target tracking and displaying method in panoramic video, which comprises the following steps: acquiring panoramic video data of panoramic equipment to obtain a panoramic picture; detecting and identifying targets in the panoramic picture, determining N targets according to detection and identification results, and obtaining N target surrounding areas; drawing target rendering pictures by taking the target surrounding area as a center to obtain N target rendering pictures; and combining and splicing the N target rendering pictures, and displaying the N target rendering pictures in a display screen of the panoramic equipment according to a set arrangement mode. The panoramic equipment automatically tracks a plurality of targets and simultaneously displays a plurality of target pictures in the display screen.

Description

Multi-target tracking and displaying method in panoramic video
Technical Field
The invention relates to the technical field of image processing, in particular to a multi-target tracking and displaying method in panoramic video.
Background
Panoramic video is shooting through panoramic camera equipment, and contains 360 degrees all-round picture content's dynamic video, and panoramic camera equipment usually contains one to many fish-eye cameras, carries out panorama concatenation through the multi-angle still picture that shoots each fish-eye camera, forms 360 degrees panorama pictures, and then turns into dynamic video image with static panorama picture for the user can watch the dynamic video of shooting on panoramic camera equipment with arbitrary angle.
At present, when a panoramic camera browses panoramic videos, a plurality of targets cannot be automatically tracked, and effective picture presentation is carried out for a user to watch and browse, so that when the user watches the panoramic videos, the user needs to manually switch different visual angles to watch scenes of the plurality of targets, and experience of the user when watching the panoramic videos is poor.
Or when the panoramic camera equipment is used for carrying out network panoramic live broadcast, such as carrying out panoramic live broadcast conference, panoramic live broadcast event and the like, the panoramic camera equipment cannot automatically identify and track a plurality of targets, such as characters, and a user can not watch real-time picture contents of the plurality of characters as a live broadcast viewer at the same time, so that the experience of the user is poor when watching the panoramic live broadcast.
Disclosure of Invention
The embodiment of the invention provides a multi-target tracking and displaying method in panoramic video, which aims to solve the technical problems that aiming at the defects in the prior art, at present, when a panoramic camera browses panoramic video or initiates live broadcast, a plurality of character targets cannot be automatically tracked, and effective picture presentation is carried out for a user to watch and browse, so that when the user watches panoramic video, the user needs to manually switch different viewing angles to watch scenes of the plurality of character targets, and the experience of the user is poor when watching panoramic video or watching panoramic live broadcast.
In order to solve the technical problems, the invention adopts a technical scheme that: the multi-target tracking and displaying method in the panoramic video comprises the following steps:
s101, acquiring panoramic video data of panoramic equipment to obtain a panoramic picture;
s102, detecting and identifying targets in the panoramic picture, determining N targets according to detection and identification results, and obtaining N target surrounding areas;
s103, drawing target rendering pictures by taking the target surrounding area as a center, and obtaining N target rendering pictures;
and S104, combining and splicing the N target rendering pictures, and displaying the N target rendering pictures in a display screen of the panoramic equipment according to a set arrangement mode.
In a specific embodiment, the detection and identification result includes: and correspondingly, determining N targets according to the detection and identification results comprises the following steps:
calculating the area occupied by each target in the panoramic picture;
sorting all the targets according to the areas;
and determining N targets according to the sequencing result.
In a specific embodiment, the determining N targets according to the sorting result includes: and selecting N targets according to the sorting result from large area to small area, wherein N is an integer and is larger than 1.
In a specific embodiment, the detection and identification result includes: and correspondingly, determining N targets according to the detection and identification results comprises the following steps:
generating thumbnail views of all the targets;
receiving N target instructions selected by a user according to the thumbnail view of the target;
and determining N targets according to the N target instructions.
In a specific embodiment, in step S104, when the N is 2, the setting arrangement manner is vertically symmetrical or laterally symmetrical, and the setting arrangement manner is evenly distributed in the display screen of the panoramic apparatus.
In a specific embodiment, in step S104, when N is 3, the setting arrangement mode is that the target with the largest occupied area in the panoramic picture and the other two targets are symmetrically distributed left and right or vertically, and the other two targets are symmetrically distributed left and right or vertically.
In a specific embodiment, in step S104, when the N is 4, the setting arrangement mode is that the 4 targets are distributed in the display screen of the panoramic apparatus in a vertically and laterally symmetric manner.
In a specific embodiment, the step S104 further includes encoding the rendered frames assembled and spliced according to the set arrangement manner into video data, and transmitting the video data to the streaming media server in real time.
In one embodiment, there is also included a computer device comprising:
one or more processors;
a memory;
and one or more computer programs, the processor and the memory being connected by a bus, wherein the one or more computer programs are stored in the memory and configured to be executed by the one or more processors, the processor implementing the steps of the multi-objective tracking and display method in panoramic video described above when the computer programs are executed.
In a specific embodiment, the panoramic camera further comprises:
one or more processors;
a memory;
and one or more computer programs, the processor and the memory being connected by a bus, wherein the one or more computer programs are stored in the memory and configured to be executed by the one or more processors, characterized in that the steps of the multi-objective tracking and display method thereof in panoramic video described above are implemented when the processor executes the computer programs.
The beneficial effects of the invention are as follows: in contrast to the situation of the prior art, the multi-target tracking and displaying method in a panoramic video provided by the embodiment of the invention includes the following steps:
s101, acquiring panoramic video data of panoramic equipment to obtain a panoramic picture;
s102, detecting and identifying targets in the panoramic picture, determining N targets according to detection and identification results, and obtaining N target surrounding areas;
s103, drawing target rendering pictures by taking the target surrounding area as a center, and obtaining N target rendering pictures;
and S104, combining and splicing the N target rendering pictures, and displaying the N target rendering pictures in a display screen of the panoramic equipment according to a set arrangement mode.
From the above technical solutions, the embodiment of the present invention has the following advantages:
according to the multi-target tracking and displaying method in the panoramic video, the panoramic video is unfolded into the panoramic picture, N targets in the panoramic picture are identified and tracked, N target surrounding areas are obtained, N target rendering pictures are obtained by drawing with each target surrounding area as the center, the N target rendering pictures are combined and spliced and are arranged and displayed in a display screen according to a set arrangement mode, so that a user can automatically and simultaneously watch pictures of the multiple targets without manually adjusting the view angles of the panoramic picture, and the view angles of pictures of different targets are switched.
Drawings
In order to more clearly illustrate the technical solutions of the embodiments of the present invention, the drawings that are needed in the description of the embodiments will be briefly described below, it being obvious that the drawings in the following description are only some embodiments of the present invention, and that other drawings may be obtained according to these drawings without inventive effort for a person skilled in the art.
Fig. 1 is a schematic flow chart of an implementation of a multi-target tracking and displaying method in panoramic video according to an embodiment of the present invention;
FIG. 2 is a schematic diagram of a configuration arrangement when N is 2 according to an embodiment of the present invention;
FIG. 3 is a schematic diagram of a configuration arrangement when N is 3 according to an embodiment of the present invention;
FIG. 4 is a schematic diagram of a configuration arrangement when N is 4 according to an embodiment of the present invention;
fig. 5 is a schematic structural diagram of a computer device according to a second embodiment of the present invention;
fig. 6 is a block diagram of a panoramic camera according to a third embodiment of the present invention.
Reference numerals:
10. a display screen; 20. a computer device; 30. panoramic cameras.
Detailed Description
The present application is described in further detail below with reference to the drawings and the embodiments. It is specifically noted that the following embodiments are merely for illustrating the present application, but do not limit the scope of the present application. Likewise, the following embodiments are only some, but not all, of the embodiments of the present application, and all other embodiments obtained by one of ordinary skill in the art without inventive effort are within the scope of the present application.
The terms "comprising" and "having" and any variations thereof herein are intended to cover a non-exclusive inclusion. A process, method, system, article, or apparatus that comprises a list of steps or elements is not limited to only those listed but may optionally include other steps or elements not listed or inherent to such process, method, article, or apparatus.
The above terms are merely for convenience of description and should not be construed as limiting the present technical solution.
Reference herein to "an embodiment" means that a particular feature, structure, or characteristic described in connection with the embodiment may be included in at least one embodiment of the present application. The appearances of such phrases in various places in the specification are not necessarily all referring to the same embodiment, nor are separate or alternative embodiments mutually exclusive of other embodiments. Those skilled in the art will explicitly and implicitly understand that the embodiments described herein may be combined with other embodiments.
The following describes in detail the implementation of the present invention in connection with specific embodiments:
embodiment one:
fig. 1 shows a realization flow of a multi-target tracking and displaying method in panoramic video according to an embodiment of the present invention, and for convenience of explanation, only the relevant parts of the embodiment of the present invention are shown, which is described in detail below:
in step S101, panoramic video data of a panoramic apparatus is acquired, and a panoramic picture is obtained;
the embodiment of the invention is suitable for a panoramic video processing system or platform, and is used for acquiring and processing the panoramic video.
Specifically, when panoramic video data is acquired, panoramic cameras or multiple groups of cameras are generally adopted to carry out 360-degree panoramic shooting on surrounding scenes, panoramic stitching is carried out inside the surrounding scenes, the panoramic video data is obtained, and then panoramic pictures are obtained.
In step S102, detecting and identifying targets in the panoramic picture, determining N targets according to the detection and identification results, and obtaining N target surrounding areas;
in the embodiment of the present invention, the panoramic picture obtained in step S101 is detected, identified and tracked by an Al algorithm, where the algorithm includes, but is not limited to, at least one of the following: SSD algorithm (Single Shot MultiBox Detector), R-CNN series algorithm (Region-Convolutional Neural Networks), YOLO series algorithm (You Only Look Once) and the like in deep learning, and the identified target may be a dynamic object such as a person, an animal, a vehicle and the like, or a static object such as a tree, a flower, a plant, a building and the like. In the embodiment of the invention, the person is preferably selected as the target for detection, identification and tracking, and the user can select an object in the panoramic camera screen as the target for subsequent detection, identification. And detecting and identifying a tracking target through the panoramic picture, obtaining a detected and identified target person picture, calculating to obtain a surrounding area of the target, and obtaining a person surrounding rectangle when the selected target is a person, wherein the person surrounding rectangle is a rectangle taking the target person as the center and contains all detail characteristics of the target person, and obtaining a rectangular area picture through detection and calculation.
Further, the number of the detected and identified targets includes N, where the specific number of N depends on the limitations of the system, CPU, memory, etc. of the panoramic apparatus itself, typically 2-8, and the greater the number, the higher the requirements on the system, CPU, memory of the panoramic apparatus.
Further, the value of N may be default, or a value within the system setting range may be manually selected, for example, the upper limit of N is set to 8 by the system, if the user does not specify the value of N, the system defaults to select N to 8, and if the user specifies the value of N, any value less than or equal to 8 may be specified.
Further, the detection and identification result includes: and correspondingly, determining N targets according to the detection and identification results comprises the following steps: calculating the area occupied by each target in the panoramic picture; sorting all the targets according to the areas; and determining N targets according to the sequencing result.
Further, in one embodiment, when the N value is a default value of the system, N targets are selected according to the above-mentioned sequence of sorting the targets from large to small, where the N targets are target objects that the system needs to track and detect, N target surrounding areas are generated according to the N target objects, and when the target is a person, N person surrounding rectangles are generated.
In another embodiment, when the value of N is specified by the user, the detecting all objects in the panoramic picture includes: and generating thumbnail views of all the targets, receiving N target instructions selected by a user according to the thumbnail views of the targets, wherein the N targets are target objects which are required to be tracked and detected by a system, generating N target surrounding areas according to the N target objects, and generating N person surrounding rectangles when the targets are people.
In step S103, the target rendering screen is drawn with the target surrounding area as a center, and N target rendering screens are obtained.
In the embodiment of the invention, the target rendering picture is picture content which contains a target surrounding area and is actually watched by a user. The step S103 specifically includes calculating, by way of example and not by way of limitation, the position information of the center of the target surrounding area in the panoramic sphere according to the target surrounding area, and when the detection target is a person, the target surrounding area is a person surrounding rectangle, calculating by Al to obtain the position information of the center of the person surrounding rectangle in the panoramic sphere, setting the center of the person surrounding rectangle as a point 0, and setting the position information of the center point 0 of the person surrounding rectangle in the panoramic sphere as a position point a.
And drawing a target rendering picture according to the position information of the center of the person enclosing rectangle, so that the person enclosing rectangle is positioned at the center of the rendering picture, specifically, calculating and drawing a rectangular target rendering picture by taking the position point A as the center according to the position information of the center point 0 of the person enclosing rectangle in the panoramic sphere as the position point A, and enabling the position point A of the position information of the center point 0 of the person enclosing rectangle in the panoramic sphere to be always positioned at the center of the target rendering picture through Al tracking detection, so that a user can conveniently see that the person target is always positioned at the most middle position of the picture, and the live view angle of the person target is not required to be manually adjusted.
Further, the method also comprises the step of adjusting the field angle of the target rendering picture so that the target surrounding area occupies a set proportion of the rendering picture. Specifically, when the target is detected as a person, the target surrounding area is a person surrounding rectangle, the target person may move at any time when panoramic shooting or live broadcasting is performed by using a panoramic camera device, or the panoramic camera may move with the movement of the target person, and the target person and the panoramic camera may move at a certain time, which results in a change in the person surrounding rectangle at the moment of the angle of view of the rendered image, and the size of the person surrounding rectangle in the target rendered image may be always changed although the person surrounding rectangle is always located in the center of the rendered image, so that the angle of view of the target rendered image needs to be adjusted in real time, and the person surrounding rectangle may occupy a set proportion of the target rendered image.
Further, the adjusting the view angle of the target rendering picture specifically includes:
and detecting the proportion of the area of the target surrounding area in the panoramic sphere, and obtaining the size of the area actually mapped in the target rendering picture by the target surrounding area with corresponding size according to the proportion relation.
Calculating the actual proportion of the area mapping of the target surrounding area on the target rendering picture;
if the actual proportion is lower than the set proportion, amplifying the area of the target surrounding area mapped on the target rendering picture;
and if the actual proportion is higher than the set proportion, reducing the area of the target surrounding area mapped on the target rendering picture.
Further, the set proportion is 30-70%.
Further, the set proportion is 50%.
The set proportion user can be adjusted according to personal viewing angle preference so as to accord with viewing angle experience when the user watches live broadcast. When the set proportion is 50%, the person bounding rectangle always occupies half of the area of the person rendering picture, and at the moment, the viewing angle of the panoramic live broadcast is optimal for the user, and the user experience is best.
In step S104, the N target rendering frames are combined and spliced, and are displayed in a display screen of the panoramic apparatus according to a set arrangement mode.
In the embodiment of the present invention, when N is 2, the setting arrangement manner is vertically symmetrical or laterally symmetrical, and is evenly distributed in the display screen of the panoramic apparatus, as shown in fig. 2, in this case, two target rendering frames (including a target 1 rendering frame and a target 2 rendering frame) are vertically symmetrical, so that a user can watch video of two targets at the same time to the greatest extent, and as an alternative embodiment, the two target rendering frames may be arranged in the display screen of the panoramic apparatus or the terminal apparatus in a laterally symmetrical manner (not shown in the drawing).
Further, when N is 3, the arrangement mode is that the object with the largest occupied area in the panoramic picture and the other two objects are symmetrically distributed in a left-right or up-down manner, as shown in fig. 3. As an alternative embodiment, the 3 target rendering frames (including the target 1 rendering frame, the target 2 rendering frame and the target 3 rendering frame) may be displayed in an equally divided arrangement (not shown in the drawing), or be equally divided up and down, or be equally divided left and right, or be arranged and combined in a three equally divided arrangement that can be imagined by those skilled in the art without taking any inventive effort, which is within the scope of the claims of the present invention.
Further, when N is 4, the setting arrangement manner is that the 4 targets are symmetrically distributed in the display screen of the panoramic apparatus, as shown in fig. 4. As an alternative embodiment, the 4 target rendering frames (including the target 1 rendering frame, the target 2 rendering frame, the target 3 rendering frame, and the target 4 rendering frame) may be arranged in a four-equal arrangement manner in the up-down direction, or in a four-equal arrangement manner in the left-right direction, or in other arrangements and combinations of four-equal arrangements that can be imagined by those skilled in the art without performing any inventive work, which are all within the scope of the claims of the present invention.
Further, the step S104 further includes encoding the rendered frames assembled and spliced according to the set arrangement mode into video data, and transmitting the video data to the streaming media server in real time.
Specifically, the combined and spliced rendering pictures according to the set arrangement mode are encoded into video data, and the video data are transmitted to a streaming media server in real time, so that a user can watch panoramic live broadcast on a terminal device conveniently.
Embodiment two:
fig. 5 shows the structure of a computer device provided in the third embodiment of the present invention, and for convenience of explanation, only the portions relevant to the embodiment of the present invention are shown.
The computer device 20 of an embodiment of the invention comprises a processor 21, a memory 22 and a computer program 23 stored in said memory 22 and executable on said processor 21. The steps of the first embodiment described above, such as steps S101 to S104 shown in fig. 1, are implemented when the processor 21 executes the computer program 23.
Embodiment III:
in an embodiment of the present invention, there is provided a panoramic camera 30, as shown in fig. 6, the panoramic camera 30 includes one or more processors 31, and a memory 32, the memory 32 stores a computer program 33, and the computer program 33 implements the steps in the first embodiment described above when executed by the processor 31, for example, steps S101 to S104 shown in fig. 1.
In contrast to the situation of the prior art, the multi-target tracking and displaying method in a panoramic video provided by the embodiment of the invention includes the following steps:
s101, acquiring panoramic video data of panoramic equipment to obtain a panoramic picture;
s102, detecting and identifying targets in the panoramic picture, determining N targets according to detection and identification results, and obtaining N target surrounding areas;
s103, drawing target rendering pictures by taking the target surrounding area as a center, and obtaining N target rendering pictures;
and S104, combining and splicing the N target rendering pictures, and displaying the N target rendering pictures in a display screen of the panoramic equipment according to a set arrangement mode.
From the above technical solutions, the embodiment of the present invention has the following advantages:
according to the multi-target tracking and displaying method in the panoramic video, the panoramic video is unfolded into the panoramic picture, N targets in the panoramic picture are identified and tracked, N target surrounding areas are obtained, N target rendering pictures are obtained by drawing with each target surrounding area as the center, the N target rendering pictures are combined and spliced and are arranged and displayed in a display screen according to a set arrangement mode, so that a user can automatically and simultaneously watch pictures of the multiple targets without manually adjusting the view angles of the panoramic picture, and the view angles of pictures of different targets are switched.
It will be clear to those skilled in the art that, for convenience and brevity of description, specific working procedures of the above-described systems, apparatuses and units may refer to corresponding procedures in the foregoing method embodiments, which are not repeated herein.
In the several embodiments provided in this application, it should be understood that the disclosed systems, apparatuses, and methods may be implemented in other ways. For example, the apparatus embodiments described above are merely illustrative, e.g., the division of the units is merely a logical function division, and there may be additional divisions when actually implemented, e.g., multiple units or components may be combined or integrated into another system, or some features may be omitted or not performed. Alternatively, the coupling or direct coupling or communication connection shown or discussed with each other may be an indirect coupling or communication connection via some interfaces, devices or units, which may be in electrical, mechanical or other form.
The units described as separate units may or may not be physically separate, and units shown as units may or may not be physical units, may be located in one place, or may be distributed on a plurality of network units. Some or all of the units may be selected according to actual needs to achieve the purpose of the solution of this embodiment.
In addition, each functional unit in the embodiments of the present invention may be integrated in one processing unit, or each unit may exist alone physically, or two or more units may be integrated in one unit. The integrated units may be implemented in hardware or in software functional units.
The integrated units, if implemented in the form of software functional units and sold or used as stand-alone products, may be stored in a computer readable storage medium. Based on such understanding, the technical solution of the present invention may be embodied essentially or in part or all of the technical solution or in part in the form of a software product stored in a storage medium, including instructions for causing a computer device (which may be a personal computer, a server, or a network device, etc.) to perform all or part of the steps of the method according to the embodiments of the present invention. And the aforementioned storage medium includes: a U-disk, a removable hard disk, a Read-only memory (ROM), a random access memory (RAM, random Access Memory), a magnetic disk, or an optical disk, or other various media capable of storing program codes.
The above embodiments are only for illustrating the technical solution of the present invention, and not for limiting the same; although the invention has been described in detail with reference to the foregoing embodiments, it will be understood by those of ordinary skill in the art that: the technical scheme described in the foregoing embodiments can be modified or some technical features thereof can be replaced by equivalents; such modifications and substitutions do not depart from the spirit and scope of the technical solutions of the embodiments of the present invention.

Claims (10)

1. The multi-target tracking and displaying method in the panoramic video is characterized by comprising the following steps:
s101, acquiring panoramic video data of panoramic equipment to obtain a panoramic picture;
s102, detecting and identifying targets in the panoramic picture, determining N targets according to detection and identification results, and obtaining N target surrounding areas;
s103, drawing target rendering pictures by taking the target surrounding area as a center, and obtaining N target rendering pictures;
and S104, combining and splicing the N target rendering pictures, and displaying the N target rendering pictures in a display screen of the panoramic equipment according to a set arrangement mode.
2. The method for tracking and displaying multiple objects in panoramic video according to claim 1, wherein said detecting and identifying result comprises: and correspondingly, determining N targets according to the detection and identification results comprises the following steps:
calculating the area occupied by each target in the panoramic picture;
sorting all the targets according to the areas;
and determining N targets according to the sequencing result.
3. The method for tracking and displaying multiple objects in a panoramic video according to claim 2, wherein said determining N objects according to the ranking result comprises: and selecting N targets according to the sorting result from large area to small area, wherein N is an integer and is larger than 1.
4. The method for tracking and displaying multiple objects in panoramic video according to claim 2, wherein said detecting and identifying result comprises: and correspondingly, determining N targets according to the detection and identification results comprises the following steps:
generating thumbnail views of all the targets;
receiving N target instructions selected by a user according to the thumbnail view of the target;
and determining N targets according to the N target instructions.
5. The method for tracking and displaying multiple objects in a panoramic video according to any one of claims 1 to 4, wherein when N is 2, the arrangement is vertically symmetrical or laterally symmetrical, and the arrangement is evenly distributed in a display screen of the panoramic apparatus.
6. The method for tracking and displaying multiple objects in a panoramic video according to any one of claims 1 to 4, wherein when N is 3, the setting arrangement is that the object with the largest occupied area in the panoramic image is distributed symmetrically left and right or symmetrically up and down with the other two objects, and the other two objects are distributed symmetrically left and right or symmetrically up and down.
7. The method for tracking and displaying multiple objects in a panoramic video according to any one of claims 1 to 4, wherein when N is 4, the setting arrangement is that 4 objects are symmetrically distributed in the display screen of the panoramic device.
8. The method according to any one of claims 1 to 4, wherein the step S104 further comprises encoding the rendered frames assembled and spliced in a set arrangement into video data, and transmitting the video data to a streaming server in real time.
9. A computer device, comprising:
one or more processors;
a memory;
and one or more computer programs, the processor and the memory being connected by a bus, wherein the one or more computer programs are stored in the memory and configured to be executed by the one or more processors, characterized in that the processor, when executing the computer programs, implements the steps of the multi-object tracking and display method in panoramic video as claimed in any one of claims 1 to 8.
10. A panoramic camera, comprising:
one or more processors;
a memory;
and one or more computer programs, the processor and the memory being connected by a bus, wherein the one or more computer programs are stored in the memory and configured to be executed by the one or more processors, characterized in that the processor, when executing the computer programs, implements the steps of the multi-objective tracking in panoramic video and the display method thereof as claimed in any one of claims 1 to 8.
CN202310294768.3A 2023-03-23 2023-03-23 Multi-target tracking and displaying method in panoramic video Pending CN116506585A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202310294768.3A CN116506585A (en) 2023-03-23 2023-03-23 Multi-target tracking and displaying method in panoramic video

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202310294768.3A CN116506585A (en) 2023-03-23 2023-03-23 Multi-target tracking and displaying method in panoramic video

Publications (1)

Publication Number Publication Date
CN116506585A true CN116506585A (en) 2023-07-28

Family

ID=87327478

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202310294768.3A Pending CN116506585A (en) 2023-03-23 2023-03-23 Multi-target tracking and displaying method in panoramic video

Country Status (1)

Country Link
CN (1) CN116506585A (en)

Similar Documents

Publication Publication Date Title
US11653065B2 (en) Content based stream splitting of video data
US10630899B2 (en) Imaging system for immersive surveillance
CN108496368B (en) Dynamic video overlay
US11830161B2 (en) Dynamically cropping digital content for display in any aspect ratio
JP7142575B2 (en) Method and apparatus for synthesizing images
CN111263177A (en) Video interactive live broadcast method and system
SG191198A1 (en) Imaging system for immersive surveillance
US10764493B2 (en) Display method and electronic device
CN111937397A (en) Media data processing method and device
US20170225077A1 (en) Special video generation system for game play situation
CN110876079A (en) Video processing method, device and equipment
CN110730340B (en) Virtual audience display method, system and storage medium based on lens transformation
US11831853B2 (en) Information processing apparatus, information processing method, and storage medium
CN110933461B (en) Image processing method, device, system, network equipment, terminal and storage medium
US11622099B2 (en) Information-processing apparatus, method of processing information, and program
CN111371985A (en) Video playing method and device, electronic equipment and storage medium
CN112288877A (en) Video playing method and device, electronic equipment and storage medium
CN116208843A (en) Panoramic live broadcast method and device, readable storage medium and computer equipment
WO2018234622A1 (en) A method for detecting events-of-interest
CN112954443A (en) Panoramic video playing method and device, computer equipment and storage medium
CN116506585A (en) Multi-target tracking and displaying method in panoramic video
US10237614B2 (en) Content viewing verification system
CN113888257A (en) Article-based display method, device and program product
KR101773929B1 (en) System for processing video with wide viewing angle, methods for transmitting and displaying vide with wide viewing angle and computer programs for the same
US10674207B1 (en) Dynamic media placement in video feed

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination