WO2023066412A1 - Dynamic processing method and apparatus based on unmanned aerial vehicle video pyramid model - Google Patents

Dynamic processing method and apparatus based on unmanned aerial vehicle video pyramid model Download PDF

Info

Publication number
WO2023066412A1
WO2023066412A1 PCT/CN2022/136213 CN2022136213W WO2023066412A1 WO 2023066412 A1 WO2023066412 A1 WO 2023066412A1 CN 2022136213 W CN2022136213 W CN 2022136213W WO 2023066412 A1 WO2023066412 A1 WO 2023066412A1
Authority
WO
WIPO (PCT)
Prior art keywords
video
uav
pyramid model
resolution
uav video
Prior art date
Application number
PCT/CN2022/136213
Other languages
French (fr)
Chinese (zh)
Inventor
简洪登
范湘涛
杜小平
Original Assignee
中国科学院空天信息创新研究院
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 中国科学院空天信息创新研究院 filed Critical 中国科学院空天信息创新研究院
Publication of WO2023066412A1 publication Critical patent/WO2023066412A1/en

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/44Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream, rendering scenes according to MPEG-4 scene graphs
    • H04N21/44012Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream, rendering scenes according to MPEG-4 scene graphs involving rendering scenes according to scene graphs, e.g. MPEG-4 scene graphs
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T3/00Geometric image transformation in the plane of the image
    • G06T3/40Scaling the whole image or part thereof
    • G06T3/4023Decimation- or insertion-based scaling, e.g. pixel or line decimation
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T3/00Geometric image transformation in the plane of the image
    • G06T3/40Scaling the whole image or part thereof
    • G06T3/4092Image resolution transcoding, e.g. client/server architecture
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/23Processing of content or additional data; Elementary server operations; Server middleware
    • H04N21/234Processing of video elementary streams, e.g. splicing of video streams, manipulating MPEG-4 scene graphs
    • H04N21/23412Processing of video elementary streams, e.g. splicing of video streams, manipulating MPEG-4 scene graphs for generating or manipulating the scene composition of objects, e.g. MPEG-4 objects
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/23Processing of content or additional data; Elementary server operations; Server middleware
    • H04N21/234Processing of video elementary streams, e.g. splicing of video streams, manipulating MPEG-4 scene graphs
    • H04N21/2343Processing of video elementary streams, e.g. splicing of video streams, manipulating MPEG-4 scene graphs involving reformatting operations of video signals for distribution or compliance with end-user requests or end-user device requirements
    • H04N21/234363Processing of video elementary streams, e.g. splicing of video streams, manipulating MPEG-4 scene graphs involving reformatting operations of video signals for distribution or compliance with end-user requests or end-user device requirements by altering the spatial resolution, e.g. for clients with a lower screen resolution
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/23Processing of content or additional data; Elementary server operations; Server middleware
    • H04N21/234Processing of video elementary streams, e.g. splicing of video streams, manipulating MPEG-4 scene graphs
    • H04N21/2343Processing of video elementary streams, e.g. splicing of video streams, manipulating MPEG-4 scene graphs involving reformatting operations of video signals for distribution or compliance with end-user requests or end-user device requirements
    • H04N21/234381Processing of video elementary streams, e.g. splicing of video streams, manipulating MPEG-4 scene graphs involving reformatting operations of video signals for distribution or compliance with end-user requests or end-user device requirements by altering the temporal resolution, e.g. decreasing the frame rate by frame skipping

Definitions

  • the present disclosure relates to the field of real-time transmission and processing of video data, and in particular to a dynamic processing method, device, electronic equipment and storage medium based on a UAV video pyramid model.
  • a dynamic processing method based on a UAV video pyramid model including:
  • the UAV video pyramid model is constructed, wherein the UAV video pyramid model includes multiple levels, and the level of each UAV video pyramid model corresponds to a resolution;
  • the UAV video loading request includes the attribute information of the UAV video pyramid model and the video range within the field of view of the scene view camera;
  • the above preset classification rules are determined by formulas (1)-(3):
  • W is the original width pixel value of the UAV video
  • H is the original height pixel value of the UAV video
  • H i is the corresponding video height pixel value of the P i -level UAV video pyramid model
  • W i is the corresponding video width pixel value of the P i- level UAV video pyramid model
  • R is the actual resolution of the UAV image
  • is the vertical field of view of the drone
  • Height is the current flying height of the drone.
  • the attribute information of the UAV video pyramid model includes the level of the UAV video pyramid model and the resolution corresponding to the level;
  • the resolution of the UAV video pyramid model increases sequentially from the top to the bottom.
  • the above-mentioned video server performs resampling and segmentation processing on the UAV video in response to the UAV video loading request including:
  • the visual range coordinates of the scene view camera using the resolution and level of the UAV video, determine the quadtree segmentation range of the UAV video pyramid model where the outsourcing rectangle of the visual range coordinates is located;
  • the drone video is segmented through the video server, and the segmented drone video is sent to the client and/or browser for rendering.
  • the above-mentioned dynamic scheduling and rendering of drone video according to preset requirements includes:
  • part of the UAV video data is loaded according to the location information of the scene view camera.
  • the above-mentioned drone video pyramid model includes multiple resolutions of 360P, 720P, 1080P, 4K and 8K.
  • the minimum map tile unit of the current map is 256 ⁇ 256 or 512 ⁇ 512 pixels.
  • a dynamic processing device based on a UAV video pyramid model including:
  • the obtaining module is used to obtain the view point distance and field of view range of the scene view camera, query the level of the current map through the client, and obtain the current map according to the correspondence between the map level and map resolution and the view point distance of the scene view camera resolution;
  • the model construction module is used to construct the UAV video pyramid model on the video server according to the resolution of the current map, the UAV flight parameters and the preset classification rules, wherein the UAV video pyramid model includes multiple levels, each The level of the UAV video pyramid model corresponds to a resolution;
  • Calculation module for calculating the video range in the field of view range of the scene view camera according to the field of view range of the scene view camera and the video range of the unmanned aerial vehicle;
  • the request sending module is used to send the UAV video loading request to the video server through the client, wherein the UAV video loading request includes the attribute information of the UAV video pyramid model and the video range within the field of view of the scene view camera ;
  • the video dynamic processing module is used to receive the UAV video after the video server responds to the UAV video loading request through the client to resample and segment the UAV video, and to process the UAV video in the
  • the client performs dynamic scheduling and rendering, loading and display;
  • the video on-demand processing module is used to repeat the above steps when the viewpoint distance of the scene view camera changes, and resample, segment, dynamically schedule and render, load and display the UAV video according to preset requirements.
  • an electronic device including:
  • processors one or more processors
  • the one or more processors are made to execute the above multi-resolution UAV video dynamic processing method.
  • a computer-readable storage medium on which executable instructions are stored, and when the instructions are executed by a processor, the processor executes the above-mentioned multi-resolution drone video dynamic processing method.
  • the disclosure provides a multi-resolution UAV video dynamic processing method, which can dynamically load UAV video data similar to the current map resolution according to the viewpoint distance and field of view range of the current scene view camera to achieve multi-resolution High-performance on-demand rendering of UAV video ensures efficient scene display and smooth system operation.
  • Fig. 1 is the flowchart of the dynamic processing method based on unmanned aerial vehicle video pyramid model according to the embodiment of the present disclosure
  • Fig. 2 is a schematic structural diagram of a dynamic processing method based on a UAV video pyramid model according to an embodiment of the present disclosure
  • FIG. 3 is a schematic diagram of a general video format of a UAV video pyramid model according to an embodiment of the present disclosure
  • FIG. 4 is a schematic diagram of a 16:9 drone video format of a drone video pyramid model according to an embodiment of the present disclosure
  • FIG. 5 is a flow chart of resampling and segmenting drone video according to an embodiment of the disclosure
  • FIG. 6 is a schematic diagram of quadtree segmentation of a UAV video pyramid model of the same level according to an embodiment of the present disclosure
  • Fig. 7 is a schematic diagram of calculating a video range according to an embodiment of the disclosure.
  • FIG. 8 is a schematic structural diagram of a dynamic processing system based on a UAV video pyramid model according to an embodiment of the present disclosure
  • Fig. 9 schematically shows a block diagram of an electronic device adapted to implement a dynamic processing method based on a UAV video pyramid model according to an embodiment of the present disclosure.
  • FIG. 1 is a flowchart of a dynamic processing method based on a UAV video pyramid model according to an embodiment of the present disclosure.
  • the above dynamic processing method based on the UAV video pyramid model includes operation S110 to operation S160.
  • operation S110 obtain the view point distance and view range of the scene view camera, query the level of the current map through the client, and obtain the current map level according to the correspondence between the map level and the map resolution and the view point distance of the scene view camera. resolution.
  • the above-mentioned scene view camera can be a two-dimensional scene camera or a three-dimensional scene camera. Therefore, the above-mentioned UAV video dynamic processing method provided by the present disclosure can be applied to UAV video in a two-dimensional GIS scene Dynamic processing, also applicable to UAV video dynamic processing in 3D GIS scene
  • a UAV video pyramid model is constructed, wherein the UAV video pyramid model includes multiple levels, and each of the UAVs The level of the video pyramid model corresponds to a resolution.
  • the above-mentioned UAV video pyramid model can be constructed on the client or video server according to user requirements.
  • the UAV video pyramid model includes multiple levels (i.e. the levels of the pyramid model), and each level represents a video resolution; since the UAV video pyramid model can represent multiple resolutions, the above-mentioned unmanned aerial vehicles provided by this disclosure
  • the UAV video dynamic processing method is capable of processing UAV video at multiple resolutions.
  • the above-mentioned flight parameters of the UAV include the flight height of the UAV and the vertical field of view of the UAV.
  • a video range within the view range of the scene view camera is calculated according to the view range of the scene view camera and the drone video range.
  • the client sends a UAV video loading request to the video server, wherein the UAV video loading request includes the attribute information of the UAV video pyramid model and the video range within the field of view of the scene view camera.
  • the attribute information of the pyramid model of the UAV video includes the level of the pyramid model and the resolution corresponding to the level.
  • the video server receives the UAV video after resampling and segmenting the UAV video in response to the UAV video loading request through the client, and performs the processing of the UAV video on the client. Dynamic scheduling and rendering, loading and displaying.
  • the disclosure provides a multi-resolution UAV video dynamic processing method, which can dynamically load UAV video data similar to the current map resolution according to the viewpoint distance and field of view range of the current scene view camera to achieve multi-resolution High-performance on-demand rendering of UAV video ensures efficient scene display and smooth system operation.
  • Fig. 2 is a schematic structural diagram of a dynamic processing method based on a UAV video pyramid model according to an embodiment of the present disclosure.
  • the multi-resolution drone video dynamic processing method provided by the present disclosure will be further described in detail below in conjunction with FIG. 2 .
  • Figure 2 shows that the client/browser side runs the UAV image transmission system to load and display the UAV video
  • the server side stores the UAV video data, and the UAV video pyramid model construction rules and methods.
  • Use the client and/or browser to obtain the scene view camera viewpoint distance and field of view range, query the current map level, and obtain the current map resolution according to the currently commonly used map level and map resolution correspondence; according to the preset classification rules, no Calculate the level and resolution of the UAV video pyramid model based on the flight parameters of the man-machine and the resolution of the current map; and calculate the video range in the field of view according to the video range and field of view of the UAV; send a request to the video server to retrieve UAV video with the corresponding resolution; the server resamples and segments the UAV video according to the parameters in the UAV video request, such as grade, resolution, and video range, and returns the resampled UAV Video: Obtain the resampled UAV video in the 3D scene, load and display the UAV video
  • FIG. 2 is only schematic, and users can construct the UAV video pyramid model on the client side according to their own needs.
  • the above preset classification rules are determined by formulas (1)-(3):
  • W is the original width pixel value of the UAV video
  • H is the original height pixel value of the UAV video
  • H i is the corresponding video height pixel value of the P i -level UAV video pyramid model
  • W i is the corresponding video width pixel value of the P i- level UAV video pyramid model
  • R is the actual resolution of the UAV image
  • is the vertical field of view of the drone
  • Height is the current flying height of the drone.
  • the construction of the UAV video pyramid model is determined by formulas (1) to (3). From the above formulas, it can be concluded that the UAV video pyramid model provided by this disclosure has multiple levels and multiple resolutions, and can cover the video field Various commonly used video resolutions. Optionally, the user can determine the UAV video pyramid model with other video resolutions according to the above formula provided in the present disclosure according to their own needs.
  • the above UAV video pyramid model includes multiple levels, and each level of the UAV video pyramid model corresponds to a resolution.
  • the attribute information of the UAV video pyramid model includes the level of the UAV video pyramid model and the resolution corresponding to the level.
  • the resolution of the UAV video pyramid model increases sequentially from the top to the bottom.
  • Fig. 3 is a schematic diagram of a general video format of a UAV video pyramid model according to an embodiment of the present disclosure.
  • FIG. 4 is a schematic diagram of a 16:9 drone video format of a drone video pyramid model according to an embodiment of the disclosure.
  • Fig. 3 schematically shows the resolution of the unmanned aerial vehicle video pyramid model and the pyramid model at all levels, wherein, the resolution of the first level image 310 of the unmanned aerial vehicle video pyramid model is 360P, namely 360*W/H;
  • the resolution of the second level image 320 of the UAV video pyramid model is 720P, i.e. 720*W/H;
  • the resolution of the third level image 330 of the UAV video pyramid model is 1080P, i.e. 1080*W/H;
  • the resolution of the sixth-level image 340 of the UAV video pyramid model is 4K, that is, 2160*W/H.
  • the above W/H represents the ratio of the width and height of the drone video, for example, the ratio of width to height can be 1:1 or 16:9, and those skilled in the art should understand that Figure 3 is only schematic , those skilled in the art can build N (N is a positive integer) levels (such as P 1 , P 2 , P 3 , P 4 , P 5 , P 6 ... P N ) UAV video pyramids according to actual needs Model.
  • this disclosure builds a multi-resolution UAV video pyramid step by step based on 360P video, which can cover 720P, 1080P, 4K, 8K and other commonly used video resolutions.
  • the correspondence between the multi-resolution UAV video pyramid level and the UAV video pixel height and width is shown in the above formulas (1) and (2).
  • Fig. 4 shows 16: 9 (W/H, video width/video height) the UAV video pyramid model of UAV video format and the resolution of each level of pyramid model, wherein, the UAV video pyramid model's first
  • the resolution of the first-level image 410 is 360P, i.e. 640 ⁇ 360
  • the resolution of the second-level image 420 of the UAV video pyramid model is 720P, i.e.
  • the UAV video width is determined by formula (4):
  • FIG. 3 and Fig. 4 schematically show the structural diagram of the UAV video pyramid model according to the embodiment of the present disclosure. From Fig. 3 and Fig. 4, it can be seen without doubt that the UAV video dynamic processing provided by the present disclosure method, capable of processing UAV videos with different resolutions, and has a wide range of application scenarios. It should be particularly noted that although Figures 3 and 4 are schematic diagrams of 3D GIS scenes, the above-mentioned UAV video dynamic processing provided by this disclosure The method is also applicable to two-dimensional GIS scene. In addition, combined with the above pyramid model building process, users can build UAV video pyramid models with multiple other resolutions.
  • FIG. 5 is a flow chart of resampling and segmenting drone video according to an embodiment of the disclosure.
  • the video server resampling and segmenting the UAV video in response to the UAV video loading request includes operation S510 to operation S530 .
  • the resolution and grade of the drone video are determined according to the loading request of the drone video.
  • the drone video is segmented by the video server, and the segmented drone video is sent to the client and/or the browser for rendering.
  • the above-mentioned dynamic scheduling and rendering of drone video according to preset requirements includes:
  • part of the UAV video data is loaded according to the position information of the scene view camera, wherein the UAV video has multiple resolutions.
  • the above-mentioned drone video pyramid model includes multiple resolutions of 360P, 720P, 1080P, 4K and 8K.
  • the minimum map tile unit of the current map is 256 ⁇ 256 or 512 ⁇ 512 pixels.
  • the above UAV video pyramid model provided by this disclosure samples and divides UAV videos step by step according to certain classification rules, taking into account the size of GIS map tiles, and can cover commonly used videos such as 720P, 1080P, 4K, 8K, etc. Resolution, it is possible to load multi-resolution UAV video on demand.
  • Fig. 6 is a schematic diagram of quadtree splitting of the UAV video pyramid model at the same level according to an embodiment of the present disclosure.
  • FIG. 7 is a schematic diagram of calculating a video range according to an embodiment of the disclosure.
  • the UAV video of the same level in the UAV video pyramid model it can be segmented according to the quadtree, as shown in Figures 6 and 7, which is convenient for small-scale video data to be loaded on demand, that is, when the scene When the field of view of the view camera is smaller than the current UAV image range, part of the video data can be loaded according to the scene camera position, thereby reducing the amount of video data and improving scene rendering efficiency.
  • Figures 6 and 7 schematically show a schematic view of displaying part of the UAV video or segmenting the UAV video according to the user's needs according to the present disclosure, wherein those skilled in the art can use Figures 6 and 7 according to the user's needs
  • the background of the drone is replaced by the drone video and/or the current map, which facilitates the dynamic processing of the drone video.
  • W i represents the width of the UAV video corresponding to the i-th level of the UAV video pyramid model, and W i /4 and W i /2 represent 1/4 and 1/2 of the UAV video
  • the width of the video; H i represents the height of the UAV video corresponding to the i-th level of the UAV video pyramid model, and H i /4 and H i /2 represent the video of 1/4 and 1/2 of the UAV video high.
  • Fig. 7 shows a schematic diagram of quadtree segmentation of UAV video according to an embodiment of the present disclosure, as shown in Fig. 7, wherein the gray part of Fig. 7 is to perform quadtree segmentation on target UAV video according to user requirements Segmentation results; for the video images segmented by the above quadtree, the client can dynamically schedule and render, load and display.
  • the 2D or 3D GIS scene will dynamically load different levels of terrain, image and map data according to the current viewpoint distance, so during the dynamic scheduling process of UAV video, UAVs with the same resolution can be dynamically requested according to the current map level video data.
  • UAV video hardware remains unchanged
  • the UAV’s field of view and video size remain unchanged
  • the actual resolution of the UAV’s image is related to its flight height, as shown in formula (3):
  • Fig. 8 is a schematic structural diagram of a multi-resolution drone video dynamic processing device according to an embodiment of the present disclosure.
  • the multi-resolution UAV video dynamic processing system includes an acquisition module 810 , a model building module 820 , a calculation module 830 , a request sending module 840 , a video dynamic processing module 850 and a video on-demand processing module 860 .
  • Obtaining module 810 used to obtain the viewpoint distance and field of view range of the scene view camera, query the level of the current map through the client, and obtain the current the resolution of the map;
  • the model construction module 820 is used to construct the UAV video pyramid model according to the resolution of the current map, the flight parameters of the UAV and the preset grading rules, wherein the UAV video pyramid model includes multiple levels, each unmanned The level of the machine video pyramid model corresponds to a resolution;
  • Calculation module 830 for calculating the video range within the range of view of the scene view camera according to the range of view of the scene view camera and the video range of the drone;
  • the request sending module 840 is used to send a UAV video loading request to the video server through the client, wherein the UAV video loading request includes the attribute information of the UAV video pyramid model and the video within the field of view of the scene view camera scope;
  • the video dynamic processing module 850 is used to receive, through the client, the UAV video after the UAV video is resampled and segmented in response to the UAV video loading request by the video server, and the processed UAV video Perform dynamic scheduling and rendering, loading and display on the client side;
  • the video on-demand processing module 860 is used to repeat the above steps when the viewpoint distance of the scene view camera changes, and perform resampling, segmentation, dynamic scheduling and rendering, loading and display of the UAV video according to preset requirements .
  • the dynamic processing method and device based on the UAV video pyramid model provided by this disclosure can sample and segment the UAV video step by step according to certain classification rules, and at the same time develop a multi-resolution UAV video high-performance press Rendering methods and devices are required to dynamically load drone video data of different resolutions and ranges according to distance, so as to ensure the on-demand dynamic loading and display of drone videos, especially ultra-high-definition drone videos.
  • Fig. 9 schematically shows a block diagram of an electronic device adapted to implement a dynamic processing method based on a UAV video pyramid model according to an embodiment of the present disclosure.
  • an electronic device 900 includes a processor 901, which can be loaded into a random access memory (RAM) 903 according to a program stored in a read-only memory (ROM) 902 or from a storage section 908.
  • the processor 901 may include, for example, a general-purpose microprocessor (eg, a CPU), an instruction set processor and/or related chipsets, and/or a special-purpose microprocessor (eg, an application-specific integrated circuit (ASIC)), and the like.
  • Processor 901 may also include on-board memory for caching purposes.
  • the processor 901 may include a single processing unit or multiple processing units for executing different actions of the method flow according to the embodiments of the present disclosure.
  • the processor 901, ROM 902, and RAM 903 are connected to each other through a bus 904.
  • the processor 901 executes various operations according to the method flow of the embodiment of the present disclosure by executing programs in the ROM 902 and/or RAM 903. It should be noted that the program can also be stored in one or more memories other than ROM 902 and RAM 903.
  • the processor 901 may also perform various operations according to the method flow of the embodiments of the present disclosure by executing programs stored in the one or more memories.
  • the electronic device 900 may further include an input/output (I/O) interface 905 which is also connected to the bus 904 .
  • the electronic device 900 may also include one or more of the following components connected to the I/O interface 905: an input section 906 including a keyboard, a mouse, etc.; including a cathode ray tube (CRT), a liquid crystal display (LCD), etc.
  • An output section 907 of a speaker or the like a storage section 908 including a hard disk or the like; and a communication section 909 including a network interface card such as a LAN card, a modem, or the like.
  • the communication section 909 performs communication processing via a network such as the Internet.
  • a drive 910 is also connected to the I/O interface 905 as needed.
  • a removable medium 911 such as a magnetic disk, optical disk, magneto-optical disk, semiconductor memory, etc. is mounted on the drive 910 as necessary so that a computer program read therefrom is installed into the storage section 908 as necessary.
  • the present disclosure also provides a computer-readable storage medium.
  • the computer-readable storage medium may be included in the device/apparatus/system described in the above embodiments; it may also exist independently without being assembled into the device/system device/system.
  • the above-mentioned computer-readable storage medium carries one or more programs, and when the above-mentioned one or more programs are executed, the method according to the embodiment of the present disclosure is implemented.
  • the computer-readable storage medium may be a non-volatile computer-readable storage medium, such as may include but not limited to: portable computer disk, hard disk, random access memory (RAM), read-only memory (ROM) , erasable programmable read-only memory (EPROM or flash memory), portable compact disk read-only memory (CD-ROM), optical storage device, magnetic storage device, or any suitable combination of the above.
  • a computer-readable storage medium may be any tangible medium that contains or stores a program that can be used by or in conjunction with an instruction execution system, apparatus, or device.
  • a computer-readable storage medium may include one or more memories other than the above-described ROM 902 and/or RAM 903 and/or ROM 902 and RAM 903.

Abstract

The present disclosure relates to the field of video data real-time transmission and processing. Disclosed are a dynamic processing method and apparatus based on an unmanned aerial vehicle video pyramid model, which are used for solving the problems of the rendering efficiency of an unmanned aerial vehicle video being low, the system operation not being smooth, etc. The method comprises: constructing an unmanned aerial vehicle video pyramid model; and by using the unmanned aerial vehicle video pyramid model and according to a video loading request, performing resampling and segmentation by means of a video server, and performing dynamic scheduling and rendering, loading and displaying on a processed unmanned aerial vehicle video by means of a client. By means of the dynamic processing method for an unmanned aerial vehicle video provided in the present disclosure, unmanned aerial vehicle video data similar to the current map resolution can be dynamically loaded by using an unmanned aerial vehicle video pyramid model and according to a viewpoint distance and a field-of-vision range of the current scene view camera, such that high-performance on-demand rendering of a multi-resolution unmanned aerial vehicle video is realized, thereby ensuring the high efficiency of scene displaying and the smoothness of system operation.

Description

基于无人机视频金字塔模型的动态处理方法及装置Dynamic processing method and device based on UAV video pyramid model 技术领域technical field
本公开涉及视频数据实时传输与处理领域,特别涉及一种基于无人机视频金字塔模型的动态处理方法、装置、电子设备以及存储介质。The present disclosure relates to the field of real-time transmission and processing of video data, and in particular to a dynamic processing method, device, electronic equipment and storage medium based on a UAV video pyramid model.
背景技术Background technique
在无人机图传系统或其他软件中实时查看无人机视频过程中,由于无人机视频数据量大、计算机软硬件性能有限等的影响,会出现无人机视频渲染效率不高、系统运行不流畅等问题。特别是无人机视频与二维和三维地理信息系统(Geographic Information System,GIS)融合过程中,GIS场景要素众多,特别是三维GIS场景包含例如高精度地形、影像、实景三维模型、立体标注等复杂要素,系统的流畅运行对场景渲染效率要求很高,无人机视频、超高清(4K或更高分辨率)无人机视频的接入与融合为二维和/或三维GIS场景的高效渲染带来了很大挑战。During the real-time viewing of UAV video in the UAV image transmission system or other software, due to the influence of the large amount of UAV video data and the limited performance of computer hardware and software, the UAV video rendering efficiency is not high and the system Running is not smooth and other issues. Especially in the fusion process of UAV video and 2D and 3D geographic information system (Geographic Information System, GIS), there are many GIS scene elements, especially the 3D GIS scene contains such as high-precision terrain, images, real-scene 3D models, three-dimensional annotations, etc. Complex elements, the smooth operation of the system requires high scene rendering efficiency, the access and fusion of UAV video and ultra-high-definition (4K or higher resolution) UAV video into an efficient 2D and/or 3D GIS scene Rendering posed quite a challenge.
公开内容public content
根据本公开的第一个方面,提供了一种基于无人机视频金字塔模型的动态处理方法,包括:According to a first aspect of the present disclosure, a dynamic processing method based on a UAV video pyramid model is provided, including:
获取场景视图相机的视点距离和视域范围,通过客户端查询当前地图的等级,并根据地图等级和地图分辨率之间的对应关系以及场景视图相机的视点距离,得到当前地图的分辨率;Obtain the view point distance and field of view range of the scene view camera, query the current map level through the client, and obtain the current map resolution according to the correspondence between the map level and map resolution and the view point distance of the scene view camera;
根据当前地图的分辨率、无人机飞行参数以及预设分级规则,构建无人机视频金字塔模型,其中,无人机视频金字塔模型包括多个等级,每个无人机视频金字塔模型的等级对应一个分辨率;According to the resolution of the current map, UAV flight parameters and preset grading rules, the UAV video pyramid model is constructed, wherein the UAV video pyramid model includes multiple levels, and the level of each UAV video pyramid model corresponds to a resolution;
根据场景视图相机的视域范围和无人机视频范围计算场景视图相机的视域范围内的视频范围;Calculate the video range within the field of view of the scene view camera based on the field of view of the scene view camera and the video range of the drone;
通过客户端向视频服务器发送无人机视频加载请求,其中,无人机视频加载请求包括无人机视频金字塔模型的属性信息和场景视图相机的视域范围内的视频范围;Send the UAV video loading request to the video server through the client, wherein the UAV video loading request includes the attribute information of the UAV video pyramid model and the video range within the field of view of the scene view camera;
通过客户端接收视频服务器响应于无人机视频加载请求对无人机视频进行重采样和切分处理后的无人机视频,并将处理后的无人机视频在客户端进行动态调度与渲染、加载和显示;Resampling and segmenting the UAV video by the video server in response to the UAV video loading request through the client, and dynamically scheduling and rendering the processed UAV video on the client , load and display;
在场景视图相机的视点距离发生变化的情况下,重复上述步骤,将无人机视频按照预设需求进行重采样、切分、动态调度与渲染、加载和显示。When the viewpoint distance of the scene view camera changes, repeat the above steps to resample, segment, dynamically schedule and render, load and display the UAV video according to preset requirements.
根据本公开的实施例,上述预设分级规则由公式(1)~(3)确定:According to an embodiment of the present disclosure, the above preset classification rules are determined by formulas (1)-(3):
H i=360×P i  (1), H i =360×P i (1),
Figure PCTCN2022136213-appb-000001
Figure PCTCN2022136213-appb-000001
Figure PCTCN2022136213-appb-000002
Figure PCTCN2022136213-appb-000002
其中,W是无人机视频的原始宽度像素值,H是无人机视频的原始高度像素值,P i(i=1,2,3…)是无人机视频金字塔模型的等级,H i是P i级无人机视频金字塔模型的对应的视频高度像素值,W i是P i级无人机视频金字塔模型的对应的视频宽度像素值,R是无人机图像的实际分辨率,α是无人机的垂直视场角,Height是无人机的当前飞行高度。 Among them, W is the original width pixel value of the UAV video, H is the original height pixel value of the UAV video, P i (i=1,2,3...) is the level of the UAV video pyramid model, H i is the corresponding video height pixel value of the P i -level UAV video pyramid model, W i is the corresponding video width pixel value of the P i- level UAV video pyramid model, R is the actual resolution of the UAV image, α is the vertical field of view of the drone, and Height is the current flying height of the drone.
根据本公开的实施例,上述无人机视频金字塔模型的属性信息包括无人机视频金字塔模型的等级和与等级相对应的分辨率;According to an embodiment of the present disclosure, the attribute information of the UAV video pyramid model includes the level of the UAV video pyramid model and the resolution corresponding to the level;
其中,无人机视频金字塔模型的分辨率从顶端到低端依次增大。Among them, the resolution of the UAV video pyramid model increases sequentially from the top to the bottom.
根据本公开的实施例,上述视频服务器响应于无人机视频加载请求对无人机视频进行重采样和切分处理包括:According to an embodiment of the present disclosure, the above-mentioned video server performs resampling and segmentation processing on the UAV video in response to the UAV video loading request including:
根据无人机视频加载请求,确定无人机视频的分辨率和等级;Determine the resolution and level of the drone video according to the drone video loading request;
根据场景视图相机的可视范围坐标,利用无人机视频的分辨率和等级,确定可视范围坐标的外包矩形所在的无人机视频金字塔模型的四叉树切分范围;According to the visual range coordinates of the scene view camera, using the resolution and level of the UAV video, determine the quadtree segmentation range of the UAV video pyramid model where the outsourcing rectangle of the visual range coordinates is located;
根据四叉树切分范围,通过视频服务器对无人机视频进行切分,并将切分出来的无人机视频发送到客户端和/或浏览器渲染。According to the quadtree segmentation range, the drone video is segmented through the video server, and the segmented drone video is sent to the client and/or browser for rendering.
根据本公开的实施例,上述将无人机视频按照预设需求进行动态调度与渲染包括:According to an embodiment of the present disclosure, the above-mentioned dynamic scheduling and rendering of drone video according to preset requirements includes:
在场景视图相机的视域范围小于当前无人机视频图像范围的情况下,根据场景视图相机的位置信息加载部分无人机视频数据。When the field of view of the scene view camera is smaller than the range of the current UAV video image, part of the UAV video data is loaded according to the location information of the scene view camera.
根据本公开的实施例,上述率无人机视频金字塔模型包括360P、720P、1080P、4K以及8K多个分辨率。According to an embodiment of the present disclosure, the above-mentioned drone video pyramid model includes multiple resolutions of 360P, 720P, 1080P, 4K and 8K.
根据本公开的实施例,上述当前地图的地图瓦片最小单元是256×256或512×512像素。According to an embodiment of the present disclosure, the minimum map tile unit of the current map is 256×256 or 512×512 pixels.
根据本公开的第二个方面,提供了一种基于无人机视频金字塔模型的动态处理装置,包括:According to a second aspect of the present disclosure, a dynamic processing device based on a UAV video pyramid model is provided, including:
获取模块,用于获取场景视图相机的视点距离和视域范围,通过客户端查询当前地图的等级,并根据地图等级和地图分辨率之间的对应关系以及场景视图相机的视点距离,得到当前地图的分辨率;The obtaining module is used to obtain the view point distance and field of view range of the scene view camera, query the level of the current map through the client, and obtain the current map according to the correspondence between the map level and map resolution and the view point distance of the scene view camera resolution;
模型构建模块,用于根据当前地图的分辨率、无人机飞行参数以及预设分级规则,在视频服务器上构建无人机视频金字塔模型,其中,无人机视频金字塔模型包括多个等级,每个无人机视频金字塔模型的等级对应一个分辨率;The model construction module is used to construct the UAV video pyramid model on the video server according to the resolution of the current map, the UAV flight parameters and the preset classification rules, wherein the UAV video pyramid model includes multiple levels, each The level of the UAV video pyramid model corresponds to a resolution;
计算模块,用于根据场景视图相机的视域范围和无人机视频范围计算场景视图相机的视域范围内的视频范围;Calculation module, for calculating the video range in the field of view range of the scene view camera according to the field of view range of the scene view camera and the video range of the unmanned aerial vehicle;
请求发送模块,用于通过客户端向视频服务器发送无人机视频加载请求,其中,无人机视频加载请求包括无人机视频金字塔模型的属性信息和场景视图相机的视域范围内的视频范围;The request sending module is used to send the UAV video loading request to the video server through the client, wherein the UAV video loading request includes the attribute information of the UAV video pyramid model and the video range within the field of view of the scene view camera ;
视频动态处理模块,用于通过客户端接收视频服务器响应于无人机视频加载请求对无人机视频进行重采样和切分处理后的无人机视频,并将处理后的无人机视频在客户端进行动态调度与渲染、加载和显示;The video dynamic processing module is used to receive the UAV video after the video server responds to the UAV video loading request through the client to resample and segment the UAV video, and to process the UAV video in the The client performs dynamic scheduling and rendering, loading and display;
视频按需处理模块,用于在场景视图相机的视点距离发生变化的情况下,重复上述步骤,将无人机视频按照预设需求进行重采样、切分、动态调度与渲染、加载和显示。The video on-demand processing module is used to repeat the above steps when the viewpoint distance of the scene view camera changes, and resample, segment, dynamically schedule and render, load and display the UAV video according to preset requirements.
根据本公开的第三个方面,提供了一种电子设备,包括:According to a third aspect of the present disclosure, an electronic device is provided, including:
一个或多个处理器;one or more processors;
存储装置,用于存储一个或多个程序,storage means for storing one or more programs,
其中,当一个或多个程序被一个或多个处理器执行时,使得一个或多个处理器执行上述多分辨率无人机视频动态处理方法。Wherein, when the one or more programs are executed by the one or more processors, the one or more processors are made to execute the above multi-resolution UAV video dynamic processing method.
根据本公开的第四个方面,提供了一种计算机可读存储介质,其上存储有可执行指令,该指令被处理器执行时使处理器执行上述多分辨率无人机视频动态处理方法。According to a fourth aspect of the present disclosure, there is provided a computer-readable storage medium, on which executable instructions are stored, and when the instructions are executed by a processor, the processor executes the above-mentioned multi-resolution drone video dynamic processing method.
本公开提供的一种多分辨率无人机视频动态处理方法,可根据当前场景视图相机的视点距离和视域范围来动态加载与当前地图分辨率相似的无人机视频数据,实现多分辨率无人机视频高性能按需渲染,保障场景显示的高效性和系统运行的流畅性。The disclosure provides a multi-resolution UAV video dynamic processing method, which can dynamically load UAV video data similar to the current map resolution according to the viewpoint distance and field of view range of the current scene view camera to achieve multi-resolution High-performance on-demand rendering of UAV video ensures efficient scene display and smooth system operation.
附图说明Description of drawings
图1是根据本公开实施例的基于无人机视频金字塔模型的动态处理方法的流程图;Fig. 1 is the flowchart of the dynamic processing method based on unmanned aerial vehicle video pyramid model according to the embodiment of the present disclosure;
图2是根据本公开实施例的基于无人机视频金字塔模型的动态处理方法的结构示意图;Fig. 2 is a schematic structural diagram of a dynamic processing method based on a UAV video pyramid model according to an embodiment of the present disclosure;
图3是根据本公开实施例的无人机视频金字塔模型的通用视频格式示意图;3 is a schematic diagram of a general video format of a UAV video pyramid model according to an embodiment of the present disclosure;
图4是根据本公开实施例的无人机视频金字塔模型的16:9无人机视频格式的示意图;4 is a schematic diagram of a 16:9 drone video format of a drone video pyramid model according to an embodiment of the present disclosure;
图5是根据本公开实施例的对无人机视频进行重采样和切分的流程图;FIG. 5 is a flow chart of resampling and segmenting drone video according to an embodiment of the disclosure;
图6是根据本公开实施例的同一等级的无人机视频金字塔模型的四叉树切分示意图;6 is a schematic diagram of quadtree segmentation of a UAV video pyramid model of the same level according to an embodiment of the present disclosure;
图7是根据本公开实施例的场景视图相机可视范围计算视频范围的示意图;Fig. 7 is a schematic diagram of calculating a video range according to an embodiment of the disclosure;
图8是根据本公开实施例的基于无人机视频金字塔模型的动态处理系统的结构示意图;8 is a schematic structural diagram of a dynamic processing system based on a UAV video pyramid model according to an embodiment of the present disclosure;
图9示意性示出了根据本公开实施例的适于实现基于无人机视频金字塔模型的动态处理方法的电子设备的方框图。Fig. 9 schematically shows a block diagram of an electronic device adapted to implement a dynamic processing method based on a UAV video pyramid model according to an embodiment of the present disclosure.
具体实施方式Detailed ways
为使本公开的目的、技术方案和优点更加清楚明白,以下结合具体实施例,并参照附图,对本公开作进一步的详细说明。In order to make the purpose, technical solutions and advantages of the present disclosure clearer, the present disclosure will be further described in detail below in conjunction with specific embodiments and with reference to the accompanying drawings.
图1是根据本公开实施例的基于无人机视频金字塔模型的动态处理方法的流程图。FIG. 1 is a flowchart of a dynamic processing method based on a UAV video pyramid model according to an embodiment of the present disclosure.
如图1所示,上述基于无人机视频金字塔模型的动态处理方法包括操作S110~操作S160。As shown in FIG. 1 , the above dynamic processing method based on the UAV video pyramid model includes operation S110 to operation S160.
在操作S110,获取场景视图相机的视点距离和视域范围,通过客户端查询当前地图的等级,并根据地图等级和地图分辨率之间的对应关系以及场景视图相机的视点距离,得到当前地图的分辨率。In operation S110, obtain the view point distance and view range of the scene view camera, query the level of the current map through the client, and obtain the current map level according to the correspondence between the map level and the map resolution and the view point distance of the scene view camera. resolution.
可选地,上述场景视图相机可以是二维场景相机、也可以是三维场景相机,因此,本公开提供的上述无人机视频动态处理方法既可以适用于二维GIS场景中的无人机视频动态处理,也可以适用于三维GIS场景中的无人机视频动态处理Optionally, the above-mentioned scene view camera can be a two-dimensional scene camera or a three-dimensional scene camera. Therefore, the above-mentioned UAV video dynamic processing method provided by the present disclosure can be applied to UAV video in a two-dimensional GIS scene Dynamic processing, also applicable to UAV video dynamic processing in 3D GIS scene
在操作S120,根据当前地图的分辨率、无人机飞行参数以及预设分级规则,构建无人机视频金字塔模型,其中,无人机视频金字塔模型包括多个等级,每个所述无人机视频金字塔模型的等级对应一个分辨率。In operation S120, according to the resolution of the current map, the flight parameters of the drone, and the preset classification rules, a UAV video pyramid model is constructed, wherein the UAV video pyramid model includes multiple levels, and each of the UAVs The level of the video pyramid model corresponds to a resolution.
可选地,上述无人机视频金字塔模型可以根据用户需求在客户端或者视频服务器上进行构建。Optionally, the above-mentioned UAV video pyramid model can be constructed on the client or video server according to user requirements.
无人机视频金字模型包括多个等级(即金字塔模型的层级),每个等级表示一种视频分辨率;由于无人机视频金字塔模型可以表示多个分辨率,因此本公开提供的上述无人机视频动态处理方法能够处理多个分辨率的无人机视频。The UAV video pyramid model includes multiple levels (i.e. the levels of the pyramid model), and each level represents a video resolution; since the UAV video pyramid model can represent multiple resolutions, the above-mentioned unmanned aerial vehicles provided by this disclosure The UAV video dynamic processing method is capable of processing UAV video at multiple resolutions.
上述无人机飞行参数包括无人机飞行高度和无人机垂直视场角。The above-mentioned flight parameters of the UAV include the flight height of the UAV and the vertical field of view of the UAV.
在操作S130,根据场景视图相机的视域范围和无人机视频范围计算场景视图相机的视域范围内的视频范围。In operation S130, a video range within the view range of the scene view camera is calculated according to the view range of the scene view camera and the drone video range.
在操作S140,通过客户端向视频服务器发送无人机视频加载请求,其中,无人机视频加载请求包括无人机视频金字塔模型的属性信息和场景视图相机的视域范围内的视频范围。In operation S140, the client sends a UAV video loading request to the video server, wherein the UAV video loading request includes the attribute information of the UAV video pyramid model and the video range within the field of view of the scene view camera.
上述无人机视频金字塔模型的属性信息包括金字塔模型的等级和与等级相对应的分辨率。The attribute information of the pyramid model of the UAV video includes the level of the pyramid model and the resolution corresponding to the level.
在操作S150,通过客户端接收视频服务器响应于无人机视频加载请求对无人机视频进行重采样和切分处理后的无人机视频,并将处理后的无人机视频在客户端进行动态调度与渲染、加载和显示。In operation S150, the video server receives the UAV video after resampling and segmenting the UAV video in response to the UAV video loading request through the client, and performs the processing of the UAV video on the client. Dynamic scheduling and rendering, loading and displaying.
在操作S160,在场景视图相机的视点距离发生变化的情况下,重复上述步骤,将无人机视频按照预设需求进行重采样、切分、动态调度与渲染、加载和显示。In operation S160, when the viewpoint distance of the scene view camera changes, the above steps are repeated, and the UAV video is resampled, segmented, dynamically scheduled and rendered, loaded and displayed according to preset requirements.
本公开提供的一种多分辨率无人机视频动态处理方法,可根据当前场景视图相机的视点距离和视域范围来动态加载与当前地图分辨率相似的无人机视频数据,实现多分辨率无人机视频高性能按需渲染,保障场景显示的高效性和系统运行的流畅性。The disclosure provides a multi-resolution UAV video dynamic processing method, which can dynamically load UAV video data similar to the current map resolution according to the viewpoint distance and field of view range of the current scene view camera to achieve multi-resolution High-performance on-demand rendering of UAV video ensures efficient scene display and smooth system operation.
图2是根据本公开实施例的基于无人机视频金字塔模型的动态处理方法的结构示意图。Fig. 2 is a schematic structural diagram of a dynamic processing method based on a UAV video pyramid model according to an embodiment of the present disclosure.
下面结合图2对本公开提供的多分辨无人机视频动态处理方法作进一步详细地描述。The multi-resolution drone video dynamic processing method provided by the present disclosure will be further described in detail below in conjunction with FIG. 2 .
如图2示出了客户端/浏览器端运行无人机图传系统进行无人机视频的加载和显示,服务器端存储无人机视频数据、无人机视频金字塔模型构建规则和方法。利用客户端和/或浏览器获取场景视图相机视点距离和视域范围,查询当前地图等级,并根据目前常用的地图等级与地图分辨率对应关系得到当前地图分辨率;根据预设分级规则、无人机飞行参数和当前地图的分辨率计算无人机视频金字塔模型的等级和分辨率;并根据无人机视频范围和视域范围计算视域内的视频范围;向视频服务器端发送请求,重新获取对应分辨率的无人机视频;服务 器端根据无人机视频请求中的等级、分辨率和视频范围等参数,对无人机视频进行重采样和切分,并返回重采样后的无人机视频;在三维场景中获取到重采样后的无人机视频,进行无人机视频的加载和显示;当三维场景视点发生变化时,重复上述步骤进行多分辨率无人机视频按需调度与渲染。Figure 2 shows that the client/browser side runs the UAV image transmission system to load and display the UAV video, the server side stores the UAV video data, and the UAV video pyramid model construction rules and methods. Use the client and/or browser to obtain the scene view camera viewpoint distance and field of view range, query the current map level, and obtain the current map resolution according to the currently commonly used map level and map resolution correspondence; according to the preset classification rules, no Calculate the level and resolution of the UAV video pyramid model based on the flight parameters of the man-machine and the resolution of the current map; and calculate the video range in the field of view according to the video range and field of view of the UAV; send a request to the video server to retrieve UAV video with the corresponding resolution; the server resamples and segments the UAV video according to the parameters in the UAV video request, such as grade, resolution, and video range, and returns the resampled UAV Video: Obtain the resampled UAV video in the 3D scene, load and display the UAV video; when the viewpoint of the 3D scene changes, repeat the above steps to perform multi-resolution UAV video on-demand scheduling and render.
应当理解的是,图2所示的结构图仅仅是示意性的,用户可以根据自己的需求,将无人机视频金字塔模型在客户端进行构建。It should be understood that the structural diagram shown in FIG. 2 is only schematic, and users can construct the UAV video pyramid model on the client side according to their own needs.
根据本公开的实施例,上述预设分级规则由公式(1)~(3)确定:According to an embodiment of the present disclosure, the above preset classification rules are determined by formulas (1)-(3):
H i=360×P i  (1), H i =360×P i (1),
Figure PCTCN2022136213-appb-000003
Figure PCTCN2022136213-appb-000003
Figure PCTCN2022136213-appb-000004
Figure PCTCN2022136213-appb-000004
其中,W是无人机视频的原始宽度像素值,H是无人机视频的原始高度像素值,P i(i=1,2,3…)是无人机视频金字塔模型的等级,H i是P i级无人机视频金字塔模型的对应的视频高度像素值,W i是P i级无人机视频金字塔模型的对应的视频宽度像素值,R是无人机图像的实际分辨率,α是无人机的垂直视场角,Height是无人机的当前飞行高度。 Among them, W is the original width pixel value of the UAV video, H is the original height pixel value of the UAV video, P i (i=1,2,3...) is the level of the UAV video pyramid model, H i is the corresponding video height pixel value of the P i -level UAV video pyramid model, W i is the corresponding video width pixel value of the P i- level UAV video pyramid model, R is the actual resolution of the UAV image, α is the vertical field of view of the drone, and Height is the current flying height of the drone.
无人机视频金字塔模型构建由公式(1)~(3)确定,从上述公式上可以得出,本公开提供的无人机视频金字塔模型具有多个等级以及多个分辨率,能够涵盖视频领域常用的各个视频分辨率。可选地,用户可以根据自身需求,按照本公开提供的上述公式确定具有其他视频分辨率的无人机视频金字塔模型。The construction of the UAV video pyramid model is determined by formulas (1) to (3). From the above formulas, it can be concluded that the UAV video pyramid model provided by this disclosure has multiple levels and multiple resolutions, and can cover the video field Various commonly used video resolutions. Optionally, the user can determine the UAV video pyramid model with other video resolutions according to the above formula provided in the present disclosure according to their own needs.
根据本公开的实施例,上述无人机视频金字塔模型包括多个等级,每个无人机视频金字塔模型的等级对应一个分辨率。According to an embodiment of the present disclosure, the above UAV video pyramid model includes multiple levels, and each level of the UAV video pyramid model corresponds to a resolution.
其中,无人机视频金字塔模型的属性信息包括无人机视频金字塔模型的等级和与等级相对应的分辨率。Wherein, the attribute information of the UAV video pyramid model includes the level of the UAV video pyramid model and the resolution corresponding to the level.
其中,无人机视频金字塔模型的分辨率从顶端到低端依次增大。Among them, the resolution of the UAV video pyramid model increases sequentially from the top to the bottom.
图3是根据本公开实施例的无人机视频金字塔模型的通用视频格式示意图。Fig. 3 is a schematic diagram of a general video format of a UAV video pyramid model according to an embodiment of the present disclosure.
图4是根据本公开实施例的无人机视频金字塔模型的16:9无人机视频格式的示意图。4 is a schematic diagram of a 16:9 drone video format of a drone video pyramid model according to an embodiment of the disclosure.
下面结合图3和图4对本公开提供的无人机视频金字塔模型作进一步详细地描述。The UAV video pyramid model provided by the present disclosure will be further described in detail below with reference to FIG. 3 and FIG. 4 .
如图3示意性示出了无人机视频金字塔模型及金字塔模型各级的分辨率,其中,无人机视频金字塔模型的第一等级图像310的分辨率是360P,即360*W/H;无人机视频金字塔模型的第二等级图像320的分辨率是720P,即720*W/H;无人机视频金字塔模型的第三等级图像330的分辨率是1080P,即1080*W/H;无人机视频金字塔模型的第六等级图像340的分辨率是4K,即2160*W/H。其中,上述W/H表示无人机视频宽度与高度的比值,例如宽度与高度比可以是1:1,也可以是16:9,本领域技术人员应当理解的是图3仅是示意性的,本领域技术人员可以根据实际需求,构建N(N为正整数)个等级(例如P 1、P 2、P 3、P 4、P 5、P 6……P N)的无人机视频金字塔模型。考虑到三维GIS系统中的地图瓦片最小单元通常为256×256或512×512像素,本公开以360P视频为基础逐级构建多分辨无人机视频金字塔,能够涵盖720P、 1080P、4K、8K等常用视频分辨率。多分辨无人机视频金字塔等级与无人机视频像素高度和宽度之间的对应关系如上述公式(1)和(2)所示。 Fig. 3 schematically shows the resolution of the unmanned aerial vehicle video pyramid model and the pyramid model at all levels, wherein, the resolution of the first level image 310 of the unmanned aerial vehicle video pyramid model is 360P, namely 360*W/H; The resolution of the second level image 320 of the UAV video pyramid model is 720P, i.e. 720*W/H; the resolution of the third level image 330 of the UAV video pyramid model is 1080P, i.e. 1080*W/H; The resolution of the sixth-level image 340 of the UAV video pyramid model is 4K, that is, 2160*W/H. Among them, the above W/H represents the ratio of the width and height of the drone video, for example, the ratio of width to height can be 1:1 or 16:9, and those skilled in the art should understand that Figure 3 is only schematic , those skilled in the art can build N (N is a positive integer) levels (such as P 1 , P 2 , P 3 , P 4 , P 5 , P 6 ... P N ) UAV video pyramids according to actual needs Model. Considering that the smallest unit of a map tile in a 3D GIS system is usually 256×256 or 512×512 pixels, this disclosure builds a multi-resolution UAV video pyramid step by step based on 360P video, which can cover 720P, 1080P, 4K, 8K and other commonly used video resolutions. The correspondence between the multi-resolution UAV video pyramid level and the UAV video pixel height and width is shown in the above formulas (1) and (2).
H i=360×P i  (1), H i =360×P i (1),
Figure PCTCN2022136213-appb-000005
Figure PCTCN2022136213-appb-000005
图4示出了16:9(W/H,视频宽度/视频高度)无人机视频格式的无人机视频金字塔模型及金字塔模型各等级的分辨率,其中,无人机视频金字塔模型的第一等级图像410的分辨率是360P,即640×360;无人机视频金字塔模型的第二等级图像420的分辨率是720P,即1280×720;无人机视频金字塔模型的第三等级图像430的分辨率是1080P,即1920×1080;无人机视频金字塔模型的第六等级图像440的分辨率是4K,即3840×2160;图4仅仅是示意性的,用户可以根据需要构建具有多个等级(例如P 1、P 2、P 3、P 4、P 5、P 6……P N)的16:9视频格式的金字塔模型。如图4所示,对于16:9无人机视频格式,无人机视频宽度由公式(4)确定: Fig. 4 shows 16: 9 (W/H, video width/video height) the UAV video pyramid model of UAV video format and the resolution of each level of pyramid model, wherein, the UAV video pyramid model's first The resolution of the first-level image 410 is 360P, i.e. 640×360; the resolution of the second-level image 420 of the UAV video pyramid model is 720P, i.e. 1280×720; the third-level image 430 of the UAV video pyramid model The resolution is 1080P, that is, 1920×1080; the resolution of the sixth-level image 440 of the UAV video pyramid model is 4K, that is, 3840×2160; Figure 4 is only schematic, and users can construct multiple Pyramid model in 16:9 video format for levels (eg P 1 , P 2 , P 3 , P 4 , P 5 , P 6 . . . P N ). As shown in Figure 4, for the 16:9 UAV video format, the UAV video width is determined by formula (4):
W i=640×P i  (4)。 W i =640×P i (4).
图3和图4示意性展示了根据本公开实施例的无人机视频金字塔模型的结构图,从图3和图4可以毫无疑义地看出,本公开所提供的无人机视频动态处理方法,能够处理不同分辨率的无人机视频,具有较广泛应用场景,应当特别指明的是,虽然图3和图4是三维GIS场景的示意图,但本公开提供的上述无人机视频动态处理方法同样适用于二维GIS场景。此外结合上述金字塔模型构建过程,用户可以构建具有多个其他分辨率的无人机视频金字塔模型。Fig. 3 and Fig. 4 schematically show the structural diagram of the UAV video pyramid model according to the embodiment of the present disclosure. From Fig. 3 and Fig. 4, it can be seen without doubt that the UAV video dynamic processing provided by the present disclosure method, capable of processing UAV videos with different resolutions, and has a wide range of application scenarios. It should be particularly noted that although Figures 3 and 4 are schematic diagrams of 3D GIS scenes, the above-mentioned UAV video dynamic processing provided by this disclosure The method is also applicable to two-dimensional GIS scene. In addition, combined with the above pyramid model building process, users can build UAV video pyramid models with multiple other resolutions.
图5是根据本公开实施例的对无人机视频进行重采样和切分的流程图。FIG. 5 is a flow chart of resampling and segmenting drone video according to an embodiment of the disclosure.
如图5所示,上述视频服务器响应于无人机视频加载请求对无人机视频进行重采样和切分处理包括操作S510~操作S530。As shown in FIG. 5 , the video server resampling and segmenting the UAV video in response to the UAV video loading request includes operation S510 to operation S530 .
在操作S510,根据无人机视频加载请求,确定无人机视频的分辨率和等级。In operation S510, the resolution and grade of the drone video are determined according to the loading request of the drone video.
在操作S520,根据场景视图相机的可视范围坐标,利用无人机视频的分辨率和等级,确定可视范围坐标的外包矩形所在的无人机视频金字塔模型的四叉树切分范围。In operation S520, according to the visible range coordinates of the scene view camera, using the resolution and level of the drone video, determine the quadtree segmentation range of the drone video pyramid model where the outer rectangle of the visible range coordinates is located.
在操作S530,根据四叉树切分范围,通过视频服务器对无人机视频进行切分,并将切分出来的无人机视频发送到客户端和/或浏览器渲染。In operation S530, according to the quadtree segmentation range, the drone video is segmented by the video server, and the segmented drone video is sent to the client and/or the browser for rendering.
本领域技术人员应当理解的是,用户通常只对部分无人机视频感兴趣,因此并不需要将无人机所涵盖的所有视域内的无人机视频进行全部加载和显示,通过上述切分,能够使得客户端按照用户需求动态加载或显示无人机视频,同时,由于按照用户需求对无人机视频进行切分,能够大大降低客户端动态调度与渲染的压力,使得无人机视频能够更快地加载和显示,提升了用户的体验。Those skilled in the art should understand that users are usually only interested in some drone videos, so it is not necessary to load and display all the drone videos in all the sight areas covered by the drone. , enabling the client to dynamically load or display UAV videos according to user needs. At the same time, because the UAV videos are segmented according to user needs, the pressure on dynamic scheduling and rendering of the client can be greatly reduced, so that UAV videos can be Faster loading and display improves user experience.
根据本公开的实施例,上述将无人机视频按照预设需求进行动态调度与渲染包括:According to an embodiment of the present disclosure, the above-mentioned dynamic scheduling and rendering of drone video according to preset requirements includes:
在场景视图相机的视域范围小于当前无人机视频图像范围的情况下,根据场景视图相机的位置信息加载部分无人机视频数据,其中,上述无人机视频具有多个分辨率。In the case that the field of view of the scene view camera is smaller than the range of the current UAV video image, part of the UAV video data is loaded according to the position information of the scene view camera, wherein the UAV video has multiple resolutions.
根据本公开的实施例,上述率无人机视频金字塔模型包括360P、720P、1080P、4K以及8K多个分辨率。According to an embodiment of the present disclosure, the above-mentioned drone video pyramid model includes multiple resolutions of 360P, 720P, 1080P, 4K and 8K.
根据本公开实施例,当前地图的地图瓦片最小单元是256×256或512×512像素。According to an embodiment of the present disclosure, the minimum map tile unit of the current map is 256×256 or 512×512 pixels.
本公开提供的上述无人机视频金字塔模型将无人机视频按照一定的分级规则进行逐级采样和切分,兼顾GIS地图瓦片大小的同时,能够涵盖720P、1080P、4K、8K等常用视频分辨率,为多分辨率无人机视频按需加载提供了可能。The above UAV video pyramid model provided by this disclosure samples and divides UAV videos step by step according to certain classification rules, taking into account the size of GIS map tiles, and can cover commonly used videos such as 720P, 1080P, 4K, 8K, etc. Resolution, it is possible to load multi-resolution UAV video on demand.
图6是根据本公开实施例的同一等级的无人机视频金字塔模型的四叉树切分示意图。Fig. 6 is a schematic diagram of quadtree splitting of the UAV video pyramid model at the same level according to an embodiment of the present disclosure.
图7是根据本公开实施例的场景视图相机可视范围计算视频范围的示意图。FIG. 7 is a schematic diagram of calculating a video range according to an embodiment of the disclosure.
下面结合图6和7对本公开提供的上述多分辨无人机视频动态处理方法作进一步详细地说明。The above-mentioned multi-resolution drone video dynamic processing method provided by the present disclosure will be further described in detail below with reference to FIGS. 6 and 7 .
对于无人机视频金字塔模型中同一等级的无人机视频,可按照四叉树的方式对其进行切分,如图6和7所示,便于小范围的视频数据按需加载,即当场景视图相机的视域小于当前无人机图像范围时,可按照场景相机位置加载其中一部分视频数据,从而减小视频数据量,提高场景渲染效率。图6和7示意性示出了根据本公开按照用户需求对无人机视频进行部分显示或对无人机视频进行切分的示意图,其中,本领域技术人员可以根据用户需求将图6和7的背景换成无人机视频和/或当前地图,从而方便进行无人机视频的动态处理。其中,图6中,W i表示无人机视频金字塔模型第i等级对应的无人机视频的宽度,W i/4、W i/2表示该无人机视频的1/4、1/2的视频宽度;H i表示无人机视频金字塔模型第i等级对应的无人机视频的高度,H i/4、H i/2表示该无人机视频的1/4、1/2的视频高度。图7示出了根据本公开实施例对无人机视频进行四叉树切分的示意图,如图7所示,其中图7灰色部分是根据用户需求,对目标无人机视频进行四叉树切分的结果;对于上述四叉树切分的视频图像,客户端可以进行动态调度与渲染、加载和显示。 For the UAV video of the same level in the UAV video pyramid model, it can be segmented according to the quadtree, as shown in Figures 6 and 7, which is convenient for small-scale video data to be loaded on demand, that is, when the scene When the field of view of the view camera is smaller than the current UAV image range, part of the video data can be loaded according to the scene camera position, thereby reducing the amount of video data and improving scene rendering efficiency. Figures 6 and 7 schematically show a schematic view of displaying part of the UAV video or segmenting the UAV video according to the user's needs according to the present disclosure, wherein those skilled in the art can use Figures 6 and 7 according to the user's needs The background of the drone is replaced by the drone video and/or the current map, which facilitates the dynamic processing of the drone video. Among them, in Figure 6, W i represents the width of the UAV video corresponding to the i-th level of the UAV video pyramid model, and W i /4 and W i /2 represent 1/4 and 1/2 of the UAV video The width of the video; H i represents the height of the UAV video corresponding to the i-th level of the UAV video pyramid model, and H i /4 and H i /2 represent the video of 1/4 and 1/2 of the UAV video high. Fig. 7 shows a schematic diagram of quadtree segmentation of UAV video according to an embodiment of the present disclosure, as shown in Fig. 7, wherein the gray part of Fig. 7 is to perform quadtree segmentation on target UAV video according to user requirements Segmentation results; for the video images segmented by the above quadtree, the client can dynamically schedule and render, load and display.
二维或三维GIS场景会根据当前视点距离来动态加载不同等级的地形、影像和地图数据,因此在无人机视频动态调度过程中,可根据当前地图等级来动态请求同分辨率的无人机视频数据。在无人机视频硬件不变的情况下,无人机视场角与视频大小不变,无人机图像实际分辨率与其飞行高度有关,如公式(3)所示:The 2D or 3D GIS scene will dynamically load different levels of terrain, image and map data according to the current viewpoint distance, so during the dynamic scheduling process of UAV video, UAVs with the same resolution can be dynamically requested according to the current map level video data. When the UAV video hardware remains unchanged, the UAV’s field of view and video size remain unchanged, and the actual resolution of the UAV’s image is related to its flight height, as shown in formula (3):
Figure PCTCN2022136213-appb-000006
Figure PCTCN2022136213-appb-000006
在确定当前无人机视频实际分辨率和等级后,根据场景视图相机(场景视图)的可视范围坐标,计算其外包矩形所在的视频金字塔四叉树切分范围,服务器跟据该范围将此部分视频数据切分出来,发送至客户端渲染。After determining the actual resolution and level of the current UAV video, according to the visual range coordinates of the scene view camera (scene view), calculate the video pyramid quadtree segmentation range where its outsourcing rectangle is located, and the server will follow this range Part of the video data is split and sent to the client for rendering.
在无人机视频在无人机图传系统的二维或三维场景加载和显示过程中,根据当前视点距离、视域范围来动态计算所需的无人机视频分辨率和当前等级中的视频范围,在远距离调用低分辨率视频,在近距离调用高分辨率视频,在小的视域内调用小范围的视频图像,从而减小二维或三维场景渲染压力,提高场景渲染性能。In the process of loading and displaying the UAV video in the 2D or 3D scene of the UAV image transmission system, dynamically calculate the required UAV video resolution and the video in the current level according to the current viewpoint distance and field of view range Range, call low-resolution video at a long distance, call high-resolution video at a short distance, and call a small-scale video image in a small field of view, thereby reducing the pressure of 2D or 3D scene rendering and improving scene rendering performance.
图8是根据本公开实施例的多分辨无人机视频动态处理装置的结构示意图。Fig. 8 is a schematic structural diagram of a multi-resolution drone video dynamic processing device according to an embodiment of the present disclosure.
如图8所示,上述多分辨率无人机视频动态处理系统包括获取模块810、模型构建模块820、计算模块830、请求发送模块840、视频动态处理模块850以及视频按需处理模块860。As shown in FIG. 8 , the multi-resolution UAV video dynamic processing system includes an acquisition module 810 , a model building module 820 , a calculation module 830 , a request sending module 840 , a video dynamic processing module 850 and a video on-demand processing module 860 .
获取模块810,用于获取场景视图相机的视点距离和视域范围,通过客户端查询当前地图的等级,并根据地图等级和地图分辨率之间的对应关系以及场景视图相机的视点距离,得到当前地图的分辨率;Obtaining module 810, used to obtain the viewpoint distance and field of view range of the scene view camera, query the level of the current map through the client, and obtain the current the resolution of the map;
模型构建模块820,用于根据当前地图的分辨率、无人机飞行参数以及预设分级规则,构建无人机视频金字塔模型,其中,无人机视频金字塔模型包括多个等级,每个无人机视频金字塔模型的等级对应一个分辨率;The model construction module 820 is used to construct the UAV video pyramid model according to the resolution of the current map, the flight parameters of the UAV and the preset grading rules, wherein the UAV video pyramid model includes multiple levels, each unmanned The level of the machine video pyramid model corresponds to a resolution;
计算模块830,用于根据场景视图相机的视域范围和无人机视频范围计算场景视图相机的视域范围内的视频范围; Calculation module 830, for calculating the video range within the range of view of the scene view camera according to the range of view of the scene view camera and the video range of the drone;
请求发送模块840,用于通过客户端向视频服务器发送无人机视频加载请求,其中,无人机视频加载请求包括无人机视频金字塔模型的属性信息和场景视图相机的视域范围内的视频范围;The request sending module 840 is used to send a UAV video loading request to the video server through the client, wherein the UAV video loading request includes the attribute information of the UAV video pyramid model and the video within the field of view of the scene view camera scope;
视频动态处理模块850,用于通过客户端接收视频服务器响应于无人机视频加载请求对无人机视频进行重采样和切分处理后的无人机视频,并将处理后的无人机视频在客户端进行动态调度与渲染、加载和显示;The video dynamic processing module 850 is used to receive, through the client, the UAV video after the UAV video is resampled and segmented in response to the UAV video loading request by the video server, and the processed UAV video Perform dynamic scheduling and rendering, loading and display on the client side;
视频按需处理模块860,用于在场景视图相机的视点距离发生变化的情况下,重复上述步骤,将无人机视频按照预设需求进行重采样、切分、动态调度与渲染、加载和显示。The video on-demand processing module 860 is used to repeat the above steps when the viewpoint distance of the scene view camera changes, and perform resampling, segmentation, dynamic scheduling and rendering, loading and display of the UAV video according to preset requirements .
本公开提供的基于无人机视频金字塔模型的动态处理方法和装置,将无人机视频按照一定的分级规则进行逐级采样和切分,同时研发一种多分辨率无人机视频高性能按需渲染方法和装置,按距离来动态加载不同分辨率、不同范围的无人机视频数据,保障无人机视频、特别是超高清无人机视频的按需动态加载和显示。The dynamic processing method and device based on the UAV video pyramid model provided by this disclosure can sample and segment the UAV video step by step according to certain classification rules, and at the same time develop a multi-resolution UAV video high-performance press Rendering methods and devices are required to dynamically load drone video data of different resolutions and ranges according to distance, so as to ensure the on-demand dynamic loading and display of drone videos, especially ultra-high-definition drone videos.
图9示意性示出了根据本公开实施例的适于实现基于无人机视频金字塔模型的动态处理方法的电子设备的方框图。Fig. 9 schematically shows a block diagram of an electronic device adapted to implement a dynamic processing method based on a UAV video pyramid model according to an embodiment of the present disclosure.
如图9所示,根据本公开实施例的电子设备900包括处理器901,其可以根据存储在只读存储器(ROM)902中的程序或者从存储部分908加载到随机访问存储器(RAM)903中的程序而执行各种适当的动作和处理。处理器901例如可以包括通用微处理器(例如CPU)、指令集处理器和/或相关芯片组和/或专用微处理器(例如,专用集成电路(ASIC))等等。处理器901还可以包括用于缓存用途的板载存储器。处理器901可以包括用于执行根据本公开实施例的方法流程的不同动作的单一处理单元或者是多个处理单元。As shown in FIG. 9, an electronic device 900 according to an embodiment of the present disclosure includes a processor 901, which can be loaded into a random access memory (RAM) 903 according to a program stored in a read-only memory (ROM) 902 or from a storage section 908. Various appropriate actions and processing are performed by the program. The processor 901 may include, for example, a general-purpose microprocessor (eg, a CPU), an instruction set processor and/or related chipsets, and/or a special-purpose microprocessor (eg, an application-specific integrated circuit (ASIC)), and the like. Processor 901 may also include on-board memory for caching purposes. The processor 901 may include a single processing unit or multiple processing units for executing different actions of the method flow according to the embodiments of the present disclosure.
在RAM 903中,存储有电子设备900操作所需的各种程序和数据。处理器901、ROM 902以及RAM 903通过总线904彼此相连。处理器901通过执行ROM 902和/或RAM 903中的程序来执行根据本公开实施例的方法流程的各种操作。需要注意,所述程序也可以存储在除ROM 902和RAM 903以外的一个或多个存储器中。处理器901也可以通过执行存储在所述一个或多个存储器中的程序来执行根据本公开实施例的方法流程的各种操作。In the RAM 903, various programs and data necessary for the operation of the electronic device 900 are stored. The processor 901, ROM 902, and RAM 903 are connected to each other through a bus 904. The processor 901 executes various operations according to the method flow of the embodiment of the present disclosure by executing programs in the ROM 902 and/or RAM 903. It should be noted that the program can also be stored in one or more memories other than ROM 902 and RAM 903. The processor 901 may also perform various operations according to the method flow of the embodiments of the present disclosure by executing programs stored in the one or more memories.
根据本公开的实施例,电子设备900还可以包括输入/输出(I/O)接口905,输入/输出(I/O)接口905也连接至总线904。电子设备900还可以包括连接至I/O接口905的以下部件中的一项或多项:包括键盘、鼠标等的输入部分906;包括诸如阴极射线管(CRT)、液晶显示器(LCD) 等以及扬声器等的输出部分907;包括硬盘等的存储部分908;以及包括诸如LAN卡、调制解调器等的网络接口卡的通信部分909。通信部分909经由诸如因特网的网络执行通信处理。驱动器910也根据需要连接至I/O接口905。可拆卸介质911,诸如磁盘、光盘、磁光盘、半导体存储器等等,根据需要安装在驱动器910上,以便于从其上读出的计算机程序根据需要被安装入存储部分908。According to an embodiment of the present disclosure, the electronic device 900 may further include an input/output (I/O) interface 905 which is also connected to the bus 904 . The electronic device 900 may also include one or more of the following components connected to the I/O interface 905: an input section 906 including a keyboard, a mouse, etc.; including a cathode ray tube (CRT), a liquid crystal display (LCD), etc. An output section 907 of a speaker or the like; a storage section 908 including a hard disk or the like; and a communication section 909 including a network interface card such as a LAN card, a modem, or the like. The communication section 909 performs communication processing via a network such as the Internet. A drive 910 is also connected to the I/O interface 905 as needed. A removable medium 911 such as a magnetic disk, optical disk, magneto-optical disk, semiconductor memory, etc. is mounted on the drive 910 as necessary so that a computer program read therefrom is installed into the storage section 908 as necessary.
本公开还提供了一种计算机可读存储介质,该计算机可读存储介质可以是上述实施例中描述的设备/装置/系统中所包含的;也可以是单独存在,而未装配入该设备/装置/系统中。上述计算机可读存储介质承载有一个或者多个程序,当上述一个或者多个程序被执行时,实现根据本公开实施例的方法。The present disclosure also provides a computer-readable storage medium. The computer-readable storage medium may be included in the device/apparatus/system described in the above embodiments; it may also exist independently without being assembled into the device/system device/system. The above-mentioned computer-readable storage medium carries one or more programs, and when the above-mentioned one or more programs are executed, the method according to the embodiment of the present disclosure is implemented.
根据本公开的实施例,计算机可读存储介质可以是非易失性的计算机可读存储介质,例如可以包括但不限于:便携式计算机磁盘、硬盘、随机访问存储器(RAM)、只读存储器(ROM)、可擦式可编程只读存储器(EPROM或闪存)、便携式紧凑磁盘只读存储器(CD-ROM)、光存储器件、磁存储器件、或者上述的任意合适的组合。在本公开中,计算机可读存储介质可以是任何包含或存储程序的有形介质,该程序可以被指令执行系统、装置或者器件使用或者与其结合使用。例如,根据本公开的实施例,计算机可读存储介质可以包括上文描述的ROM 902和/或RAM 903和/或ROM 902和RAM 903以外的一个或多个存储器。According to an embodiment of the present disclosure, the computer-readable storage medium may be a non-volatile computer-readable storage medium, such as may include but not limited to: portable computer disk, hard disk, random access memory (RAM), read-only memory (ROM) , erasable programmable read-only memory (EPROM or flash memory), portable compact disk read-only memory (CD-ROM), optical storage device, magnetic storage device, or any suitable combination of the above. In the present disclosure, a computer-readable storage medium may be any tangible medium that contains or stores a program that can be used by or in conjunction with an instruction execution system, apparatus, or device. For example, according to an embodiment of the present disclosure, a computer-readable storage medium may include one or more memories other than the above-described ROM 902 and/or RAM 903 and/or ROM 902 and RAM 903.
本领域技术人员可以理解,本公开的各个实施例和/或权利要求中记载的特征可以进行多种组合或/或结合,即使这样的组合或结合没有明确记载于本公开中。特别地,在不脱离本公开精神和教导的情况下,本公开的各个实施例和/或权利要求中记载的特征可以进行多种组合和/或结合。所有这些组合和/或结合均落入本公开的范围。Those skilled in the art can understand that various combinations and/or combinations of the features described in the various embodiments and/or claims of the present disclosure can be made, even if such combinations or combinations are not explicitly recorded in the present disclosure. In particular, without departing from the spirit and teaching of the present disclosure, the various embodiments of the present disclosure and/or the features described in the claims can be combined and/or combined in various ways. All such combinations and/or combinations fall within the scope of the present disclosure.
以上所述的具体实施例,对本公开的目的、技术方案和有益效果进行了进一步详细说明,应理解的是,以上所述仅为本公开的具体实施例而已,并不用于限制本公开,凡在本公开的精神和原则之内,所做的任何修改、等同替换、改进等,均应包含在本公开的保护范围之内。The specific embodiments described above have further described the purpose, technical solutions and beneficial effects of the present disclosure in detail. It should be understood that the above descriptions are only specific embodiments of the present disclosure, and are not intended to limit the present disclosure. Within the spirit and principles of the present disclosure, any modifications, equivalent replacements, improvements, etc., shall be included in the protection scope of the present disclosure.

Claims (10)

  1. 一种基于无人机视频金字塔模型的动态处理方法,包括:A dynamic processing method based on the UAV video pyramid model, comprising:
    获取场景视图相机的视点距离和视域范围,通过客户端查询当前地图的等级,并根据地图等级和地图分辨率之间的对应关系以及所述场景视图相机的视点距离,得到所述当前地图的分辨率;Obtain the view point distance and view range of the scene view camera, query the level of the current map through the client, and obtain the view point distance of the scene view camera according to the correspondence between the map level and map resolution and the view point distance of the scene view camera. resolution;
    根据所述当前地图的分辨率、无人机飞行参数以及预设分级规则,构建无人机视频金字塔模型,其中,所述无人机视频金字塔模型包括多个等级,每个所述无人机视频金字塔模型的等级对应一个分辨率;According to the resolution of the current map, the flight parameters of the unmanned aerial vehicle and the preset classification rules, the unmanned aerial vehicle video pyramid model is constructed, wherein the unmanned aerial vehicle video pyramid model includes multiple levels, each of the unmanned aerial vehicles The level of the video pyramid model corresponds to a resolution;
    根据所述场景视图相机的视域范围和无人机视频范围计算所述场景视图相机的视域范围内的视频范围;calculating the video range within the field of view of the scene view camera according to the field of view of the scene view camera and the video range of the drone;
    通过所述客户端向视频服务器发送无人机视频加载请求,其中,所述无人机视频加载请求包括所述无人机视频金字塔模型的属性信息和所述场景视图相机的视域范围内的视频范围;The UAV video loading request is sent to the video server by the client, wherein the UAV video loading request includes the attribute information of the UAV video pyramid model and the field of view of the scene view camera. video range;
    通过所述客户端接收所述视频服务器响应于所述无人机视频加载请求对无人机视频进行重采样和切分处理后的无人机视频,并将处理后的无人机视频在所述客户端进行动态调度与渲染、加载和显示;Receive the UAV video after the UAV video is resampled and segmented by the video server in response to the UAV video loading request through the client, and process the UAV video in the The client performs dynamic scheduling and rendering, loading and displaying;
    在所述场景视图相机的视点距离发生变化的情况下,重复上述步骤,将所述无人机视频按照预设需求进行重采样、切分、动态调度与渲染、加载和显示。When the viewpoint distance of the scene view camera changes, the above steps are repeated, and the UAV video is resampled, segmented, dynamically scheduled and rendered, loaded and displayed according to preset requirements.
  2. 根据权利要求1所述的方法,其中,所述预设分级规则由公式(1)~(3)确定:The method according to claim 1, wherein the preset classification rules are determined by formulas (1) to (3):
    H i=360×P i   (1), H i =360×P i (1),
    Figure PCTCN2022136213-appb-100001
    Figure PCTCN2022136213-appb-100001
    Figure PCTCN2022136213-appb-100002
    Figure PCTCN2022136213-appb-100002
    其中,W是所述无人机视频的原始宽度像素值,H是所述无人机视频的原始高度像素值,P i(i=1,2,3…)是所述无人机视频金字塔模型的等级,H i是P i级所述无人机视频金字塔模型的对应的视频高度像素值,W i是P i级无人机视频金字塔模型的对应的视频宽度像素值,R是无人机图像的实际分辨率,α是无人机的垂直视场角,Height是无人机的当前飞行高度。 Wherein, W is the original width pixel value of the UAV video, H is the original height pixel value of the UAV video, P i (i=1,2,3...) is the UAV video pyramid The level of the model, H i is the corresponding video height pixel value of the P i level UAV video pyramid model, W i is the corresponding video width pixel value of the P i level UAV video pyramid model, and R is unmanned The actual resolution of the drone image, α is the vertical field of view of the drone, and Height is the current flying height of the drone.
  3. 根据权利要求1所述的方法,其中,所述无人机视频金字塔模型的属性信息包括所述无人机视频金字塔模型的等级和与等级相对应的分辨率;The method according to claim 1, wherein the attribute information of the unmanned aerial vehicle video pyramid model includes the level of the unmanned aerial vehicle video pyramid model and the resolution corresponding to the level;
    其中,所述无人机视频金字塔模型的分辨率从顶端到低端依次增大。Wherein, the resolution of the UAV video pyramid model increases sequentially from the top to the bottom.
  4. 根据权利要求1所述的方法,其中,所述视频服务器响应于所述无人机视频加载请求对无人机视频进行重采样和切分处理包括:The method according to claim 1, wherein the video server performs resampling and segmentation processing on the UAV video in response to the UAV video loading request comprising:
    根据所述无人机视频加载请求,确定所述无人机视频的分辨率和等级;According to the UAV video loading request, determine the resolution and level of the UAV video;
    根据所述场景视图相机的可视范围坐标,利用所述无人机视频的分辨率和等级,确定所述可视范围坐标的外包矩形所在的无人机视频金字塔模型的四叉树切分范围;According to the visible range coordinates of the scene view camera, using the resolution and level of the drone video, determine the quadtree segmentation range of the drone video pyramid model where the outer rectangle of the visible range coordinates is located ;
    根据所述四叉树切分范围,通过所述视频服务器对所述无人机视频进行切分,并将切分出来的无人机视频发送到所述客户端和/或浏览器渲染。According to the quadtree segmentation range, the video server is used to segment the drone video, and the segmented drone video is sent to the client and/or browser for rendering.
  5. 根据权利要求1所述的方法,其中,所述将所述无人机视频按照预设需求进行动态调度与渲染包括:The method according to claim 1, wherein said performing dynamic scheduling and rendering of said UAV video according to preset requirements comprises:
    在所述场景视图相机的视域范围小于当前无人机视频图像范围的情况下,根据所述场景视图相机的位置信息加载部分无人机视频数据。In the case that the field of view of the scene view camera is smaller than the range of the current UAV video image, part of the UAV video data is loaded according to the position information of the scene view camera.
  6. 根据权利要求1所述的方法,其中,所述无人机视频金字塔模型包括360P、720P、1080P、4K以及8K多个分辨率。The method according to claim 1, wherein the drone video pyramid model includes multiple resolutions of 360P, 720P, 1080P, 4K and 8K.
  7. 根据权利要求1所述的方法,其中,所述当前地图的地图瓦片最小单元是256×256或512×512像素。The method according to claim 1, wherein the minimum map tile unit of the current map is 256×256 or 512×512 pixels.
  8. 一种基于无人机视频金字塔模型的动态处理装置,包括:A dynamic processing device based on the UAV video pyramid model, comprising:
    获取模块,用于获取场景视图相机的视点距离和视域范围,通过客户端查询当前地图的等级,并根据地图等级和地图分辨率之间的对应关系以及所述场景视图相机的视点距离,得到所述当前地图的分辨率;The obtaining module is used to obtain the viewpoint distance and the field of view range of the scene view camera, query the level of the current map through the client, and obtain the resolution of the current map;
    模型构建模块,用于根据所述当前地图的分辨率、无人机飞行参数以及预设分级规则,构建无人机视频金字塔模型,其中,所述无人机视频金字塔模型包括多个等级,每个所述无人机视频金字塔模型的等级对应一个分辨率;A model construction module, configured to construct a UAV video pyramid model according to the resolution of the current map, UAV flight parameters and preset classification rules, wherein the UAV video pyramid model includes multiple levels, each The level of each described unmanned aerial vehicle video pyramid model corresponds to a resolution;
    计算模块,用于根据所述场景视图相机的视域范围和无人机视频范围计算所述场景视图相机的视域范围内的视频范围;A calculation module, configured to calculate the video range within the field of view of the scene view camera according to the field of view of the scene view camera and the video range of the drone;
    请求发送模块,用于通过所述客户端向视频服务器发送无人机视频加载请求,其中,所述无人机视频加载请求包括所述无人机视频金字塔模型的属性信息和所述场景视图相机的视域范围内的视频范围;A request sending module, configured to send a UAV video loading request to a video server through the client, wherein the UAV video loading request includes attribute information of the UAV video pyramid model and the scene view camera The range of video within the field of view of ;
    视频动态处理模块,用于通过所述客户端接收所述视频服务器响应于所述无人机视频加载请求对无人机视频进行重采样和切分处理后的无人机视频,并将处理后的无人机视频在所述客户端进行动态调度与渲染、加载和显示;The video dynamic processing module is used to receive, through the client, the UAV video after the UAV video has been resampled and segmented by the video server in response to the UAV video loading request, and processed The UAV video is dynamically scheduled and rendered, loaded and displayed at the client;
    视频按需处理模块,用于在所述场景视图相机的视点距离发生变化的情况下,重复上述步骤,将所述无人机视频按照预设需求进行重采样、切分、动态调度与渲染、加载和显示。The video on-demand processing module is used to repeat the above steps when the viewpoint distance of the scene view camera changes, and perform resampling, segmentation, dynamic scheduling and rendering of the drone video according to preset requirements, Load and display.
  9. 一种电子设备,包括:An electronic device comprising:
    一个或多个处理器;one or more processors;
    存储装置,用于存储一个或多个程序,storage means for storing one or more programs,
    其中,当所述一个或多个程序被所述一个或多个处理器执行时,使得所述一个或多个处理器执行根据权利要求1~7中任一项所述的方法。Wherein, when the one or more programs are executed by the one or more processors, the one or more processors are made to execute the method according to any one of claims 1-7.
  10. 一种计算机可读存储介质,其上存储有可执行指令,该指令被处理器执行时使处理器执行根据权利要求1~7中任一项所述的方法。A computer-readable storage medium, on which executable instructions are stored, and the instructions, when executed by a processor, cause the processor to perform the method according to any one of claims 1-7.
PCT/CN2022/136213 2022-08-17 2022-12-02 Dynamic processing method and apparatus based on unmanned aerial vehicle video pyramid model WO2023066412A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202210984288.5A CN115065867B (en) 2022-08-17 2022-08-17 Dynamic processing method and device based on unmanned aerial vehicle video pyramid model
CN202210984288.5 2022-08-17

Publications (1)

Publication Number Publication Date
WO2023066412A1 true WO2023066412A1 (en) 2023-04-27

Family

ID=83207880

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2022/136213 WO2023066412A1 (en) 2022-08-17 2022-12-02 Dynamic processing method and apparatus based on unmanned aerial vehicle video pyramid model

Country Status (2)

Country Link
CN (1) CN115065867B (en)
WO (1) WO2023066412A1 (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115065867B (en) * 2022-08-17 2022-11-11 中国科学院空天信息创新研究院 Dynamic processing method and device based on unmanned aerial vehicle video pyramid model

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102201115A (en) * 2011-04-07 2011-09-28 湖南天幕智能科技有限公司 Real-time panoramic image stitching method of aerial videos shot by unmanned plane
CN107742276A (en) * 2017-03-27 2018-02-27 苏州星宇测绘科技有限公司 One kind is based on the quick processing system of the airborne integration of unmanned aerial vehicle remote sensing image and method
CN108536863A (en) * 2018-04-20 2018-09-14 曜宇航空科技(上海)有限公司 Selection area update method and system in a kind of map based on unmanned plane
US20200034620A1 (en) * 2016-08-05 2020-01-30 Neu Robotics, Inc. Self-reliant autonomous mobile platform
CN111192362A (en) * 2019-12-17 2020-05-22 武汉理工大学 Virtual compound eye system for real-time acquisition of dynamic three-dimensional geographic scene and working method thereof
CN114511661A (en) * 2022-01-21 2022-05-17 北京百度网讯科技有限公司 Image rendering method and device, electronic equipment and storage medium
CN115065867A (en) * 2022-08-17 2022-09-16 中国科学院空天信息创新研究院 Dynamic processing method and device based on unmanned aerial vehicle video pyramid model

Family Cites Families (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN100547594C (en) * 2007-06-27 2009-10-07 中国科学院遥感应用研究所 A kind of digital globe antetype system
US8509565B2 (en) * 2008-12-15 2013-08-13 National Tsing Hua University Optimal multi-resolution blending of confocal microscope images
DE102010008630A1 (en) * 2010-02-18 2011-08-18 ESW GmbH, 22880 Method for processing multi-channel image recordings for the detection of hidden objects in opto-electronic person control
CN103595974B (en) * 2013-12-01 2016-09-28 北京航空航天大学深圳研究院 A kind of video geographic information system towards metropolitan area and method
CN108052642A (en) * 2017-12-22 2018-05-18 重庆邮电大学 Electronic Chart Display method based on tile technology
CN110706166B (en) * 2019-09-17 2022-03-18 中国科学院空天信息创新研究院 Image super-resolution reconstruction method and device for sharpening label data
CN111586360B (en) * 2020-05-14 2021-09-10 佳都科技集团股份有限公司 Unmanned aerial vehicle projection method, device, equipment and storage medium
CN113916136A (en) * 2021-11-19 2022-01-11 招商局重庆交通科研设计院有限公司 High-rise structure dynamic displacement measurement method based on unmanned aerial vehicle aerial photography

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102201115A (en) * 2011-04-07 2011-09-28 湖南天幕智能科技有限公司 Real-time panoramic image stitching method of aerial videos shot by unmanned plane
US20200034620A1 (en) * 2016-08-05 2020-01-30 Neu Robotics, Inc. Self-reliant autonomous mobile platform
CN107742276A (en) * 2017-03-27 2018-02-27 苏州星宇测绘科技有限公司 One kind is based on the quick processing system of the airborne integration of unmanned aerial vehicle remote sensing image and method
CN108536863A (en) * 2018-04-20 2018-09-14 曜宇航空科技(上海)有限公司 Selection area update method and system in a kind of map based on unmanned plane
CN111192362A (en) * 2019-12-17 2020-05-22 武汉理工大学 Virtual compound eye system for real-time acquisition of dynamic three-dimensional geographic scene and working method thereof
CN114511661A (en) * 2022-01-21 2022-05-17 北京百度网讯科技有限公司 Image rendering method and device, electronic equipment and storage medium
CN115065867A (en) * 2022-08-17 2022-09-16 中国科学院空天信息创新研究院 Dynamic processing method and device based on unmanned aerial vehicle video pyramid model

Also Published As

Publication number Publication date
CN115065867B (en) 2022-11-11
CN115065867A (en) 2022-09-16

Similar Documents

Publication Publication Date Title
CN108648269B (en) Method and system for singulating three-dimensional building models
US10657714B2 (en) Method and system for displaying and navigating an optimal multi-dimensional building model
WO2020073936A1 (en) Map element extraction method and apparatus, and server
AU2014315181B2 (en) Estimating depth from a single image
US20230053462A1 (en) Image rendering method and apparatus, device, medium, and computer program product
US20220319046A1 (en) Systems and methods for visual positioning
US20150213590A1 (en) Automatic Pose Setting Using Computer Vision Techniques
US9704282B1 (en) Texture blending between view-dependent texture and base texture in a geographic information system
US11809487B2 (en) Displaying objects based on a plurality of models
US10235800B2 (en) Smoothing 3D models of objects to mitigate artifacts
WO2023066412A1 (en) Dynamic processing method and apparatus based on unmanned aerial vehicle video pyramid model
KR101764063B1 (en) Method and system for analyzing and pre-rendering of virtual reality content
WO2023231793A1 (en) Method for virtualizing physical scene, and electronic device, computer-readable storage medium and computer program product
CN110570441A (en) Ultra-high definition low-delay video control method and system
CN110196638B (en) Mobile terminal augmented reality method and system based on target detection and space projection
CN110662099B (en) Method and device for displaying bullet screen
CN113312979B (en) Image processing method and device, electronic equipment, road side equipment and cloud control platform
US11978160B2 (en) GPU-based digital map tile generation method and system
US20230065027A1 (en) Gpu-based digital map tile generation method and system
WO2023111843A1 (en) System and method for defining a virtual data capture point
WO2023224627A1 (en) Face-oriented geometry streaming
CN117390033A (en) Method and device for generating vector tiles by adapting dream database
CN117009607A (en) Method and device for generating vector tiles by thinning vector elements
CN117911498A (en) Pose determination method and device, electronic equipment and storage medium
CN114170068A (en) Image special effect generation method and device, electronic equipment and storage medium

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 22883016

Country of ref document: EP

Kind code of ref document: A1