CN109587401B

CN109587401B - Electronic cloud deck multi-scene shooting implementation method and system

Info

Publication number: CN109587401B
Application number: CN201910001096.6A
Authority: CN
Inventors: 欧俊文
Original assignee: Ava Electronic Technology Co Ltd
Current assignee: Ava Electronic Technology Co Ltd
Priority date: 2019-01-02
Filing date: 2019-01-02
Publication date: 2021-10-08
Anticipated expiration: 2039-01-02
Also published as: CN109587401A

Abstract

The invention provides a method and a system for realizing multi-scene shooting of an electronic cloud deck, wherein a front-end device receives an editing instruction of a user host end, obtains a plurality of scene video images with specific resolution by cutting and zooming operations in an original video image with high resolution according to the editing instruction, splices the scene video images with the specific resolution into one video image by splicing, sends a non-coded video image to the user host end through 1 physical transmission line, separates out the video images with different scenes by the user host end according to splicing position information, and generates a high-quality video playing file or streaming media data by once coding after carrying out image data processing on the video images with the different scenes by the user host end. The transmission line bandwidth is efficiently utilized, the high-quality video playing file is generated by one-time coding, the system is simple and convenient to install, operate and maintain, the processing and transmission efficiency of the whole data is improved, and the whole cost of the system is effectively reduced.

Description

Electronic cloud deck multi-scene shooting implementation method and system

Technical Field

The invention belongs to the technical field of video monitoring and recording and broadcasting, and particularly relates to a method and a system for realizing multi-scene shooting of an electronic holder.

Background

In the current implementation scheme of the normalized recording and broadcasting classroom, at least more than 2 scenes, such as teacher close-up, teacher panorama, blackboard-writing close-up, etc., are often required to be simultaneously shot by using one camera to simplify the deployment.

The current multi-scene shooting implementation scheme is that a cradle head network camera is used for cutting collected images, after a user control instruction of a recording and broadcasting host is received, the cradle head network camera codes a specific shooting area and transmits the coded video to the recording and broadcasting host through an IP network, and the recording and broadcasting host receives the coded video, decodes and displays the coded video or processes other images, and codes the coded video again to generate video files or streaming media data. However, there are 2 or more encoding and decoding processes in the transmission process, which brings irreversible quality reduction to the video image, and in the use scenes such as recording and broadcasting classroom, the viewing experience of audiences such as students and teachers is seriously affected due to the reduction of the video quality.

In order to solve the above technical problems, those skilled in the art propose to assemble more than two cameras in one device housing to make the camera device become an integral appearance, and different cameras are responsible for shooting different scenes, for example, two cameras respectively shoot close-up and panoramic pictures of teachers, and transmit non-coded shot video data to a recording and broadcasting host through more than 2 physical transmission lines. And after receiving the non-coded video and displaying or carrying out other image processing, the recording and broadcasting host encodes the non-coded video to generate a video file or streaming media data. The technical problems existing in the mode are that the number of shot scenes is limited by the number of physical lenses, the expansibility is poor in practical application, scene non-coded video data shot by the multiple camera lenses needs to be sent to a recording and broadcasting host through multiple physical transmission lines, the engineering construction is complex, fault nodes are increased undoubtedly in the operation and maintenance of equipment, and the operation and maintenance difficulty is increased.

In view of this, those skilled in the art propose to send original video image information acquired by 1 camera to the recording and broadcasting host, and the recording and broadcasting host finishes cutting a video picture at the recording and broadcasting host according to a user operation instruction, so as to realize scene shooting of a panoramic picture and a close-up picture. However, the video camera sends complete image information to the recording and broadcasting host, but the recording and broadcasting host only needs to shoot partial area of the image, which causes great bandwidth waste on transmission resources and is easy to meet the bandwidth limitation of the transmission link, and if the 4K or 8K resolution video camera is adopted, the bandwidth of the transmission link becomes a bottleneck.

Disclosure of Invention

In order to solve the technical problem, the invention provides a method and a system for realizing multi-scene shooting of an electronic holder.

The following presents a simplified summary in order to provide a basic understanding of some aspects of the disclosed embodiments. This summary is not an extensive overview and is intended to neither identify key/critical elements nor delineate the scope of such embodiments. Its sole purpose is to present some concepts in a simplified form as a prelude to the more detailed description that is presented later.

The invention adopts the following technical scheme:

in some optional embodiments, a method for implementing multi-scene shooting by an electronic pan-tilt is provided, including:

each scene processing module of the front-end equipment obtains a plurality of different scene video images in the original video image through cutting and zooming operations according to the operation instructions corresponding to the different scene video images;

and the front-end equipment splices the video images of different scenes into an integral video image and sends the integral video image to a user host.

In some optional embodiments, the method further comprises, before:

the front-end equipment collects an original video image;

the front-end equipment preprocesses the original video image and optimizes the pixel quality of the original video image;

and the front-end equipment analyzes the editing instruction to obtain the operation instructions corresponding to the video images of different scenes.

In some optional embodiments, the editing instruction is from the user host side or is formed by presetting in the front-end equipment by the user host side;

the editing instruction comprises a cutting instruction and zooming instruction information which are specified aiming at the shooting requirements of different scenes.

In some optional embodiments, the multiple different scene video images obtained by the front-end device through the cropping and scaling operations in the original video image are specific resolution images, and the resolution of the original video image captured by the front-end device is higher than the specific resolution of the multiple different scene video images obtained by the front-end device through the cropping and scaling operations in the original video image.

In some optional embodiments, the front-end device records stitching position information while stitching the video images of different scenes into an integral video image, and sends the stitching position information to the user host;

the video images of different scenes which are adjacent up, down, left and right are sequenced and spliced according to a certain rule, and the splicing position information is used for marking the coordinates and the sizes of the video images of the different scenes in the whole video image.

In some optional embodiments, the method for implementing multi-scene shooting by an electronic pan-tilt further includes:

the user host end receives the whole video image uploaded by the front-end equipment, and separates a plurality of different scene video images with specific resolution according to the whole video image;

and the user host end edits the video images of different scenes with the specific resolutions and encodes the video images to form a video playing file or streaming media data.

In some optional embodiments, the user host receives the entire video image uploaded by the front-end device and simultaneously receives the splicing position information uploaded by the front-end device; and the user host separates a plurality of different scene video images with specific resolution according to the whole video image and the splicing position information.

In some optional embodiments, the user host performs scene name tagging on different scene video images while separating out a plurality of different scene video images with specific resolutions.

In some optional embodiments, the present invention further provides an electronic pan-tilt multi-scene shooting system, including: a front-end device; the front-end device includes:

the scene image editing unit is used for obtaining a plurality of different scene video images in the original video image through cutting and zooming operations according to the operation instructions corresponding to the different scene video images;

the image splicing unit is used for splicing the video images of different scenes into an integral video image;

the data sending unit is used for sending the whole video image to a user host end;

wherein the scene image editing unit includes: and each scene processing module acquires editing areas of different scenes according to the operation instructions corresponding to the video images of the different scenes.

In some optional embodiments, the front-end device further comprises:

the video image acquisition unit is used for acquiring an original video image;

the image signal preprocessing unit is used for preprocessing the original video image and optimizing the pixel quality of the original video image;

and the editing instruction processing unit is used for analyzing the editing instruction to obtain the operation instructions corresponding to the video images of different scenes.

In some optional embodiments, the edit instruction processing unit includes:

the editing instruction receiving submodule is used for acquiring an editing instruction from the user host side or an editing instruction formed by presetting the user host side in the front-end equipment;

the editing instruction analysis submodule is used for analyzing the editing instruction and sending the operation instruction corresponding to the video images of different scenes to the corresponding scene processing module;

In some optional embodiments, the scene processing module comprises:

the image area cutting submodule is used for receiving a cutting instruction obtained after the editing instruction processing unit analyzes the cutting instruction, determining an area to be cut in the preprocessed original video image and cutting out a corresponding scene video image area;

and the image area zooming submodule is used for receiving a zooming instruction obtained after the editing instruction processing unit analyzes the zooming instruction, zooming the scene video image cut out by the image area cutting submodule and adjusting the scene video image to a specific resolution.

In some optional embodiments, the image stitching unit includes:

the image splicing submodule is used for sequencing and splicing a plurality of different scene video images which are adjacent up, down, left and right according to a certain rule;

and the splicing position information recording submodule is used for recording splicing position information, and the splicing position information is used for marking the coordinates and the sizes of the video images of the plurality of different scenes in the whole video image.

In some optional embodiments, the electronic pan-tilt multi-scene shooting system further includes: a user host side; the subscriber host side includes:

the data receiving unit is used for receiving the whole video image uploaded by the front-end equipment;

the image separation unit is used for separating a plurality of different scene video images with specific resolution according to the whole video image;

and the image coding unit is used for editing the different scene video images with the specific resolutions and coding to form a video playing file or streaming media data.

In some optional embodiments, the data receiving unit receives the entire video image uploaded by the front-end device and receives the splicing position information uploaded by the front-end device at the same time; and the image separation unit separates a plurality of different scene video images with specific resolution according to the whole video image and the splicing position information.

In some optional embodiments, the image separating unit comprises: and the scene marking submodule is used for marking the scene names of the separated video images of different scenes.

In some optional embodiments, the front-end device sends non-coded multi-scene shooting video data to the user host through 1 physical transmission line; wherein, the physical transmission line includes but is not limited to network cable, coaxial cable, HDMI wire, and the transmission technology includes but is not limited to 3G-SDI, HDBaseT, HDMI.

The invention has the following beneficial effects: the method has the advantages that multiple scene shooting is flexibly realized under the condition of using a single physical lens, non-coded multi-scene shooting video data are sent to a user host end only through 1 physical transmission line, high-quality video files or streaming media data are generated by once coding after image data are processed at the user host end, multi-scene shooting is flexibly realized, the bandwidth of the transmission line is efficiently utilized, high-quality video playing files are generated by once coding, the system is simple and convenient to install, operate and maintain, the processing and transmission efficiency of the whole data is improved, and the whole cost of the system is effectively reduced.

For the purposes of the foregoing and related ends, the one or more embodiments include the features hereinafter fully described and particularly pointed out in the claims. The following description and the annexed drawings set forth in detail certain illustrative aspects and are indicative of but a few of the various ways in which the principles of the various embodiments may be employed. Other benefits and novel features will become apparent from the following detailed description when considered in conjunction with the drawings and the disclosed embodiments are intended to include all such aspects and their equivalents.

Drawings

Fig. 1 is a schematic flow chart of an implementation method of multi-scene shooting of an electronic pan-tilt in one embodiment of the invention;

fig. 2 is a schematic flow chart of an implementation method of multi-scene shooting by an electronic pan-tilt in another embodiment of the invention;

FIG. 3 is a schematic diagram of the present invention for obtaining multiple video images of a scene through a cropping operation;

FIG. 4 is a schematic diagram of the present invention with a zoom operation to bring a close-up scene image to a particular resolution;

FIG. 5 is a diagram illustrating a zoom operation to achieve a specific resolution for a panoramic scene image according to the present invention;

FIG. 6 is a schematic diagram of the present invention implementing the stitching of multiple scene video images through a stitching operation;

FIG. 7 is a schematic diagram of an electronic pan/tilt head multi-scene shooting system according to an embodiment of the invention;

fig. 8 is a schematic diagram of the electronic pan-tilt multi-scene shooting system in another embodiment of the invention.

Detailed Description

The following description and the drawings sufficiently illustrate specific embodiments of the invention to enable those skilled in the art to practice them. Other embodiments may incorporate structural, logical, electrical, process, and other changes. The examples merely typify possible variations. Individual components and functions are optional unless explicitly required, and the sequence of operations may vary. Portions and features of some embodiments may be included in or substituted for those of others. The scope of embodiments of the invention encompasses the full ambit of the claims, as well as all available equivalents of the claims.

In some illustrative embodiments, as shown in fig. 1, a method for implementing multi-scene shooting by an electronic pan-tilt is provided, including:

101: the front-end equipment collects original video images.

The front-end device may be a high-resolution camera. The original video image is acquired by a high-resolution camera, and preferably, the camera is selected to be a camera with image resolution supporting 1600 ten thousand pixels.

102: the front-end equipment preprocesses the original video image and optimizes the pixel quality of the original video image.

The optimization comprises image denoising, image compensation, image enhancement and the like.

103: and the front-end equipment analyzes the editing instruction to obtain operation instructions corresponding to the video images of different scenes.

The editing instruction is from the user host end or is formed by presetting the user host end in the front-end equipment. Therefore, the front-end equipment can edit the optimized video image in real time according to the instruction of the user host end, and can process the optimized video image according to the preset editing instruction, so that the method is suitable for various processing requirements and is more diversified.

The editing instruction includes a cropping instruction and zoom instruction information specified by shooting requirements for different scenes. Therefore, the front-end device receives the editing instruction sent by the user host, analyzes the editing instruction to obtain a cutting instruction and a zooming instruction corresponding to a plurality of scenes respectively, and obtains a plurality of scene video images with specific resolutions through cutting and zooming in the optimized video images.

104: and each scene processing module of the front-end equipment obtains a plurality of different scene video images in the original video image through cutting and zooming operations according to the operation instructions corresponding to the different scene video images.

Specifically, when the front-end device obtains operation instructions corresponding to video images of different scenes, the optimized video images are input into the plurality of scene processing modules. Each scene processing module obtains a plurality of different scene video images after the pre-processed original video image is cut and zoomed according to the editing instruction of the corresponding scene, namely the front-end equipment executes the editing instruction, and the original video image is cut and zoomed according to the information of the cutting area and the zooming area to obtain a plurality of different scene video image information with specific resolution.

105: and the front-end equipment splices the video images of different scenes into an integral video image.

Specifically, the front-end device sequences and splices a plurality of different scene video images which are adjacent up, down, left and right according to a certain rule. The rules include: single row multiple column, single column multiple row or multiple column multiple row.

In some illustrative embodiments, the front-end device records the splicing position information while splicing video images of different scenes into a whole video image, and sends the splicing position information to the user host, so that the user host can conveniently perform separation operation on the whole video image, and the accuracy and the separation efficiency of the separation operation are improved. When the splicing rule is a default rule, the front-end device may not send the splicing position information to the user host.

The stitching location information is used to mark coordinates and dimensions of the plurality of different scene video images in the overall video image.

106: and the front-end equipment transmits the spliced whole video image to the user host side, wherein the transmitted whole video image is non-coded. And when the splicing rule is not the default rule, the front-end equipment sends the spliced whole video image and the splicing position information to the user host side.

In some illustrative embodiments, the front-end device sends the required non-coded multi-scene shooting video data to the user host through 1 physical transmission line in the process of sending the data to the user host. The physical transmission line includes but is not limited to a network cable, a coaxial cable, an HDMI wire, and other physical lines; the transmission technology includes but is not limited to non-coding video image transmission technologies such as 3G-SDI, HDBaseT, HDMI, etc.

In some illustrative embodiments, the video images of the plurality of different scenes obtained by the front-end device through the cropping and scaling operations in the original video image are images of a specific resolution, which may be 200 ten thousand pixels. In order to meet the requirement that after the original video image is cut and zoomed, the scene video image reaches a specific resolution, the resolution of the original video image collected by the front-end equipment is higher than the specific resolution of a plurality of different scene video images obtained by the equipment through cutting and zooming operations in the original video image.

In some illustrative embodiments, as shown in fig. 2, the method for implementing multi-scene shooting by an electronic pan-tilt of the present invention further includes:

107: and the user host end receives the whole video image uploaded by the front-end equipment.

In some illustrative embodiments, the user host receives the entire video image uploaded by the front-end device and simultaneously receives the splicing position information uploaded by the front-end device, so that the user host can perform separation operation on the entire video image conveniently, and the accuracy and separation of the separation operation are improved. When the splicing rule is a default rule, the front-end device may not send the splicing position information to the user host.

108: the user host separates a plurality of different scene video images with specific resolution according to the whole video image.

And when the splicing rule is a default rule, the front-end equipment separates the whole video image into a plurality of different scene video images with specific resolution according to the default splicing position information.

And when the splicing rule is not the default rule, the front-end equipment separates the whole video image into a plurality of different scene video images with specific resolution according to the splicing position information uploaded by the front-end equipment.

In some illustrative embodiments, when the user host separates a plurality of different scene video images with specific resolutions, scene name labeling is performed on the different scene video images, so that the user host can call the different scene video images as required, subsequent processing operation is facilitated, and data processing efficiency is improved.

109: the user host side edits a plurality of different scene video images with specific resolution, specifically edits the plurality of scene video images according to a resource mode or a movie mode. For example: and separating a plurality of scene video images such as teacher close-up, teacher panorama and student panorama from the spliced whole video image according to the received splicing position information.

110: the user host side processes the image data of the video images of a plurality of different scenes according to the use requirement, and then forms a video playing file or streaming media data through once coding.

Therefore, the electronic pan-tilt multi-scene shooting realization method can realize the shooting of various scenes under the condition of a single physical lens, simultaneously send the required non-coding multi-scene shooting video data to the user host end through 1 physical transmission line, and generate high-quality video files or streaming media data by once coding after processing the image data at the user host end.

As shown in fig. 3, the original video image size collected by the front-end device is X pixels long and Y pixels wide. And determining two scene video images, namely a close-up scene and a panoramic scene, in the original video image according to the cutting instruction of the editing instruction.

The close-up scene mainly shoots close-ups of the faces of people, marked as A in figure 3, the panoramic scene is determined according to the cutting instruction of the editing instruction, and the rectangular parameters of the cutting area are expressed as: the pixel coordinates of the image starting point are (x1, y1), the image length is a pixel points, and the image width is b pixel points.

The panoramic scene mainly shoots a multi-person picture, a reference mark B in figure 3 determines the panoramic scene according to a cutting instruction of an editing instruction, and rectangular parameters of a cutting area are represented as follows: the pixel coordinates of the image starting point are (x2, y2), the image length is c pixel points, and the image width is d pixel points.

As shown in fig. 4 and 5, the front-end device obtains a close-up scene image and a panoramic scene image according to the cropping instruction, the length and width pixels of the two scene images are (a, b), (c, d), respectively, and the length and width pixels of the two images are respectively scaled and adjusted to (e, f), (e, f) according to the scaling instruction of the editing instruction.

As shown in fig. 6, the front-end device obtains two zoomed close-up scene images and panoramic scene images according to the zoom instruction, the length and width pixels of the two scene images are (e, f) and (e, f), respectively, and splices the two images in an up-down arrangement manner to generate a video image, the length and width pixels of the video image are (e,2f), and records splice position information, that is, the splice position information is: the upper (e, f) pixel area in the video image is a close-up scene image, and the lower (e, f) pixel area is a panoramic scene image.

When the user host separates the whole video image uploaded by the front-end device, the splicing operation in fig. 6 is performed in reverse, specifically: according to the known splicing arrangement rule, a pixel area with the length and the width as (e, f) is taken as a unit, two video images are cut and separated from top to bottom, and the two video images are respectively marked as a close-up scene video image and a panoramic scene video image.

In some illustrative embodiments, as shown in fig. 7, the present invention provides an electronic pan/tilt multi-scene shooting system comprising: a front-end device. The front-end device may be a high-resolution camera. The original video image is acquired by a high-resolution camera, and preferably, the camera is selected to be a camera with image resolution supporting 1600 ten thousand pixels.

The front-end equipment comprises: the system comprises a video image acquisition unit, an image signal preprocessing unit, an editing instruction processing unit, a scene image editing unit, an image splicing unit and a data sending unit.

And the video image acquisition unit is used for acquiring an original video image.

And the image signal preprocessing unit is used for preprocessing the original video image and optimizing the pixel quality of the original video image. The optimization comprises image denoising, image compensation, image enhancement and the like.

And the scene image editing unit is used for obtaining a plurality of different scene video images in the original video image through cutting and zooming operations according to the operation instructions corresponding to the different scene video images. Wherein the scene image editing unit includes: and each scene processing module acquires editing areas of different scenes according to the operation instructions corresponding to the video images of the different scenes. Specifically, when the scene image editing unit obtains operation instructions corresponding to different scene video images, the optimized video images are input into the plurality of scene processing modules. Each scene processing module obtains a plurality of different scene video images after the pre-processed original video image is cut and zoomed according to the editing instruction of the corresponding scene, namely the front-end equipment executes the editing instruction, and the original video image is cut and zoomed according to the information of the cutting area and the zooming area to obtain a plurality of different scene video image information with specific resolution.

and the data sending unit is used for sending the whole video image to the user host side. The transmitted overall video image is non-encoded. And when the splicing rule is not the default rule, the front-end equipment sends the spliced whole video image and the splicing position information to the user host side. In some illustrative embodiments, in the process of sending data to the user host end by the front-end device, the required non-coded multi-scene shooting video data is sent to the user host end through 1 physical transmission line. The physical transmission line includes but is not limited to a network cable, a coaxial cable, an HDMI wire, and other physical lines; the transmission technology includes but is not limited to non-coding video image transmission technologies such as 3G-SDI, HDBaseT, HDMI, etc.

In some illustrative embodiments, the edit instruction processing unit includes: an editing instruction receiving submodule and an editing instruction analyzing submodule.

And the editing instruction receiving sub-module is used for acquiring an editing instruction from the user host end or an editing instruction formed by presetting the user host end in the front-end equipment. Therefore, the front-end equipment can edit the optimized video image in real time according to the instruction of the user host end, and can process the optimized video image according to the preset editing instruction, so that the method is suitable for various processing requirements and is more diversified. The editing instruction comprises a cutting instruction and zooming instruction information which are specified according to shooting requirements of different scenes.

And the editing instruction analysis submodule is used for analyzing the editing instruction and sending the operation instruction corresponding to the video images of different scenes to the corresponding scene processing module. Therefore, the front-end device receives the editing instruction sent by the user host, analyzes the editing instruction to respectively obtain a cutting instruction and a zooming instruction corresponding to a plurality of scenes, and obtains a plurality of scene video images with specific resolutions through cutting and zooming in different scene processing modules after the optimized video images are optimized.

In some demonstrative embodiments, the scene processing module includes: an image area cropping sub-module and an image area scaling sub-module.

And the image area cutting submodule is used for receiving a cutting instruction obtained after the editing instruction processing unit analyzes the cutting instruction, determining an area to be cut in the preprocessed original video image, and cutting out a corresponding scene video image area.

And the image area zooming submodule is used for receiving a zooming instruction obtained after the editing instruction processing unit analyzes the zooming instruction, zooming the scene video image cut out by the image area cutting submodule and adjusting the scene video image to a specific resolution. The specific resolution may be 200 ten thousand pixels. In order to meet the requirement that after the original video image is cut and zoomed, the scene video image reaches a specific resolution, the resolution of the original video image collected by the front-end equipment is higher than the specific resolution of a plurality of different scene video images obtained by the equipment through cutting and zooming operations in the original video image.

In some illustrative embodiments, the image stitching unit includes: an image splicing submodule and a splicing position information recording submodule.

And the image splicing submodule is used for sequencing and splicing a plurality of different scene video images which are adjacent up, down, left and right according to a certain rule. The rules include: single row multiple column, single column multiple row or multiple column multiple row.

And the splicing position information recording submodule is used for recording the splicing position information, and the splicing position information is used for marking the coordinates and the sizes of the video images of the plurality of different scenes in the whole video image. The front-end equipment records the splicing position information while splicing the video images of different scenes into a whole video image, and sends the splicing position information to the user host end, so that the user host end can conveniently separate the whole video image, and the accuracy and the separation efficiency of the separation operation are improved. When the splicing rule is a default rule, the front-end device may not send the splicing position information to the user host.

In some illustrative embodiments, as shown in fig. 8, the electronic pan/tilt multi-scene shooting system of the present invention further includes: the user host side.

The user host end comprises: the image encoding device comprises a data receiving unit, an image separating unit and an image encoding unit.

And the data receiving unit is used for receiving the whole video image uploaded by the front-end equipment.

The data receiving unit receives the whole video image uploaded by the front-end equipment and simultaneously receives the splicing position information uploaded by the front-end equipment, so that the user host can conveniently separate the whole video image, and the accuracy and separation of the separation operation are improved. When the splicing rule is a default rule, the front-end device may not send the splicing position information to the user host.

And the image separation unit is used for separating a plurality of different scene video images with specific resolution according to the whole video image.

And when the splicing rule is a default rule, the image separation unit separates the whole video image into a plurality of different scene video images with specific resolution according to the default splicing position information.

And when the splicing rule is not the default rule, the image separation unit separates the whole video image into a plurality of different scene video images with specific resolution according to the splicing position information uploaded by the front-end equipment.

And the image coding unit is used for editing a plurality of different scene video images with specific resolutions and forming a video playing file or streaming media data through once coding. Specifically, a plurality of scene video images are edited according to a resource mode or a movie mode. For example: and separating a plurality of scene video images such as teacher close-up, teacher panorama and student panorama from the spliced whole video image according to the received splicing position information.

In some illustrative embodiments, the image separating unit includes: and the scene marking submodule is used for marking the scene names of the separated video images of different scenes. The user host side is called as required, so that subsequent processing operation is facilitated, and the data processing efficiency is improved.

Therefore, the electronic pan-tilt multi-scene shooting system can simultaneously realize shooting of multiple scenes under the condition of a single physical lens, simultaneously sends required non-coding multi-scene shooting video data to the user host end through 1 physical transmission line, and generates high-quality video files or streaming media data by once coding after processing the image data at the user host end.

Those of skill would further appreciate that the various illustrative logical blocks, modules, circuits, and algorithm steps described in connection with the embodiments disclosed herein may be implemented as electronic hardware, computer software, or combinations of both. To clearly illustrate this interchangeability of hardware and software, various illustrative components, blocks, modules, circuits, and steps have been described above generally in terms of their functionality. Whether such functionality is implemented as hardware or software depends upon the particular application and design constraints imposed on the overall system. Skilled artisans may implement the described functionality in varying ways for each particular application, but such implementation decisions should not be interpreted as causing a departure from the scope of the present disclosure.

Claims

1. The electronic cloud deck multi-scene shooting implementation method is characterized by comprising the following steps:

the front-end equipment splices the video images of different scenes into an integral video image and sends the integral video image to a user host end through a physical transmission line;

the whole video image is a non-coding video image;

the user host receives the whole video image uploaded by the front-end equipment, and separates a plurality of different scene video images according to the whole video image;

wherein the front-end device is a camera.

2. The method for implementing multi-scene shooting by an electronic pan-tilt according to claim 1, characterized in that the method further comprises the following steps:

the front-end equipment collects an original video image;

3. The method according to claim 2, wherein the editing command is from the user host or is preset in the front-end device by the user host;

4. The method for implementing multi-scene shooting by an electronic pan-tilt according to claim 1 or 3, wherein the multiple different scene video images obtained by the front-end device through the cropping and scaling operations in the original video image are specific resolution images, and the resolution of the original video image collected by the front-end device is higher than the specific resolution of the multiple different scene video images obtained by the front-end device through the cropping and scaling operations in the original video image.

5. The method for implementing multi-scene shooting by an electronic pan-tilt according to claim 4, wherein the front-end device records the splicing position information while splicing the video images of different scenes into a whole video image, and sends the splicing position information to the user host;

6. The electronic pan-tilt multi-scene shooting implementation method according to claim 5,

the user host end receives the whole video image uploaded by the front-end equipment, and separates a plurality of different scene video images with specific resolution in the process of separating a plurality of different scene video images according to the whole video image;

the electronic cloud deck multi-scene shooting implementation method further comprises the following steps:

7. The method for implementing multi-scene shooting by an electronic pan-tilt according to claim 6, wherein the user host receives the whole video image uploaded by the front-end device and simultaneously receives the splicing position information uploaded by the front-end device; and the user host separates a plurality of different scene video images with specific resolution according to the whole video image and the splicing position information.

8. The method for implementing multi-scene shooting by an electronic cloud deck according to claim 7, wherein the user host separates out a plurality of different scene video images with specific resolution and marks scene names for the different scene video images.

9. The electronic pan-tilt multi-scene shooting implementation method according to claim 1,

the front-end equipment splices the video images of different scenes into an integral video image and simultaneously records splicing position information;

and when the splicing rule is not the default rule, the front-end equipment sends the splicing position information to the user host side, and the user host side separates the whole video image into different scene video images according to the splicing position information uploaded by the front-end equipment.

10. Many scenes of electron cloud platform shooting system, its characterized in that includes: the system comprises front-end equipment and a user host end;

the head-end equipment is a camera, the head-end equipment comprising:

the image splicing unit is used for splicing the video images of different scenes into an integral video image; wherein the whole video image is a non-coded video image;

wherein the scene image editing unit includes: the scene processing modules acquire editing areas of different scenes according to operation instructions corresponding to video images of the different scenes;

the subscriber host side includes:

the image separation unit is used for separating a plurality of different scene video images according to the whole video image;

and the front-end equipment transmits the whole video image to the user host end through a physical transmission line.

11. The electronic pan-tilt multi-scene shooting system according to claim 10, wherein the front-end device further comprises:

the video image acquisition unit is used for acquiring an original video image;

12. The electronic pan/tilt multi-scene shooting system according to claim 11, wherein the edit instruction processing unit comprises:

13. The electronic pan/tilt multi-scene shooting system according to claim 12, wherein the scene processing module comprises:

14. The electronic pan/tilt multi-scene shooting system according to claim 13, wherein the image stitching unit comprises:

15. The electronic pan/tilt head multi-scene shooting system according to claim 14, wherein the user host further comprises:

the image coding unit is used for editing the different scene video images with the specific resolutions and coding the different scene video images to form a video playing file or streaming media data;

the image separation unit is also used for separating a plurality of different scene video images with specific resolutions.

16. The electronic pan-tilt multi-scene shooting system according to claim 15, wherein the data receiving unit receives the entire video image uploaded by the front-end device and simultaneously receives the splicing position information uploaded by the front-end device; and the image separation unit separates a plurality of different scene video images with specific resolution according to the whole video image and the splicing position information.

17. The electronic pan/tilt multi-scene shooting system of claim 16, wherein the image separation unit comprises: and the scene marking submodule is used for marking the scene names of the separated video images of different scenes.

18. The electronic pan/tilt multi-scene shooting system according to claim 17, wherein the physical transmission line comprises: net twine, coaxial cable or HDMI wire rod, transmission technology includes: 3G-SDI, HDBaseT, or HDMI.

19. The electronic pan-tilt multi-scene shooting system according to claim 10, wherein the image stitching unit is further configured to record stitching location information;