CN118057809A - Recording and broadcasting system - Google Patents

Recording and broadcasting system Download PDF

Info

Publication number
CN118057809A
CN118057809A CN202211453130.1A CN202211453130A CN118057809A CN 118057809 A CN118057809 A CN 118057809A CN 202211453130 A CN202211453130 A CN 202211453130A CN 118057809 A CN118057809 A CN 118057809A
Authority
CN
China
Prior art keywords
image
camera
recording
definition
close
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202211453130.1A
Other languages
Chinese (zh)
Inventor
陈明良
陈仕同
黄浩成
吴明锐
张欣
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Guangzhou Kaidelian Software Technology Co ltd
Original Assignee
Guangzhou Kaidelian Software Technology Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Guangzhou Kaidelian Software Technology Co ltd filed Critical Guangzhou Kaidelian Software Technology Co ltd
Priority to CN202211453130.1A priority Critical patent/CN118057809A/en
Publication of CN118057809A publication Critical patent/CN118057809A/en
Pending legal-status Critical Current

Links

Landscapes

  • Studio Devices (AREA)

Abstract

The invention discloses a recording and broadcasting system. The recording and broadcasting system provided by the invention comprises a camera connected with a recording and broadcasting host, wherein the camera comprises a first camera, the first camera can further analyze a first high-definition image after shooting the first high-definition image, a first close-range image is intercepted from the first high-definition image according to a first analysis result, meanwhile, the first camera can also determine the image priority, and the first high-definition image, the first close-range image and the image priority are output to the recording and broadcasting host, so that the recording and broadcasting host can determine a target image according to the image priority and output the target image. According to the method and the device, the image priority is determined locally in the first camera, so that the recording and broadcasting host can output and determine the target image according to the image priority after receiving the image priority, and the image switching process is completed. The invention can reduce the delay in picture switching and solve the technical problem of higher delay in picture switching of the recording and broadcasting system.

Description

Recording and broadcasting system
Technical Field
The embodiment of the application relates to the field of information processing, in particular to a recording and broadcasting system.
Background
The current recording and broadcasting system is generally provided with a camera, one panoramic image and one characteristic image are output by the camera, the character movement condition of a fixed area in the panoramic image is detected through a movement detection technology, when the character moves, the panoramic image is switched into a close-up image, and when the character does not move, the panoramic image is kept. However, the close-up image output by this method has poor imaging effect, and the delay is high when the image is switched, the image switching process is abrupt, and the use experience of the user is poor.
In summary, how to reduce the delay of the conventional recording and playing system when switching the picture becomes the technical problem to be solved.
Disclosure of Invention
The embodiment of the invention provides a recording and broadcasting system, which solves the technical problem that the existing recording and broadcasting system has higher delay when switching pictures.
In a first aspect, an embodiment of the present invention provides a recording and playing system, including a recording and playing host, and further including a camera connected to the recording and playing host, where the camera includes a first camera, and the first camera includes a first camera module, and the first camera module is used to capture a first high-definition image in a first area;
the first camera is also used for analyzing the first high-definition image, intercepting a first close-range image from the first high-definition image according to a first analysis result, determining an image priority and sending the first high-definition image, the first close-range image and the image priority to the recording and playing host;
The recording and broadcasting host is used for determining a target image from the images sent by the camera according to the image priority and outputting the target image.
In the foregoing, the recording and broadcasting system provided by the embodiment of the present invention includes a camera connected to a recording and broadcasting host, where the camera includes a first camera, and the first camera can further analyze a first high-definition image after capturing the first high-definition image, and intercept a first close-range image from the first high-definition image according to a first analysis result, and at the same time, the first camera can also determine an image priority, and output the first high-definition image, the first close-range image, and the image priority to the recording and broadcasting host, so that the recording and broadcasting host can determine and output a target image according to the image priority. According to the embodiment of the invention, the image priority is determined locally on the first camera, so that the recording and broadcasting host can output and determine the target image according to the image priority after receiving the image priority, and the image switching process is completed, the recording and broadcasting host is not required to analyze the image sent by the camera and then determine the target image to be output, the delay in picture switching is reduced, and the technical problem that the delay is higher in the picture switching of the conventional recording and broadcasting system is solved.
Drawings
Fig. 1 is a schematic structural diagram of a recording and playing system according to an embodiment of the present invention.
Fig. 2 is a schematic diagram of capturing a first close-range image from a first high-definition image according to an embodiment of the present invention.
Fig. 3 is a schematic structural diagram of another recording and playing system according to an embodiment of the present invention.
Fig. 4 is a schematic structural diagram of another recording and playing system according to an embodiment of the present invention.
Fig. 5 is a schematic structural diagram of a classroom according to an embodiment of the present invention.
Fig. 6 is a schematic diagram of an installation position of a recording and playing system according to an embodiment of the present invention.
Fig. 7 is a schematic diagram of an installation position of a first camera according to an embodiment of the present invention.
Fig. 8 is a schematic view of view angles of a first high-definition image, a first near-field image, a third high-definition image, and a third near-field image according to an embodiment of the present invention.
Fig. 9 is a schematic diagram of gradually switching a picture from a larger angle of view to a smaller angle of view according to an embodiment of the present invention.
Fig. 10 is a schematic diagram of gradually switching a picture from a smaller angle of view to a larger angle of view according to an embodiment of the present invention.
Fig. 11 is a schematic diagram of an eye orientation display module of a teacher according to an embodiment of the present invention.
Reference numerals
Recording and broadcasting host 1, first camera 2, second camera 3, interactive television 4, interactive big board 5, microphone 6, audio amplifier 7, switch 8.
Detailed Description
The following description and the drawings sufficiently illustrate specific embodiments of the application to enable those skilled in the art to practice them. The embodiments represent only possible variations. Individual components and functions are optional unless explicitly required, and the sequence of operations may vary. Portions and features of some embodiments may be included in, or substituted for, those of others. The scope of embodiments of the application encompasses the full ambit of the claims, as well as all available equivalents of the claims. Embodiments may be referred to herein, individually or collectively, by the term "application" merely for convenience and without intending to voluntarily limit the scope of this application to any single application or inventive concept if more than one is in fact disclosed. Relational terms such as first and second, and the like may be used solely to distinguish one entity or action from another entity or action without necessarily requiring or implying any actual such relationship or order between such entities or actions. Moreover, the terms "comprises," "comprising," or any other variation thereof, are intended to cover a non-exclusive inclusion, such that a process, method, or apparatus that comprises a list of elements does not include only those elements but may include other elements not expressly listed. Various embodiments are described herein in a progressive manner, each embodiment focusing on differences from other embodiments, and identical and similar parts between the various embodiments are sufficient to be seen with each other. The structures, products and the like disclosed in the embodiments correspond to the parts disclosed in the embodiments, so that the description is relatively simple, and the relevant parts refer to the description of the method parts.
The recording and broadcasting system provided by the embodiment of the invention comprises a recording and broadcasting host, and further comprises a camera connected with the recording and broadcasting host, wherein the camera comprises a first camera, the first camera comprises a first camera module, and the first camera module is used for shooting a first high-definition image in a first area.
The recording and broadcasting system is a system which can integrate and synchronously record video and audio recorded on site and image signals of electronic equipment to generate standardized streaming media files for live broadcasting, storage, later editing and on-demand. The recording and broadcasting system can be arranged in a classroom, video signals and audio signals of a teacher in the teaching process are recorded by the recording and broadcasting system, the streaming media files in the teaching scene are generated according to the video signals and the audio signals and uploaded to the server, and the subsequent students can watch teaching videos of the teacher by playing the streaming media files in the server, so that the online learning effect is achieved. The streaming media file refers to a media file which is continuously played in real time on a network by adopting a streaming technology.
In this embodiment, as shown in fig. 1, the recording system includes a recording host 1 and a first camera 2 connected to the recording host 1. The recording and broadcasting host is equipment integrating recording and broadcasting functions, is responsible for picking up signals such as audio signals and video signals, analyzes and processes the audio signals and the video signals to generate streaming media files, and can perform live broadcasting, recording and broadcasting, on-demand broadcasting, video synchronous recording and the like. The camera is a device capable of converting an optical image signal into an electrical signal, and specifically, the camera collects light reflected from an object to focus the light on a light receiving surface of an image pickup device, and converts the light into electrical energy through the image pickup device, thereby obtaining an image signal. The first camera in this embodiment includes a first camera module, where the first camera module refers to a device on the first camera for capturing an image signal, and for example, the first camera module may be a group of imaging lenses. The first image capturing module is configured to capture a first high-definition image in a first area, where the high-definition image refers to an image with a high resolution, generally refers to an image with a vertical resolution of 720 or more, and the size of the high-definition image is generally 1280×720, 1920×1080, and the like. In addition, the first area shot by the first shooting module can be set according to actual needs, for example, when a video of a teacher in the teaching process needs to be recorded, the first area can be set as a platform area. In another embodiment, when the first camera module is provided with a rotating device such as a pan-tilt, the first area shot by the first camera module may also be changed along with the rotation of the pan-tilt, and in this embodiment, the specific range of the first area is not limited.
The first camera is also used for analyzing the first high-definition image, intercepting a first close-range image from the first high-definition image according to a first analysis result, determining the image priority and sending the first high-definition image, the first close-range image and the image priority to the recording and playing host.
In this embodiment, after the first camera module captures a first high-definition image, the first camera is further configured to analyze the captured first high-definition image, and intercept a first close-range image from the first high-definition image according to a first analysis result obtained by the analysis. The first close-range image is an image whose screen range is smaller than that of the first high-definition image. For example, when recording a teaching video of a teacher, the first camera module can identify the teacher in the first high-definition image, and after the teacher is identified in the first high-definition image, an image of an area where the teacher is located is taken out from the first high-definition image, and the image is taken as a first close-range image. And then, the first camera can send the first high-definition image and the first close-range image to the recording and broadcasting host.
In addition, in this embodiment, the first camera is further configured to determine an image priority, where the image priority refers to a priority level of an image, and in this embodiment, specifically refers to a priority level of image output. For example, when the first camera determines that the importance level of the first high-definition image is higher than the priority level of the first close-range image at this time according to the first analysis result, the image priority level of the first high-definition image may be set to be higher than the image priority level of the first close-range image. When recording teacher's teaching video, the determination of image priority follows the following rule: when the first high-definition image comprises an image interacted by students and teachers on a podium, setting the priority of the first high-definition image as the highest; when an image in which only a teacher is in a lesson or an image in which only one student is standing to answer a question is included in the first high-definition image, the priority of the first close-range image including the teacher or the student standing to answer the question is set to be highest. For example, if the first area shot by the first camera is a platform area, when the first high-definition image is an image including the interaction between the student and the teacher, the image priority of the first high-definition image is highest; when the first high-definition image is an image which only comprises the teacher in teaching, the image priority of the first close-range image comprising the teacher is highest; in other cases, the image priority of the first close-range image is highest. If the first area shot by the first camera is a student seat area, when the first high-definition image is an image which comprises a plurality of students and teachers which are interacting, the image priority of the first high-definition image is highest; when the first high-definition image is an image including only one student interacting with the teacher, the priority of the first close-range image including the student is highest, and otherwise, the priority of the first close-range image is highest.
For example, when a teaching video of a teacher is recorded, a first camera analyzes a first high-definition image of a shot platform area, when the teacher exists in the platform area according to a first analysis result, the area of the teacher in the first high-definition image is identified, a first close-range image of the area where the teacher exists is intercepted, if the first camera simultaneously analyzes that students exist in the first area in the first high-definition image, and the students interact with the teacher, the first camera judges that the importance degree of the first high-definition image is greater than that of the first close-range image, and the first camera can set the image priority of the first high-definition image to be highest. After determining the image priority, the first camera can send the image priority to the recording and playing host.
The recording and broadcasting host is used for determining a target image from the images sent by the camera according to the image priority and outputting the target image.
In this embodiment, the recording and playing host is configured to receive a first high-definition image, a first close-range image, and an image priority sent by the first camera. And then determining a target image with higher image priority from the received first high-definition image and first near-field image according to the image priority, and outputting the target image as the image which is required to be output currently to finish the switching of the images. For example, the recording and playing host can add the target image to the streaming media file according to the playing time sequence, so that the streaming media file displays the target image when playing to the corresponding position, or switches the current live broadcast picture to the target image.
In the foregoing, the recording and broadcasting system provided by the embodiment of the present invention includes a camera connected to a recording and broadcasting host, where the camera includes a first camera, and the first camera can further analyze a first high-definition image after capturing the first high-definition image, and intercept a first close-range image from the first high-definition image according to a first analysis result, and at the same time, the first camera can also determine an image priority, and output the first high-definition image, the first close-range image, and the image priority to the recording and broadcasting host, so that the recording and broadcasting host can determine and output a target image according to the image priority. According to the embodiment of the invention, the image priority is determined locally on the first camera, so that the recording and broadcasting host can output and determine the target image according to the image priority after receiving the image priority, and the image switching process is completed, the recording and broadcasting host is not required to analyze the image sent by the camera and then determine the target image to be output, the delay in picture switching is reduced, and the technical problem that the delay is higher in the picture switching of the conventional recording and broadcasting system is solved.
On the basis of the embodiment, the first camera is specifically configured to analyze the first high-definition image, obtain state information of each object in the first high-definition image, determine a target object according to the state information of each object, intercept a first close-range image corresponding to the target object from the first high-definition image, and determine an image priority according to the state information of each object.
In one embodiment, the first camera is specifically configured to identify objects on the first high-definition image when analyzing the first high-definition image, and acquire status information of each object from the first high-definition image, where the status information includes information of a current status of the object, for example, a position or an action behavior of the object, and the object may be set according to actual needs. For example, the first camera may identify each face in the first high-definition image using a face recognition algorithm, so as to determine each person on the first high-definition image, and identify the current action behavior or position of each person, so as to obtain the state information of each person.
After the first camera obtains the state information of each object, the first camera can determine the target object in all objects of the first high-definition image according to the state information of each object, and intercept a first close-range image corresponding to the target object in the first high-definition image. The target object may be an object specified in advance by a user or an object satisfying a certain specific condition, and the target object may be an object or a person. For example, after the first camera obtains the position of each person through analysis, the person located at the position designated in advance by the user may be determined according to the position of each person, the person located at the designated position is determined as the target object, and the image of the target object and the surrounding environment thereof is cut out from the first high-definition image, thereby obtaining the first close-range image. In one embodiment, the first close-range image is taken from the first high-definition image as shown in fig. 2, where P1 is the first high-definition image and P2 is the first close-range image in fig. 2.
After the first high-definition image is analyzed to obtain a first analysis result, the first camera further determines the image priority according to the first analysis result. In one embodiment, the first camera may determine whether the image of the first high-definition image is higher in priority or the image of the first close-range image is higher in priority according to the motion of each person or the number of persons in the first analysis result. For example, when the teaching video of the teacher is recorded, the first area is a podium area, the first close-up image is a close-up image of the teacher, and the image priority of the default first close-up image is highest, but when the first camera module detects that the number of people in the first area is two on the first high-definition image, and the actions of the two people are the same, that is, it is stated that there may be a student on the podium at this time, and the teacher and the student are interacting (for example, the teacher and the student are demonstrating the same action), the first camera module may set the image priority of the first high-definition image to be highest. It can be understood that the determination manner of the image priority in this embodiment may be set according to actual needs, and the determination manner of the image priority in this embodiment is not specifically limited.
In the embodiment of the invention, the first camera analyzes the first high-definition image to obtain the first priority, so that the first camera can intercept the first close-range image from the first high-definition image according to the first analysis result, and meanwhile, the image priorities of the first high-definition image and the first close-range image can be determined according to the first analysis result, so that the recording and broadcasting host can directly output the target image after receiving the image priority, and the image transmitted by the camera does not need to be analyzed and then the image required to be output is determined, thereby reducing the delay in picture switching.
On the basis of the above embodiment, the status information includes behavior information and location information.
In one embodiment, the state information includes behavior information of each object and position information of each object. The behavior information refers to actions of a target object, for example, when the object is a person, the behavior information may be hand lifting, standing, sitting, turning, writing on blackboard, eye communication, and the like. The behavior information can be realized by adopting a combination of a face recognition algorithm, a skeleton analysis algorithm, a gesture recognition algorithm, a pupil distance measurement algorithm and the like. The position information includes a position of each object in the first high definition image, and the position information may represent the position of the object by coordinates in a planar coordinate system by establishing the planar coordinate system in the first high definition image.
The first camera is specifically configured to determine a target object according to behavior information of each object, intercept a first close-range image corresponding to the target object from the first high-definition image according to position information of the target object, and determine an image priority according to the behavior information of each object.
The first camera is used for determining a target object according to the behavior information of each object. For example, when a teaching video of a teacher is recorded, if the first area is a podium, after the first camera determines the behavior information of each person, if it is determined that the behavior information of a person is a blackboard writing, it may be determined that the object is opposite to the target object, that is, the teacher. Alternatively, if the first area is a student seat area, when it is determined that the behavior information of a person in the first high-definition graphic is on, the object may be determined to be a target object. After determining the target object according to the behavior information, the first camera determines a region to be intercepted according to the position information of the target object in the first high-definition image, wherein the region can be larger than the region occupied by the target object in the first high-definition image, for example, the region can comprise the whole target object; the area may also be smaller than the area occupied by the target in the first high definition image, e.g. the area may comprise a certain area on the target object (e.g. a face or a hand of a person) or the like.
Meanwhile, the first camera module is also used for determining the image priority according to the behavior information of each object. For example, the first area is a student seating area and the object is a student in the student seating area. When it is determined that the behavior of the plurality of objects is hand lifting at this time based on the behavior information of each object, the image priority of the first high-definition image may be set highest at this time, and the image priority of the first close-range image may be set lower than the first high-definition image. When only one object in the first region behaves as a lifting hand, the image priority of the first close-range image corresponding to the object may be set to be highest, and the image priority of the image of the first high-definition image may be set to be lower than the first close-range image.
In the embodiment of the invention, the first camera analyzes the objects in the first high-definition image to obtain the behavior information of each object, and then the first camera intercepts the first close-range image from the first high-definition image according to the behavior information of each object and determines the image priority, so that the recording and broadcasting host can directly output the target image after receiving the image priority, and the image transmitted by the camera is not required to be analyzed and then the image required to be output is determined, thereby reducing the delay in the process of switching the images.
On the basis of the embodiment, the camera further comprises at least one second camera, each second camera is used for shooting second high-definition images in different second areas, and each second camera is in communication connection with the first camera.
In one embodiment, as shown in fig. 3, the recording and playing system further comprises at least one second camera 3 connected to the recording and playing host 1. Each second camera is used for shooting second high-definition images in different second areas, wherein the second areas are preset areas by a user. It is understood that the second area photographed by each second camera may be set according to actual needs, and the specific position photographed by each second camera is not limited in this embodiment. For example, when recording a teacher's teaching video, a first camera may be provided for capturing a lecture area, while a second camera is provided for capturing a student seating area. In addition, in this embodiment each second camera is communicatively coupled to the first camera. The communication connection means that communication is formed between the first camera and the second camera through signal transmission and interaction, and the communication connection comprises wired connection and wireless connection. For example, the first camera and the second camera may be connected to each other by a wireless communication link through WiFi (WIRELESS FIDELITY ), 4G, bluetooth, or other technologies, or directly by a data line to establish a wired communication link.
Each second camera is further used for analyzing the second high-definition image, intercepting a second close-range image from the second high-definition image according to a second analysis result, sending the second high-definition image and the second close-range image to the recording and broadcasting host, and sending the second analysis result to the first camera.
The first camera is specifically configured to determine an image priority according to the first analysis result and each second analysis result, and send the image priority to the recording and playing host.
In this embodiment, after the second camera captures a second high-definition image of the second area, the second camera also analyzes the second high-definition image to obtain a second analysis result, then intercepts a second close-range image from the second high-definition image according to the second analysis result, and sends the second high-definition image and the second close-range image to the recording and playing host. The specific process may refer to the process of analyzing the first high-definition image by the first camera and cutting out the first close-range image from the first high-definition image, which is not described in detail in this embodiment. The difference from the first camera is that each second camera, after generating the second analysis results, transmits the second analysis results to the first camera via the communication link.
After the first camera receives the second analysis result sent by each second camera, the first camera determines the image priority according to the first analysis result generated by the first camera and the second analysis result sent by each second camera, and sends the image priority to the recording and broadcasting host. For example, the first camera may determine, according to the behavior information of each object in the first analysis result and the behavior information of each object in the second analysis result, a target object that makes a target behavior at this time, and when there is a close-range image (first close-range image or second close-range image) corresponding to the target object, set the image priority of the close-range image to be highest. Otherwise, the image priority of the panoramic image (first panoramic image or second panoramic image) corresponding to the target object is set to be highest. Therefore, after receiving the image priority, the recording and broadcasting host can determine and output the target image with the highest image priority from the first high-definition image and the first close-range image sent by the first camera and the second high-definition image and the second close-range image sent by each second camera.
For example, when recording a teaching video of a teacher, a first area photographed by a first camera is a podium area, a second area photographed by a second camera is a student seat area, and the determination of the image priority follows the following rule: when the first high-definition image or the second high-definition image is an image comprising interaction of students and teachers, the priority of the corresponding high-definition image is highest. In other cases, when the first close-range image is an image including an image in which a teacher is teaching, the image priority of the first close-range image is set highest, and when the second close-range image is an image including an interaction between a student and a teacher, the image priority of the second close-range image is highest.
The overall structure of the recording and playing system is shown in fig. 4, and includes, in addition to the recording and playing host 1, the first video camera 2 and the second video camera 3, an interactive television 4, an interactive board 5, a microphone 6, a sound box 7 and a switch 8, which are connected to the recording and playing host 1. The interactive television 4 is also called as an interactive television, is a new generation television based on digital television and broadband network technology, and can provide a highlight program with high individuation and interactivity. The interaction large board 5 is integrated equipment for controlling the content displayed on the display panel and realizing man-machine interaction operation through a touch technology, and can integrate multiple functions of a projector, an electronic whiteboard, a curtain, sound equipment, a television, a video conference terminal and the like. The microphone 6 is an energy conversion device that converts sound signals into electrical signals, and in fig. 4, the microphone 6 includes an omni-directional microphone for collecting sound of a classroom environment and a wireless microphone for collecting sound of a teacher. The loudspeaker 7 is a device for converting an audio signal into sound. The switch 8 is a network device for forwarding electric (optical) signals, and can provide an exclusive electric signal path for any two accessed network nodes, and the recording and broadcasting host 1 can be connected with a server or a remote device through the switch 8, so that the switch can communicate with the outside.
In one embodiment, the structure of a classroom is shown in FIG. 5. Fig. 6 is a schematic diagram of an installation position of the recording and broadcasting system in a classroom, in which the recording and broadcasting host 1 can be installed on one side of a blackboard so as to be convenient for wiring, the height of the recording and broadcasting host 1 from the ground is 1.5M, the distance from the blackboard is 15CM, the first camera 2 and the second camera 3 are installed in the middle line of the classroom, and the first camera 2 is used for shooting a platform area and is installed on a wall B, as shown in fig. 7. And the second camera 3 is used for shooting a student seat area and is arranged on the wall A, and the distance between the first camera 2 and the second camera 3 and the ground is 2.2M-2.4M. The omnidirectional microphones are arranged on the ceiling of the classroom in tandem, and are arranged in the middle line of the classroom, and the distance from the ground is about 2.5M. The interactive television 4 is installed on the wall D by a suspender or a bracket, the distance from the ground is 2M high, and the sound box 7 and the interactive large plate 5 are installed on the wall A. In this embodiment, when the recording and playing system records the teaching video of the teacher, the first camera 2 actively initiates a connection request to the second camera 3, and the first camera 2 and the second camera 3 form a pair of camera pairs to perform combined shooting. The first camera 2 captures a first high-definition image of the teacher area and analyzes the first high-definition image to obtain a first analysis result (including behavior information of the teacher). And the second camera 3 shoots a second high-definition image of the student seating area, and the second high-definition image is analyzed to obtain a second analysis result (including behavior information of each student). After the second camera 3 obtains the second analysis result, the second analysis result is sent to the first camera 2, and the first camera 2 can determine the image priority according to the behavior information of the teacher in the first analysis result and the behavior information of each student in the second analysis result. For example, when the behavior information of the teacher is speech, the image priority of the first close-range image of the teacher may be set to be highest; when the behavior information of the student is up, the image priority of the second close-range image of the student can be set to be the highest; when the student's behavior information is right look-aside, the image priority of the first close-range image of the teacher may be set to be highest. It can be appreciated that the manner of setting the image priority according to the behavior information may be set according to actual needs. After the subsequent first camera 2 sends the image priority to the recording and broadcasting host 1, the recording and broadcasting host 1 can determine the target image from the images sent by the first camera 2 and the second camera 3 according to the image priority, and then the target image is output to the server for display in a streaming media mode through the switch 8, so that the effect of picture switching is achieved.
In the embodiment of the invention, the second camera is in communication connection with the first camera, and after analyzing the second high-definition image shot by the second camera to obtain a second analysis result, the second camera transmits the second analysis result to the first camera, so that the first camera can determine the image priority according to the first analysis result of the second camera and the second analysis result sent by the second camera, and send the image priority to the recording and playing host. According to the embodiment of the invention, the image analysis process is carried out in each camera in a scattered way, and after the first camera receives the second analysis result of the second camera, the image priority is generated through unified analysis. Therefore, the recording and broadcasting host is not required to analyze the images to determine the target image after receiving the images sent by all cameras, and the delay in picture switching is greatly reduced.
On the basis of the embodiment, the first camera further comprises a second camera module.
The first camera is further used for controlling the second camera module to shoot a third high-definition image in the third area according to the first analysis result, intercepting a third close-range image from the third high-definition image according to the first analysis result, and sending the third high-definition image and the third close-range image to the recording and playing host; the third region is located in the first region and the range of the third region is smaller than the range of the first region.
In one embodiment, the first camera further includes a second camera module. The first camera is further used for controlling the second camera module to shoot a third high-definition image in a third area according to the first analysis result after analyzing the first high-definition image shot by the first camera module to obtain the first analysis result. The third area is located in the first area, and the range of the third area is smaller than that of the first area. For example, when a teaching video of a teacher is recorded, the first camera module of the first camera is used for shooting a first high-definition image of a platform area, and when the first camera analyzes the first high-definition image and recognizes that the current behavior information of the teacher on the platform is speaking, the second camera module can be further controlled to shoot a third high-definition image in the area where the teacher is located. In addition, the second camera module is provided with a rotating device, such as a cradle head. The first camera can control the second camera module to shoot third high-definition images of different third areas by controlling the rotation device to rotate.
After the third high-definition image is obtained, the first camera is further used for capturing a third close-range image from the third high-definition image. For example, when the third high-definition image is an image in the area where the teacher is located, a third close-range image corresponding to the face of the teacher or a third close-range image corresponding to the hand of the teacher may be further extracted from the third high-definition image. The process of capturing the third close-range image from the third high-definition image may refer to the process of capturing the first close-range image from the first high-definition image, which is not described in detail in this embodiment. And then, the first camera can determine the image priority of the first high-definition image, the first close-range image, the third high-definition image and the third close-range image according to the first analysis result, and send the first high-definition image, the first close-range image, the third high-definition image, the third close-range image and the image priority to the recording host, so that the recording host determines the target image from the first high-definition image, the first close-range image, the third high-definition image and the third close-range image according to the received image priority.
In addition, it should be further explained that, when the recording and broadcasting system further includes a second camera, each second camera is also provided with a first camera module and a second camera module, the second camera analyzes the second high-definition image of the second area captured by the first camera module, and after the second analysis result is obtained, the second camera module can be controlled to capture a fourth high-definition image of a certain area in the second area according to the second analysis result, and a fourth near-field image is captured from the fourth high-definition image. And finally, sending the second high-definition image, the second close-range image, the fourth high-definition image and the fourth close-range image to a recording and broadcasting host, and sending the second analysis result to the first camera.
In the embodiment of the invention, the two camera modules are arranged in the first camera, the first camera module is used for shooting a first high-definition image with a larger range, the second camera module is used for shooting a third high-definition image with a smaller range, and the first camera can also extract a first close-range image and a third close-range image from the first high-definition image and the third high-definition image respectively. In the embodiment of the invention, the first camera can output four paths of images, wherein the high-definition images output by the physical camera module and the close-up images intercepted by the algorithm exist, and the shooting range of the second camera module is smaller than that of the first camera module, so that the definition of a close-up picture can be improved, and the use experience of a user is improved.
On the basis of the above-described embodiment, the angles of view of the first high-definition image, the first near-field image, the third high-definition image, and the third near-field image are sequentially reduced.
In one embodiment, the first high definition image, the first near view image, the third high definition image, and the third near view image output by the first camera sequentially decrease in angle of view. In the optical instrument, a lens of the optical instrument is taken as a vertex, and an included angle formed by two edges of the maximum range of the object image of the measured object passing through the lens is called a field angle, and the size of the field angle determines the field range of the optical instrument. That is, in the present embodiment, the ranges of the first high-definition image, the first close-range image, the third high-definition image, and the third close-range image are sequentially reduced, as shown in fig. 8, wherein P1, P2, P3, and P4 are the first high-definition image, the first close-range image, the third high-definition image, and the third close-range image, respectively.
The recording and broadcasting host is also used for determining a transition image with the size of the angle of view between the target image and the currently output image after determining the target image from the images sent by the camera and before outputting the target image, and outputting the transition image in sequence according to the size of the angle of view of the transition image, and outputting the target image when all the transition images are output.
In one embodiment, the recording and playing host is further configured to determine, after determining the target image in the images sent by the first camera, a transition image with a view angle between the target image and the currently output image from the images sent by the first camera before outputting the target image. For example, if the angle of view of the image currently output by the recording and playing host is the same as the angle of view of the first panoramic image, and the target image to be output is the third near-view image, then the recording and playing host can determine that the transition image includes the first near-view image and the third high-definition image. And then determining the output sequence of the transition images according to the size of the field angle of each transition image, and sequentially outputting the transition images according to the output sequence. And outputting the target image after outputting all the transition images. The instant recording and broadcasting host can determine the output sequence of the first close-range image and the third high-definition image according to the size of the field angle, and output the third close-range image after outputting the first close-range image and the third high-definition image according to the output sequence.
On the basis of the embodiment, when the field angle of the target image is smaller than the field angle of the image currently output by the recording and playing host, the recording and playing host is used for determining a first output sequence according to the field angle of the transition image from large to small, and the transition image is sequentially output according to the first output sequence; when the field angle of the target image is larger than the field angle of the image currently output by the recording and playing host, the recording and playing host is used for determining a second output sequence from small to large according to the field angle of the transition image, and the transition image is sequentially output according to the second output sequence.
In one embodiment, when determining the output sequence of the transition image, the recording and playing host first needs to determine the angle of view of the currently output image and the angle of view of the target image, and compare the sizes of the angles of view between the two images. When the field angle of the target image is smaller than the field angle of the image currently output by the recording and broadcasting host, the recording and broadcasting host determines a first output sequence according to the field angle of the transition image from large to small, and when the transition image is output, the recording and broadcasting host outputs the transition image according to the first output sequence. Therefore, in the process of switching pictures by the recording and playing host, the pictures can be gradually switched from a larger field angle to a smaller field angle, and as shown in fig. 9, the zooming process is simulated by displaying the progressive pictures. And when the field angle of the target image is larger than the field angle of the image currently output by the recording and broadcasting host, determining a second output sequence by the recording and broadcasting host according to the field angle of the transition image from small to large, and outputting the transition image according to the second output sequence when outputting the transition image. Therefore, in the process of switching the pictures by the recording and playing host, the pictures can be gradually switched from a smaller view angle to a larger view angle, as shown in fig. 10.
In one embodiment, when the recording and playing system includes the second camera, if the image currently output by the recording and playing host and the target image are both from the same second camera, the recording and playing host may determine the transition image in the image sent by the second camera, and output the transition image before outputting the target image, which is similar to the above-mentioned specific process, and will not be repeated in this embodiment. It can be understood that in this embodiment, when the recording host and the currently output image and the target image are both from the same camera (the first camera or the second camera), the recording host is only used to execute the function of determining the transition image and outputting the target image after outputting the transition image, and when the recording host and the currently output image and the target image are both from different cameras, the screen is directly switched.
Before outputting the target image, the recording and playing host further determines the transition image from the images sent by the camera, and outputs the target image after outputting the transition image according to the output sequence. According to the embodiment of the invention, the transition image is output when the recording and broadcasting host switches the picture, so that the recording and broadcasting host can display the progressive picture when switching the picture, the zooming process is simulated, and the viewing experience of a user is greatly improved.
On the basis of the above embodiment, when the recording and playing system is in the first use state, the first camera is specifically configured to identify the target object from the first high-definition image, and when it is determined that the target object meets the preset condition, the second camera module is controlled to capture a third high-definition image of the area where the target object is located, and cut out a third close-range image from the target area of the third high-definition image, and the image priority of the third close-range image is set to be the highest.
In one embodiment, when the recording and playing system is in the first use state, the first camera is used for analyzing the first high-definition image shot by the first camera module, identifying the target object from the first high-definition image, and determining whether the target object meets the preset condition. The first usage state refers to a state when the recording and playing system is in use of a certain function, for example, when the remote interactive function of the recording and playing system is in use, the recording and playing system can be considered to process the first usage state. The first camera may identify the target object in the first high-definition image using a target identification algorithm during analysis of the first high-definition image. For example, when recording a teacher's teaching video, the target object may be a teacher or a student. Then, the first camera further determines whether the target object in the first high-definition image meets a preset condition, and the preset condition may be preset by a user, for example, the preset condition may be set such that a distance between the target object and a certain object is smaller than a preset distance, or the preset condition is set such that a behavior of the target object is a specified behavior. When the target object meets the preset condition, the first camera controls the second camera to shoot a third high-definition image of the area where the target object is located, then a third close-range image is cut out of the target area of the third high-definition image, and the image priority of the third close-range image is set to be the highest. And the recording and broadcasting host outputs a third close-range image according to the image priority after receiving the image priority sent by the first camera.
On the basis of the embodiment, the recording and broadcasting system further comprises a display module, wherein the display module is connected with the recording and broadcasting host, and the first use state is when the recording and broadcasting system is in communication connection with the remote equipment and performs remote interaction; the display module is used for displaying pictures transmitted by the remote equipment during remote interaction, and the recording and broadcasting host is used for outputting target images to the remote equipment during remote interaction.
In one embodiment, the recording and playing system further includes a display module connected to the recording and playing host, where the display module refers to a device for displaying a certain electronic file on a screen through a specific transmission device, for example, the display module may be an interactive television shown in fig. 4. The first use state of the recording and broadcasting system is a state that the recording and broadcasting system is in communication connection with the remote equipment and performs remote interaction. The remote device refers to an electronic device which performs remote communication or remote control by using radio or electric signals through an Internet access network and the like, and for example, the remote device can be a notebook computer or a mobile phone and the like. The recording host of the recording system can be accessed into the network through the gateway, thereby establishing communication connection with the remote equipment and realizing the function of remote interaction. When the recording and broadcasting system is used for carrying out remote interaction on the remote equipment, the recording and broadcasting host can receive pictures transmitted by the remote equipment through the gateway, display the pictures transmitted by the remote equipment on the interactive television, and simultaneously, the recording and broadcasting host can output target images to the remote equipment.
The first camera is specifically configured to identify a target object from the first high-definition image, perform gesture analysis on the target object to obtain a gesture analysis result, and determine that the target object meets a preset condition when determining that a target area of the target object faces the display module according to the gesture analysis result.
And when the recording and broadcasting system is in a state of performing remote interaction with the remote equipment, the first camera is specifically used for identifying the target object from the first high-definition image, and performing gesture analysis on the target object to obtain a gesture analysis result of the target object. The gesture analysis refers to analysis of the motion gesture of the target object, for example, a skeleton algorithm may be used to perform gesture analysis on the target object. When the target area of the target object faces the display module according to the gesture analysis result, determining that the target object meets the preset condition. At this time, the first camera controls the second camera to shoot a third high-definition image of the area where the target object is located, then, the third close-range image is cut out from the target area of the third high-definition image, and the image priority of the third close-range image is set to be the highest. And the recording and broadcasting host machine outputs a third close-range image to the remote equipment according to the image priority after receiving the image priority sent by the first camera. The target object is a human body, and the target area is an area where eyes of the human body are located. When the recording and broadcasting system performs remote interaction with remote equipment, the first camera is used for shooting a podium area, and when the first camera recognizes a target object located on a podium from a first high-definition image shot by the first camera shooting module, namely after a teacher, the first camera performs gesture analysis on the teacher to judge whether the area where the eyes of the teacher are located is towards the display module, as shown in fig. 11. If yes, the first camera controls the second camera module to shoot a third high-definition image of the area where the teacher is located, a third close-range image of the area where the eyes of the teacher are located is intercepted from the third high-definition image, and the image priority of the third close-range image is set to be the highest. Therefore, the recording and broadcasting host can output the third close-up image comprising the eye feature of the teacher to the remote equipment, and the remote equipment can display the picture of the eye feature of the teacher, thereby creating the effect of looking at the eyes.
In the above embodiment of the present invention, when the recording and broadcasting system performs remote interaction with the remote device, the first camera is further configured to identify the target object from the first high-definition image, and when the target object meets a preset condition, control the second camera module to capture a third high-definition image of an area where the target object is located, and cut out a third close-range image from the target area of the third high-definition image, and set the image priority of the third close-range image to be the highest, so that the recording and broadcasting system can output the third close-range image to the remote device for display. According to the embodiment of the invention, when the recording and broadcasting system and the remote equipment are in remote interaction, the third close-up image comprising the teacher eye close-up picture can be sent to the remote equipment for display, so that the effect that both parties are looking at is created in the interaction process, and the use experience of a user is improved.
Note that the above is only a preferred embodiment of the present invention and the technical principle applied. It will be understood by those skilled in the art that the embodiments of the present invention are not limited to the particular embodiments described herein, but are capable of numerous obvious changes, rearrangements and substitutions without departing from the scope of the embodiments of the present invention. Therefore, while the embodiments of the present invention have been described in connection with the above embodiments, the embodiments of the present invention are not limited to the above embodiments, but may include many other equivalent embodiments without departing from the spirit of the embodiments of the present invention, and the scope of the embodiments of the present invention is determined by the scope of the appended claims.

Claims (10)

1. The recording and broadcasting system comprises a recording and broadcasting host, and is characterized by further comprising a camera connected with the recording and broadcasting host, wherein the camera comprises a first camera which comprises a first camera module, and the first camera module is used for shooting a first high-definition image in a first area;
The first camera is further used for analyzing the first high-definition image, intercepting a first close-range image from the first high-definition image according to a first analysis result, determining an image priority and sending the first high-definition image, the first close-range image and the image priority to the recording and broadcasting host;
and the recording and broadcasting host is used for determining a target image from the images sent by the camera according to the image priority and outputting the target image.
2. The recording and broadcasting system according to claim 1, wherein the first camera is specifically configured to analyze the first high-definition image, obtain status information of each object in the first high-definition image, determine a target object according to the status information of each object, intercept a first close-up image corresponding to the target object from the first high-definition image, and determine an image priority according to the status information of each object.
3. The recording and playback system of claim 2, wherein the status information includes behavior information and location information;
The first camera is specifically configured to determine a target object according to behavior information of each object, intercept a first close-range image corresponding to the target object from the first high-definition image according to position information of the target object, and determine an image priority according to the behavior information of each object.
4. The recording and playback system of claim 1, said camera further comprising at least one second camera, each of said second cameras for capturing second high definition images in a different second area, each of said second cameras being communicatively coupled to said first camera;
Each second camera is further used for analyzing the second high-definition image, intercepting a second close-range image from the second high-definition image according to a second analysis result, sending the second high-definition image and the second close-range image to the recording and broadcasting host, and sending the second analysis result to the first camera;
The first camera is specifically configured to determine an image priority according to the first analysis result and each second analysis result, and send the image priority to the recording and playing host.
5. The recording and playing system according to claim 1, wherein the first camera further comprises a second camera module;
The first camera is further used for controlling the second camera module to shoot a third high-definition image in a third area according to a first analysis result, intercepting a third close-range image from the third high-definition image according to the first analysis result, and sending the third high-definition image and the third close-range image to the recording and playing host; the third region is located within the first region and the extent of the third region is less than the extent of the first region.
6. The recording and playback system of claim 5, wherein the angles of view of the first high definition image, the first near view image, the third high definition image, and the third near view image decrease in sequence;
The recording and broadcasting host is also used for determining a transition image with the size of the angle of view between the target image and the currently output image after determining the target image from the images sent by the camera and before outputting the target image, and outputting the transition image in sequence according to the size of the angle of view of the transition image, and outputting the target image when all the transition images are output.
7. The recording and broadcasting system of claim 6, wherein when the angle of view of the target image is smaller than the angle of view of the image currently output by the recording and broadcasting host, the recording and broadcasting host is configured to determine a first output order according to the angle of view of the transition image from large to small, and the transition image is sequentially output according to the first output order; when the field angle of the target image is larger than the field angle of the image currently output by the recording and broadcasting host, the recording and broadcasting host is used for determining a second output sequence from small to large according to the field angle of the transition image, and the transition image is sequentially output according to the second output sequence.
8. The recording and broadcasting system according to claim 5, wherein when the recording and broadcasting system is in a first use state, the first camera is specifically configured to identify a target object from the first high-definition image, and when the target object is determined to satisfy a preset condition, the second camera module is controlled to capture a third high-definition image of an area where the target object is located, and cut a third close-range image from a target area of the third high-definition image, and an image priority of the third close-range image is set to be highest.
9. The recording and broadcasting system of claim 8, further comprising a display module, wherein the display module is connected to the recording and broadcasting host, and wherein the first use state is when the recording and broadcasting system is in communication connection with a remote device and performs remote interaction; the display module is used for displaying pictures transmitted by the remote equipment during remote interaction, and the recording and broadcasting host is used for outputting the target image to the remote equipment during remote interaction;
The first camera is specifically configured to identify a target object from the first high-definition image, perform gesture analysis on the target object to obtain a gesture analysis result, and determine that the target object meets a preset condition when it is determined that a target area of the target object faces the display module according to the gesture analysis result.
10. The recording and playback system of claim 9, wherein the target object is a human body and the target area is an area in which eyes of the human body are located.
CN202211453130.1A 2022-11-21 2022-11-21 Recording and broadcasting system Pending CN118057809A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202211453130.1A CN118057809A (en) 2022-11-21 2022-11-21 Recording and broadcasting system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202211453130.1A CN118057809A (en) 2022-11-21 2022-11-21 Recording and broadcasting system

Publications (1)

Publication Number Publication Date
CN118057809A true CN118057809A (en) 2024-05-21

Family

ID=91069002

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202211453130.1A Pending CN118057809A (en) 2022-11-21 2022-11-21 Recording and broadcasting system

Country Status (1)

Country Link
CN (1) CN118057809A (en)

Similar Documents

Publication Publication Date Title
CN106791485B (en) Video switching method and device
CN205016086U (en) Intelligent tutoring recorded broadcast system
US8208002B2 (en) Distance learning via instructor immersion into remote classroom
KR100989142B1 (en) System and method for supplying e-learning contents
US11128793B2 (en) Speaker tracking in auditoriums
US20180192002A1 (en) Automated configuration of behavior of a telepresence system based on spatial detection of telepresence components
WO2017215295A1 (en) Camera parameter adjusting method, robotic camera, and system
US20100118112A1 (en) Group table top videoconferencing device
KR102308443B1 (en) Smart advanced lecture and recoding system
US20080180519A1 (en) Presentation control system
CN105376547A (en) Micro video course recording system and method based on 3D virtual synthesis technology
US11006072B2 (en) Window system based on video communication
US10623698B2 (en) Video communication device and method for video communication
CN110333837B (en) Conference system, communication method and device
WO2022262839A1 (en) Stereoscopic display method and apparatus for live performance, medium, and system
TW201734948A (en) A method, system and device for generating associated audio and visual signals in a wide angle image system
JP2007158680A (en) Tracking imaging apparatus and tracking imaging system utilizing it
JP2005033570A (en) Method and system for providing mobile body image
CN110933350A (en) Electronic cloud mirror recording and broadcasting system, method and device
US7986336B2 (en) Image capture apparatus with indicator
CN109788221A (en) A kind of recorded broadcast method and device
JP6232716B2 (en) Information processing apparatus, display control system, and program
CN118057809A (en) Recording and broadcasting system
WO2021226821A1 (en) Systems and methods for detection and display of whiteboard text and/or an active speaker
US10701313B2 (en) Video communication device and method for video communication

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination