WO2019242207A1 - Image display method, apparatus, system and device, and readable storage medium - Google Patents

Image display method, apparatus, system and device, and readable storage medium

Info

Publication number
WO2019242207A1
WO2019242207A1 PCT/CN2018/114074 CN2018114074W WO2019242207A1 WO 2019242207 A1 WO2019242207 A1 WO 2019242207A1 CN 2018114074 W CN2018114074 W CN 2018114074W WO 2019242207 A1 WO2019242207 A1 WO 2019242207A1
Authority
WO
WIPO (PCT)
Prior art keywords
image
close
camera
speaker
relative angle
Prior art date
Application number
PCT/CN2018/114074
Other languages
French (fr)
Chinese (zh)
Inventor
田楠
李伟
Original Assignee
广州视源电子科技股份有限公司
广州视臻信息科技有限公司
Application filed by 广州视源电子科技股份有限公司, 广州视臻信息科技有限公司
Publication of WO2019242207A1

Classifications

    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N23/00 Cameras or camera modules comprising electronic image sensors; Control thereof
    • H04N23/90 Arrangement of cameras or camera modules, e.g. multiple cameras in TV studios or sports stadiums
    • H04N23/80 Camera processing pipelines; Components thereof
    • H04N5/00 Details of television systems
    • H04N5/222 Studio circuitry; Studio devices; Studio equipment
    • H04N5/262 Studio circuits, e.g. for mixing, switching-over, change of character of image, other special effects; Cameras specially adapted for the electronic generation of special effects
    • H04N5/2624 Studio circuits, e.g. for mixing, switching-over, change of character of image, other special effects; Cameras specially adapted for the electronic generation of special effects for obtaining an image which is composed of whole input images, e.g. splitscreen

Definitions

  • the invention provides a close-up image determination method, apparatus, system and device, and a readable storage medium, which solve the problem in the related art that a human operator needs a long time to adjust the camera equipment, which reduces the efficiency of close-up shooting.
  • an image display method including the following steps:
  • the first image and the second image are displayed.
  • the determining the position of the speaker includes:
  • the position includes a relative position parameter of the speaker and a local display screen; and selecting, from two or more close-up cameras, a close-up camera corresponding to the position to capture an image of the speaker includes:
  • an image of the speaker captured by the selected close-up camera is acquired.
  • selecting a close-up camera corresponding to the position from two or more close-up cameras to capture an image of the speaker includes:
  • the relative position parameter includes a relative angle and / or a relative distance.
  • the relative angle is a relative angle in a horizontal direction.
  • the position includes the coordinates of the speaker in a predetermined coordinate system; selecting a close-up camera corresponding to the position to capture an image of the speaker from two or more close-up cameras includes:
  • the method further includes:
  • a close-up camera with the smallest relative distance from the speaker is selected as the target camera.
  • the method further includes:
  • a close-up camera having the smallest relative angle with the speaker in the horizontal direction is selected as the target camera.
  • the method further includes the following steps:
  • the close-up image and the first image are displayed.
  • the cropped image data is determined as the close-up image.
  • mapping an image position of the speaker in the second image according to the position includes:
  • the position is a relative angle between the speaker and a local display screen
  • the processor is coupled to the memory, and is configured to read program instructions stored in the memory and, in response, perform operations in the method described above.
  • the relative angle between each close-up camera and the display screen is different.
  • the panoramic camera is installed at the frame on the upper side of the image display device, a first close-up camera is installed at the frame on the left side of the image display device, and a second close-up camera is installed at the frame on the right side of the image display device.
  • the relative angle between the first close-up camera and the display screen is between 10 degrees and 50 degrees;
  • the relative angle between the second close-up camera and the display screen is between 130 degrees and 170 degrees.
  • the image display device further includes an acoustic positioning device for positioning the speaker.
  • the image display device is a conference interaction device.
  • the conference interactive device is a smart interactive tablet.
  • an image display device including:
  • Memory storing processor-executable instructions
  • the processor is coupled to the memory, and is configured to read program instructions stored in the memory and, in response, perform operations in the method described above.
  • the image display device of the embodiment of the present invention is associated with a panoramic camera and at least two close-up cameras.
  • the relative angle between each close-up camera and the display screen is different.
  • the relative angle between a close-up camera and the display screen is between 10 degrees and 50 degrees; the relative angle between the other close-up camera and the display screen is between 130 degrees and 170 degrees.
  • the image display device further includes an acoustic positioning device for positioning the speaker.
  • the acoustic positioning device includes a microphone array.
  • the image display device is a conference interaction device.
  • the conference interactive device is a smart interactive tablet.
  • one or more machine-readable storage media are provided with instructions stored thereon which, when executed by one or more processors, perform operations in the method described above.
  • an image display device including:
  • a first image acquisition module configured to acquire an image captured by a panoramic camera as a first image
  • a speech position determining module configured to determine the position of a speaker
  • a second image acquisition module configured to select, from two or more close-up cameras, a close-up camera corresponding to the position to capture an image of the speaker as a second image;
  • An image display module is configured to display the first image and the second image.
  • the position includes a relative position parameter of the speaker and a local display screen;
  • the second image acquisition module includes:
  • a position parameter acquisition module configured to retrieve the relative position parameters of each close-up camera and the local display screen
  • a relative position calculation module configured to calculate the relative position parameters of the speaker and each close-up camera based on the position and the retrieved relative position parameters
  • a camera selection module configured to select, from the two or more close-up cameras and according to the calculated relative position parameters, the close-up camera with the smallest relative position parameter with respect to the speaker
  • a first acquisition submodule configured to acquire an image of the speaker captured by the selected close-up camera.
  • the second image acquisition module includes:
  • a predetermined relationship acquisition module configured to obtain a predetermined correspondence between each scene position and each close-up camera in a real scene; the close-up camera corresponding to each scene position is specifically used to close-up the speaker at the scene position;
  • a target camera selection module configured to select a close-up camera corresponding to the position as a target camera from the two or more close-up cameras according to the predetermined correspondence relationship
  • a second acquisition submodule is configured to acquire an image obtained by the target camera shooting the speaker.
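  • The predetermined correspondence between scene positions and close-up cameras can be pictured as a lookup table. The following is a minimal sketch only: the angle ranges, the camera identifiers, and the function name `select_target_camera` are illustrative assumptions and do not come from the patent text.

```python
from typing import Optional

# Hypothetical correspondence: each entry maps a range of speaker angles
# (degrees, relative to the local display screen) to a close-up camera id.
PREDETERMINED_CORRESPONDENCE = [
    ((0.0, 90.0), "close_up_1"),
    ((90.0, 180.0), "close_up_2"),
]


def select_target_camera(speaker_angle_deg: float) -> Optional[str]:
    """Return the camera whose scene position range contains the speaker."""
    for (low, high), camera_id in PREDETERMINED_CORRESPONDENCE:
        # Half-open ranges so that an angle maps to at most one camera.
        if low <= speaker_angle_deg < high:
            return camera_id
    return None  # no close-up camera is assigned to this position
```

  • With such a table, selecting the target camera requires no per-frame geometry; the cost is that the table has to be prepared in advance for the room layout.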
  • the position is a relative angle between the speaker and the local display screen, and the relative position parameter is a relative angle.
  • the image display module includes:
  • An image position mapping module configured to map an image position of the speaker in the second image according to the position
  • a close-up image extraction module configured to extract image data at a mapped image position in the second image to obtain a close-up image of the speaker
  • An image display submodule configured to display the close-up image and the first image.
  • the apparatus further includes a target extraction module, configured to:
  • the cropped image data is determined as the close-up image.
  • the image position mapping module is configured to:
  • the image position mapping module is configured to:
  • the coordinates of the obtained image area are used as the mapped image position.
  • the position is a relative angle between the speaker and a local display screen
  • the position information is a scene area corresponding to each image area in a real scene, and a relative angle with the local display screen.
  • the module for pre-generating the location information is configured to:
  • the relative angle between the scene area of the close-up camera and the local display is calculated from the calibrated relative angle and the relative position parameters of the close-up camera and the local display.
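  • One way to read the image position mapping is as a lookup in pre-generated location information, followed by a crop. The sketch below assumes a closest-angle match and a row-major image held as a list of pixel rows; the table contents, `map_image_position`, and `crop` are hypothetical names used for illustration only.

```python
# Hypothetical pre-generated location information: each image area of the
# second image (x, y, width, height in pixels) is paired with the relative
# angle (degrees) between its scene area and the local display screen.
LOCATION_INFO = [
    {"area": (0, 0, 640, 1080), "angle": 30.0},
    {"area": (640, 0, 640, 1080), "angle": 45.0},
    {"area": (1280, 0, 640, 1080), "angle": 60.0},
]


def map_image_position(speaker_angle_deg):
    """Return the image area whose stored angle is closest to the speaker's."""
    best = min(LOCATION_INFO, key=lambda entry: abs(entry["angle"] - speaker_angle_deg))
    return best["area"]


def crop(image, area):
    """Crop a row-major image (list of pixel rows) to the given area."""
    x, y, width, height = area
    return [row[x:x + width] for row in image[y:y + height]]
```

  • The cropped data would then play the role of the close-up image; a real system would more likely work on camera frames held in an array library rather than nested lists.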
  • a smart interactive tablet including a panoramic camera, a first close-up camera, and a second close-up camera.
  • the panoramic camera, the first close-up camera, and the second close-up camera are provided.
  • the optical axes of the first close-up camera and the second close-up camera are inclined to the display plane of the smart interactive tablet.
  • the smart interactive tablet is further configured to:
  • the first image and the second image are displayed.
  • the relative angle between the optical axis of the first close-up camera and the display screen is between 10 degrees and 50 degrees;
  • the relative angle between the optical axis of the second close-up camera and the display screen is between 130 degrees and 170 degrees.
  • a close-up camera corresponding to the position of the speaker can be quickly selected from a plurality of close-up cameras to shoot the speaker, and the image taken by the panoramic camera and the image taken by the selected close-up camera are then displayed. A close-up of the speaker is achieved without a human operator having to pan, tilt, push, or pull the close-up camera; compared with related technologies that require a human operator to perform such additional operations on the camera, this can effectively improve the efficiency of close-ups.
  • FIG. 1A is a schematic diagram of an image display system according to an exemplary embodiment of the present invention.
  • FIG. 1B is a schematic diagram of an image display system according to another exemplary embodiment of the present invention.
  • FIG. 1C is a schematic diagram of an image display system according to another exemplary embodiment of the present invention.
  • FIG. 2A is a processing logic diagram of an image display system according to another exemplary embodiment of the present invention.
  • FIG. 2B is an interaction schematic diagram of an image display system according to another exemplary embodiment of the present invention.
  • FIG. 2C shows a picture displayed on a display screen according to an exemplary embodiment of the present invention.
  • FIG. 3 is a schematic diagram of an image display method according to an exemplary embodiment of the present invention.
  • FIG. 4A is a schematic diagram of an image display method according to another exemplary embodiment of the present invention.
  • FIG. 4B is a schematic diagram of dividing a panoramic image according to an exemplary embodiment of the present invention.
  • FIG. 5 is a schematic diagram of an image display method according to another exemplary embodiment of the present invention.
  • FIG. 6 is a block diagram of an image display device according to an exemplary embodiment of the present invention.
  • FIG. 7 is a hardware structural diagram of an image display device according to an exemplary embodiment of the present invention.
  • although the terms first, second, third, etc. may be used in the present invention to describe various kinds of information, the information should not be limited by these terms. These terms are only used to distinguish information of the same type from each other.
  • first information may also be referred to as second information, and similarly, second information may also be referred to as first information.
  • the word "if" as used herein can be interpreted as "at the time of", "when", or "in response to a determination".
  • the embodiments of the present invention provide an image display method and an image display system for implementing the image display method.
  • the provided image display system may include an image display device, a panoramic camera, and at least two close-up cameras.
  • the image display device may include a memory, a processor, and a display screen for displaying an image.
  • the memory stores program instructions executable by the processor.
  • the processor is coupled to the memory and is used to read the program instructions stored in the memory and, in response, perform operations for implementing an image display method according to an embodiment of the present invention.
  • the display screen mentioned here can be an independent display screen, such as an LED display screen; it can also be the screen of an image display device with interactive capabilities, such as a touch screen display, a smart interactive tablet, or other computer equipment with interactive display capabilities.
  • the panoramic camera is used to shoot the realistic scene of the target location. Compared with the close-up camera, the angle of view range is wider and the shooting range is wider.
  • the panoramic camera may be a wide-angle camera.
  • the close-up camera is used to capture close-ups of speakers in the target location. The designer of the present invention can preset different close-up cameras to shoot different predetermined scene areas of the target place according to the close-up requirements of the actual application scene, and / or preset different close-up cameras to shoot the same predetermined scene area with different relative position parameters.
  • the relative position parameter mentioned herein may include a relative angle and / or a relative distance between the close-up camera and the subject in the predetermined scene area. In the actual close-up process, it is usually not necessary to manually guide the close-up camera to adjust with the changes of the speaker.
  • the target scene and the predetermined scene area of the close-up camera may be different.
  • the application scenario can be a conference scenario, a smart education scenario, a live broadcast scenario, or another scenario that requires close-ups of the speakers; correspondingly, the target location can be a conference room, a lecture room, a live broadcast room, or another place where close-ups of a speaker are needed.
  • the predetermined scene area may be an area in which speakers may appear in the meeting, an area in which speakers may appear in the course, an area in which anchors may appear, or an area in which speakers may appear in other scenarios.
  • the predetermined scene area of the close-up camera and / or the relative position parameters of the close-up camera and the predetermined scene area may also be different.
  • for example, for a close-up of the conference presenter, who is usually at the end of the conference table near the conference interactive device, the predetermined scene area may be the conference table area near the conference interactive device; as another example, for a close-up of any participant who may speak, the predetermined scene area can be the area where the participants are located.
  • for a frontal close-up of the speaker, the relative angle between the close-up camera and the predetermined scene area is the front angle; for an oblique side close-up of the speaker, the relative angle between the close-up camera and the predetermined scene area is the oblique angle.
  • the front angle mentioned here refers to the angle perpendicular to the front of the subject.
  • the oblique angle refers to an angle that deviates from the front angle, that is, an angle passed through when moving left or right around the subject in the predetermined scene area from the front angle toward the side angle.
  • the side angle refers to the angle perpendicular to the side of the subject.
  • the panoramic camera and each close-up camera can be installed on the image display device as accessory devices of the image display device, or can exist independently of the image display device and be installed in the space outside the image display device.
  • the image display system according to the embodiment of the present invention is described in detail below, taking a conference scenario as the application scenario and a conference room as the target location.
  • FIG. 1A is a schematic diagram of an image display system according to an exemplary embodiment of the present invention.
  • the image display device 110 may be a conference display device with a display function, or a conference interactive device with interactive capabilities, such as a smart interactive tablet.
  • the panoramic camera 112, the close-up camera 113, and the close-up camera 114 are disposed on the frame of the smart interactive tablet, and the optical axes of the close-up camera 113 and the close-up camera 114 are inclined to the display plane of the smart interactive tablet.
  • the embodiment of the present invention may use the relative angle between each close-up camera and the display 111 to identify the close-up cameras; close-up cameras for different scene areas have different relative angles with the display 111.
  • the relative angle here can refer to the angle between the optical axis of the close-up camera and the display plane of the display 111, and the specific value can be determined by the scene area and the installation position of the close-up camera.
  • the angle ⁇ 2 between the optical axis of the close-up camera 113 and the screen center line of the display 111 distributed along the y direction is the relative angle between the close-up camera 113 and the display 111, and the angle ⁇ 3 between the optical axis of the close-up camera 114 and the screen center line distributed along the y direction of the display screen 111 is the relative angle between the close-up camera 114 and the display screen 111.
  • the relative angle between each close-up camera and the display screen 111 may also be calculated by referring to other reference objects, and details are not described herein again.
  • At least one close-up camera 113 is installed at a frame on the left side of the image display device 110, and at least one close-up camera 114 is installed at a frame on the right side of the image display device 110.
  • the embodiment of the present invention uses the three-dimensional xyz coordinate system shown in FIG. 1A as a reference for installing the close-up cameras 113 and 114, in which the display screen 111 is located on the yz plane, and the center point of the display screen 111 is the origin of the coordinates and can also be used as the point of sight of the speaker.
  • the relative angle between the display 111 and the close-up camera 113 installed at the left border of the image display device 110 is between 10 degrees and 50 degrees (an example value of ⁇ 1); the relative angle between the display screen and the close-up camera installed at the right border of the image display device 110 is between 130 degrees and 170 degrees (an example value of ⁇ 2).
  • the relative angle between the close-up camera 113 and the display 111 is 15 degrees, and the relative angle between the close-up camera 114 and the display 111 is 165 degrees.
  • the speaker involved in the embodiment of the present invention may refer to an object that emits audio, such as a participant who speaks in the conference scene of this embodiment.
  • the close-up camera mentioned here may be the camera that shoots the area of the scene where the speaker is located, or the camera that shoots the speaker in the scene area in front.
  • the image captured by the panoramic camera 112 is referred to as the first image, and the image captured by the selected close-up camera 113 or 114 is referred to as the second image. After the first image and the second image are acquired, they are displayed on the display screen 111.
  • the audio emitted by the speaker in the embodiment of the present invention is different from the ambient audio.
  • the embodiment of the present invention may determine the position of the speaker according to the positioning result of the acoustic positioning device.
  • the specific way of determining the position of the speaker through the acoustic positioning device 115 can be determined by the positioning principle of the acoustic positioning device 115.
  • the acoustic positioning device 115 may include a vertically arranged microphone and a horizontally arranged microphone. The time difference between the audio signals collected by the two microphones is combined with the spatial position of the microphone to locate the relative position parameters of the speaker and the acoustic positioning device 115.
  • the relative position parameters mentioned here include relative angle and / or relative distance.
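  • As an illustration of how a time difference between two microphones can yield a relative angle, the sketch below uses the standard far-field approximation, in which the path difference equals the microphone spacing times the sine of the arrival angle. This is an assumed textbook model, not the positioning principle stated in the patent, and `arrival_angle_deg` is a hypothetical name.

```python
import math

SPEED_OF_SOUND_M_S = 343.0  # approximate speed of sound in air at 20 degrees C


def arrival_angle_deg(time_difference_s, mic_spacing_m):
    """Angle of the sound source measured from the broadside of a microphone pair.

    Far-field model (assumption): path difference = spacing * sin(angle).
    """
    ratio = SPEED_OF_SOUND_M_S * time_difference_s / mic_spacing_m
    # Clamp so that measurement noise cannot push the ratio outside [-1, 1].
    ratio = max(-1.0, min(1.0, ratio))
    return math.degrees(math.asin(ratio))
```

  • A single microphone pair only resolves one angle; combining a horizontally arranged pair with a vertically arranged pair, as the text describes, is what allows both horizontal and vertical directions to be estimated.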
  • the processor of the acoustic positioning device 115 or the image display device 110 may calculate the relative position parameters of the speaker and the acoustic positioning device 115 and the three-dimensional coordinates of the acoustic positioning device 115.
  • the processor of the acoustic positioning device 115 or the image display device 110 may pre-store the relative position parameters of the display screen 111 and the acoustic positioning device 115, and then calculate the relative position parameters of the speaker and the display 111 based on the relative position parameters of the speaker and the acoustic positioning device 115 and the pre-stored relative position parameters.
  • the relative position parameters mentioned here include relative angle and relative distance.
  • the length of the vector pointing from the speaker to the center of the display 111 is the relative distance between the speaker and the display 111, and the included angle between this vector and the screen center line distributed in the y direction is the relative angle between the speaker and the display screen 111; it is equal to ⁇ 1, the two angles being opposite each other.
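  • The relative distance and relative angle can be computed directly once the speaker's coordinates are known in the coordinate system described for FIG. 1A (display on the yz plane, origin at the screen center). The sketch below assumes the speaker stands in front of the screen (x >= 0) and measures the horizontal angle in the xy plane from the +y direction of the screen center line; both conventions and the name `relative_position` are assumptions.

```python
import math


def relative_position(speaker_xyz):
    """Return (relative distance, horizontal relative angle in degrees).

    The display center is the origin; the angle is measured in the xy plane
    from the +y axis toward the speaker (0 to 180 degrees when x >= 0).
    """
    x, y, z = speaker_xyz
    distance = math.sqrt(x * x + y * y + z * z)
    angle_deg = math.degrees(math.atan2(x, y))
    return distance, angle_deg
```

  • For instance, a speaker directly in front of the screen center comes out at 90 degrees under this convention, and speakers toward either end of the center line approach 0 or 180 degrees.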
  • the panoramic camera 112 and at least two close-up cameras 113 and 114 are installed on the image display device 110.
  • the panoramic camera 112 and at least two close-up cameras 113 may also exist independently of the image display device 110. For details, refer to FIG. 1B.
  • FIG. 1B is a schematic diagram of an image display system according to another exemplary embodiment of the present invention.
  • the image display system shown in FIG. 1B is disposed in the conference room 100 and may include an image display device 110, a memory (not shown), a processor (not shown), and a display screen 111 provided on the image display device 110.
  • the panoramic camera 131 and at least two close-up cameras 132 and 133 are provided in a three-dimensional space outside the image display device 110.
  • the relative distance between the panoramic camera 131 and at least two close-up cameras 132 and 133 and the image display device 110 in the direction of at least one coordinate axis may be increased.
  • the cameras of the close-up cameras are parallel to the center line of the display screen 111 in the y direction.
  • the relative angle between the close-up camera and the display screen 111 is the relative angle in the horizontal direction (in the xy plane), and ⁇ 1, ⁇ 2, ⁇ 3 shown in FIG. 1A and FIG. 1B are equal to ⁇ 1p, ⁇ 2p, ⁇ 3p shown in FIG. 1C.
  • FIG. 1C is the projection of the display system shown in FIG. 1A on the horizontal plane.
  • for the technical content involved, refer to the embodiment related to FIG. 1A; details are not described herein again. The following describes the processing logic of the image display system according to the embodiment of the present invention with reference to FIG. 2A and FIG. 2B, to implement a close-up image display process.
  • the image display system shown in FIG. 2A may include an image display device 210, a panoramic camera 221, at least two close-up cameras 222, 223, and the like.
  • the image display device 210 may include a processor 211, a display screen 212, a memory 213, a non-volatile memory 214, and a device interface 215 connected through an internal bus.
  • the panoramic camera 221 and at least two close-up cameras 222 and 223 are connected to the processor 211 through the device interface 215.
  • the specific form of the device interface 215 may match the interfaces of the panoramic camera 221 and at least two close-up cameras 222 and 223, such as a USB interface.
  • the image display system may further include a positioning device, such as an acoustic positioning device, for determining the position of the speaker.
  • the acoustic positioning device may be associated with the image display device 210 and exist independently of the image display device; it may also be a microphone array installed on the image display device 210 and connected to the processor 211 through an internal bus.
  • the designer of the present invention can store the program instructions that realize the close-up (program instructions corresponding to the processing logic 213a) in the non-volatile memory 214.
  • the processor 211 reads the program instructions into the memory 213 for execution and, in response, performs the operations shown in the processing logic 213a: acquiring the image captured by the panoramic camera 221 as the first image; determining the speaker's position; selecting, from two or more close-up cameras 222, 223, etc., a close-up camera corresponding to the position to capture the image of the speaker as the second image; and displaying the first image and the second image on the display screen 212.
  • the implementation of the processing logic 213a may be implemented by interactions between devices in the image display system. For a specific interaction process, refer to FIG. 2B.
  • the panoramic camera 221 and two or more close-up cameras 222, 223, etc. execute step S201 to capture their respective shooting areas at a set frequency, and send the captured images to the processor 211 through the device interface 215 (step S202). To help the processor 211 distinguish different images, the panoramic camera 221 and the two or more close-up cameras 222, 223, etc. may send their respective identities, or their relative positions to the display screen, to the processor 211 along with the images.
  • the processor 211 may select the panoramic image captured by the panoramic camera 221 as the first image (step S203). If a speaker in the target scene speaks before the first image is sent to the display screen, the positioning device 230 may determine the speaker's position (step S204) and send it to the processor 211 (step S205). According to the position, the processor 211 then selects, from the images sent by the two or more close-up cameras 222, 223, etc., the image sent by the close-up camera corresponding to the position as the second image (step S206), and sends the first image and the second image to the display screen 212 (step S207); the display screen 212 displays the first image and the second image (step S208).
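  • The role of the camera identity sent along with each image can be sketched as follows. The `Frame` type, the `pick_images` function, and the string identifiers are hypothetical and only illustrate how a processor might separate the first image from the second image once a target close-up camera has been chosen.

```python
from dataclasses import dataclass


@dataclass
class Frame:
    camera_id: str  # identity sent along with the image
    pixels: bytes   # placeholder for the image payload


def pick_images(frames, panoramic_id, target_close_up_id):
    """Return (first image, second image or None) from the incoming frames."""
    by_camera = {frame.camera_id: frame for frame in frames}
    first = by_camera[panoramic_id]
    # Without a located speaker there is no target camera and no second image.
    second = by_camera.get(target_close_up_id) if target_close_up_id else None
    return first, second
```

  • Tagging frames at the source keeps the selection step independent of the order in which the cameras deliver their images.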
  • the first image and the second image displayed on the display screen 212 are shown in FIG. 2C.
  • the second image may be superimposed on the first image, or may float over the first image, and the specific display manner may be set by the relevant personnel according to actual needs.
  • the processor 211 sends the first image to the display screen 212, and the display screen 212 displays the first image.
  • the smart interactive tablet may be an integrated device that combines one or more functions of a projector, electronic whiteboard, screen, audio system, television, and video conference terminal.
  • the smart interactive tablet may also establish a data connection with at least one external device.
  • the external devices include, but are not limited to, smart phones, USB flash drives, laptop computers, desktop computers, tablet computers, personal digital assistants (PDAs), and the like.
  • the communication methods of the data connection between the external device and the smart interactive tablet include, but are not limited to, communication methods such as USB connection, Internet, local area network, Bluetooth, Wi-Fi, or ZigBee, which are not limited in the embodiments of the present invention.
  • the projection data is sent to the smart interactive tablet, so that the smart interactive tablet displays the content of the projection data.
  • the external device serves as the projection client; there may be one or more projection clients, set according to the specific application scenario, which is not limited in the embodiment of the present invention.
  • FIG. 3 is a flowchart of an image display method according to an exemplary embodiment of the present invention. This embodiment can be applied to an image display system for close-up of a speaker, and includes the following steps S301-S304:
  • Step S301: Acquire an image captured by a panoramic camera as a first image.
  • Step S302: Determine the position of the speaker.
  • Step S303: Select a close-up camera corresponding to the position from the two or more close-up cameras to capture an image of the speaker as a second image.
  • Step S304: Display the first image and the second image.
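  • Steps S301 to S304 can be sketched as one pass through a small pipeline. The function below is a minimal illustration in which every device is represented by a caller-supplied callable; the function name and all parameter names are assumptions and do not correspond to any interface named in the patent.

```python
def display_with_close_up(capture_panorama, locate_speaker, cameras, select_camera, show):
    """Run steps S301-S304 once.

    cameras maps a camera id to a zero-argument capture callable;
    select_camera maps a speaker position to one of those ids.
    """
    first_image = capture_panorama()      # S301: panoramic image as first image
    position = locate_speaker()           # S302: position of the speaker
    camera_id = select_camera(position)   # S303: choose the close-up camera
    second_image = cameras[camera_id]()   # S303: capture the speaker
    show(first_image, second_image)       # S304: display both images
    return camera_id
```

  • Passing the devices in as callables keeps the flow testable without hardware and leaves the selection rule open, since the text allows it to depend on the form of the determined position.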
  • the method of the embodiment of the present invention may be implemented by an image display system.
  • the method of the embodiment of the present invention may be applied to and executed by a single image display device.
  • when the image display system includes multiple image display devices, for example, as shown in FIG. 1B, image display devices are provided along three walls of the conference room, and each image display device is provided with an associated panoramic camera and at least two close-up cameras.
  • the embodiment of the present invention may be executed by each image display device or a general control device of each image display device.
  • the positioning device determines which image display device the speaker is facing, and then uses the image display device as a reference to determine the speaker position. Then, from the close-up cameras associated with each image display device, a corresponding close-up camera is selected.
  • the specific manner of selecting the image captured by the close-up camera in the embodiment of the present invention may depend on the specific form of the determined position, the formula used, or the preparatory work done in advance.
  • the speaker speaks to the image display device, and the acoustic positioning device is used to locate the speaker.
  • the acoustic positioning device may include a vertically arranged microphone and a horizontally arranged microphone.
  • the time difference between the audio signals collected by the two microphones is combined with the spatial positions of the microphones to determine the relative position parameters between the speaker and the acoustic positioning device.
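As an illustration of how a time difference between two microphones can yield an angle, the sketch below uses the standard far-field model, in which the path difference equals the microphone spacing times the sine of the arrival angle. The source does not state which localization formula it uses, so this model, the constant `SPEED_OF_SOUND`, and the function name are assumptions.

```python
import math

SPEED_OF_SOUND = 343.0  # m/s, assumed value for air at room temperature


def arrival_angle_deg(time_diff_s: float, mic_spacing_m: float) -> float:
    """Far-field arrival angle, measured from the normal of the microphone pair.

    Assumes a plane wave: path difference = spacing * sin(angle).
    """
    if mic_spacing_m <= 0:
        raise ValueError("microphone spacing must be positive")
    ratio = SPEED_OF_SOUND * time_diff_s / mic_spacing_m
    ratio = max(-1.0, min(1.0, ratio))  # clamp against measurement noise
    return math.degrees(math.asin(ratio))
```

One such pair gives an angle in one plane only; a horizontally arranged pair and a vertically arranged pair would each contribute one angle.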
  • the position parameters are as described above, and are not repeated here.
  • the position determined in the embodiment of the present invention is a relative position parameter between the speaker and the acoustic positioning device.
  • the relative position parameters between each close-up camera and the acoustic positioning device can be obtained from the position of each close-up camera and the position of the acoustic positioning device. Then, from the determined position of the speaker and those relative position parameters, the relative position parameters between each close-up camera and the speaker are calculated, and the close-up camera is selected with reference to them. For a frontal close-up, choose the camera with the smallest relative angle. For a clearer and larger close-up, choose the camera with a smaller relative distance.
  • the relative distance in the relative position parameter may refer to the straight-line distance between the position of the speaker and the position of the close-up camera (the dashed line between the speaker and the close-up camera 114 in FIG. 1A), or to the projection of that distance in a certain direction;
  • the relative angle in the relative position parameter is the angle between the vector pointing from the speaker to the close-up camera and the front direction of the speaker, or the projection of that angle in a certain direction or onto a certain plane.
  • the relative angle is the angle between the vector from the speaker to the close-up camera and the vector from the speaker to the center of the display, as shown in ⁇ 4 in FIG. 1A.
  • the plane mentioned here may be a horizontal plane.
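The relative distance and relative angle described above can be computed from three-dimensional coordinates as follows. This is a sketch under stated assumptions: positions are `(x, y, z)` tuples in a common coordinate system, and the z axis is taken as vertical for the horizontal projection, which the source does not specify. The function names are illustrative.

```python
import math
from typing import Tuple

Vec3 = Tuple[float, float, float]


def relative_distance(speaker: Vec3, camera: Vec3) -> float:
    """Straight-line distance between the speaker and a close-up camera."""
    return math.dist(speaker, camera)


def relative_angle_deg(speaker: Vec3, camera: Vec3, display_center: Vec3,
                       horizontal_only: bool = False) -> float:
    """Angle at the speaker between the camera and the display center."""
    to_camera = tuple(c - s for c, s in zip(camera, speaker))
    to_display = tuple(d - s for d, s in zip(display_center, speaker))
    if horizontal_only:
        # Project onto the horizontal plane by dropping z (assumed vertical).
        to_camera = (to_camera[0], to_camera[1], 0.0)
        to_display = (to_display[0], to_display[1], 0.0)
    norm = math.hypot(*to_camera) * math.hypot(*to_display)
    if norm == 0.0:
        raise ValueError("speaker coincides with the camera or display center")
    dot = sum(a * b for a, b in zip(to_camera, to_display))
    cosine = max(-1.0, min(1.0, dot / norm))  # guard against rounding error
    return math.degrees(math.acos(cosine))
```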
  • the acoustic positioning device pre-stores its own position coordinates in the predetermined coordinate system shown in the figure; after locating the position of the speaker relative to itself, it can derive the speaker's position coordinates in that coordinate system.
  • the embodiment of the present invention can directly calculate the relative position parameters between each close-up camera and the speaker from the position coordinates of each camera in the predetermined coordinate system and the position coordinates of the speaker, and then select the close-up camera with reference to the calculated relative position parameters.
  • the embodiment of the present invention may, through the following operations, select from two or more close-up cameras the close-up camera corresponding to the position to capture an image of the speaker:
  • the local display screen is relative to the remote display screen.
  • the angle between the center of the local display screen and each camera with respect to the speaker is calculated, which is the relative angle between the close-up camera and the speaker.
  • a close-up camera with the smallest relative angle to the speaker is selected as the target camera.
  • the predetermined coordinate system may be a three-dimensional coordinate system shown in FIG. 1A, and details are not described herein again.
  • the close-up camera 114 is selected as the target camera.
  • a close-up camera with the smallest relative angle to the speaker may appear.
  • the embodiment of the present invention may display the image taken by the selected close-up camera as the second image.
  • a close-up camera can be further selected according to the relative distance, which can be implemented by the following operations:
  • the relative distance between the close-up camera and the speaker is calculated according to the predetermined coordinates of each selected close-up camera and the position.
  • a close-up camera with the smallest relative distance from the speaker is selected as the target camera.
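A minimal sketch of the two-stage selection: the smallest relative angle first, then the smallest relative distance among cameras that tie on angle. The `Candidate` type and the `angle_tolerance_deg` parameter are assumptions added for illustration; the source only says that the distance is used when more than one camera is selected by angle.

```python
from typing import List, NamedTuple


class Candidate(NamedTuple):
    """Hypothetical record of one close-up camera's precomputed parameters."""
    camera_id: str
    relative_angle_deg: float
    relative_distance_m: float


def select_target_camera(candidates: List[Candidate],
                         angle_tolerance_deg: float = 1e-6) -> str:
    """Pick the smallest relative angle; break ties by smallest distance."""
    if not candidates:
        raise ValueError("at least one close-up camera is required")
    smallest = min(c.relative_angle_deg for c in candidates)
    tied = [c for c in candidates
            if c.relative_angle_deg - smallest <= angle_tolerance_deg]
    return min(tied, key=lambda c: c.relative_distance_m).camera_id
```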
  • the relative angle can be projected on a two-dimensional horizontal plane to obtain the relative angle in the horizontal direction, and then a close-up camera can be selected. For the projection, see FIG. 1A and FIG.
  • a close-up camera having the smallest relative angle with the speaker in the horizontal direction is selected as the target camera.
  • the coordinate origin of the predetermined three-dimensional coordinate system is not the center of the display screen 111 as shown in FIG. 1A, and the point of sight of the speaker is the center of the display screen 111.
  • the predetermined coordinates (coordinates in the three-dimensional coordinate system) of the center of the display screen 111 can be obtained in advance. The relative position of each close-up camera and the speaker is then calculated from the predetermined coordinates of the center of the display screen 111, the position coordinates of the speaker, and the position coordinates of the close-up cameras, and the close-up camera is selected with reference to the calculated relative position.
  • Case 3: In the embodiment of the present invention, no human director is needed to adjust the installation positions, angles, etc. of the close-up cameras while a close-up is being made. The relative position parameters between each close-up camera and the display screen of the image display device (as shown in FIG. 1A, the angle between the close-up camera and the center line of the display screen in the y direction) can therefore be determined in advance. After the position of the speaker is determined, the predetermined relative position parameters are retrieved, the relative position of each close-up camera and the speaker is calculated, and the close-up camera is selected with reference to the calculated relative position.
  • the position includes the relative position parameters of the speaker and the local display screen.
  • a close-up camera corresponding to the position can be selected from two or more close-up cameras to capture an image of the speaker by the following operations:
  • the relative position parameters of the speaker and each close-up camera are calculated.
  • a close-up camera with the smallest relative position parameter of the speaker is selected.
  • An image obtained by the selected close-up camera to capture the speaker is acquired.
  • from the relative angle between each camera and the local display, and the relative angle between the speaker and the local display in different scene areas, it is possible to estimate the close-up effect each close-up camera achieves for speakers in different scene areas, and then to associate different cameras with different ranges of the relative angle between the speaker and the local display. Once the speaker position, or the relative angle between the speaker and the local display, is determined later, the close-up camera is selected according to this correspondence.
  • for example, the relative angle between the close-up camera 114 and the local display 111 is 165 degrees, and the relative angle between the close-up camera 113 and the local display 111 is 15 degrees. When the relative angle between the speaker and the local display 111 is 0 to 90 degrees, the speaker corresponds to the close-up camera 114; when the relative angle between the speaker and the local display 111 is 90 degrees to 180 degrees, the speaker corresponds to the close-up camera 113.
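The angle-range correspondence in this example can be held in a small lookup table. In the sketch below, the table name, the function name, and the handling of the shared 90-degree boundary (assigned here to camera 113) are assumptions; the source leaves the boundary unspecified.

```python
from typing import List, Tuple

# (lower bound, upper bound, camera id), taken from the example in the text.
ANGLE_RANGES: List[Tuple[float, float, str]] = [
    (0.0, 90.0, "114"),
    (90.0, 180.0, "113"),
]


def camera_for_angle(angle_deg: float,
                     ranges: List[Tuple[float, float, str]] = ANGLE_RANGES) -> str:
    """Return the close-up camera whose angle range contains angle_deg."""
    for low, high, camera_id in ranges:
        if low <= angle_deg < high:      # half-open interval (assumption)
            return camera_id
    if angle_deg == ranges[-1][1]:       # include the final upper bound
        return ranges[-1][2]
    raise ValueError(f"angle {angle_deg} is outside every configured range")
```

Because the table is fixed in advance, no per-camera angle calculation is needed at run time.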
  • Case 4: Different close-up cameras can be dedicated to close-ups of speakers in different predetermined scene areas. To further improve close-up efficiency, the correspondence between each scene position in the real scene and each close-up camera can be determined in advance, where the close-up camera corresponding to a scene position is dedicated to close-ups of a speaker at that scene position. For example, the relative position parameter of the close-up camera corresponding to a scene position is smaller than that of the other close-up cameras at that scene position.
  • a close-up camera corresponding to the position is selected according to a predetermined correspondence relationship.
  • a close-up camera corresponding to the position may be selected to take an image of the speaker by performing the following operations:
  • a close-up camera corresponding to the position is selected as a target camera.
  • the first image and the second image are displayed directly, so a close-up of the speaker is shown together with the panoramic image. No human director has to perform additional operations on the close-up camera, such as panning, tilting, or pushing in and pulling out, so the close-up of the speaker is achieved quickly. Compared with related technologies that require a human director to perform such operations on the camera, this effectively improves close-up efficiency.
  • the close-up camera 114 may be dedicated to close-ups of speakers whose relative angle with the local display 111 is in the range of 0 to 90 degrees; the close-up camera 113 may be dedicated to close-ups of speakers whose relative angle with the local display 111 is in the range of 90 degrees to 180 degrees.
  • the first image and the second image are not displayed directly; instead, only the speaker or the speaker's target part is shown in close-up.
  • the image of the speaker or of the speaker's target part needs to be extracted from the second image, and then the first image and the extracted image are displayed.
  • the method shown in FIG. 4A may include steps S401-S406:
  • Step S401 Acquire an image captured by a panoramic camera as a first image.
  • Step S402 Determine the position of the speaker.
  • Step S403 From the two or more close-up cameras, select a close-up camera corresponding to the position to capture an image of the speaker as the second image.
  • Step S404 Map the image position of the speaker in the second image according to the position.
  • Step S405 Extract image data at a mapped image position in the second image to obtain a close-up image of the speaker.
  • Step S406 Display the close-up image and the first image.
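Step S405 amounts to cropping the mapped region out of the second image. The sketch below assumes the image is a list of pixel rows and the mapped image position is a `(left, top, right, bottom)` box in pixel coordinates; both representations are illustrative choices, not taken from the source.

```python
from typing import List, Sequence, Tuple


def extract_closeup(image: Sequence[Sequence[int]],
                    box: Tuple[int, int, int, int]) -> List[List[int]]:
    """Crop the region (left, top, right, bottom) from a row-major image."""
    left, top, right, bottom = box
    height = len(image)
    width = len(image[0]) if height else 0
    # Clamp the box to the image bounds so a rough mapping cannot overrun.
    left, right = max(0, left), min(width, right)
    top, bottom = max(0, top), min(height, bottom)
    return [list(row[left:right]) for row in image[top:bottom]]
```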
  • Steps S401, S402, and S403 in this embodiment correspond to the foregoing embodiments, and details are not described herein again.
  • in step S404, the image position of the speaker in the second image is mapped in order to roughly locate the image data to be extracted before extraction. This reduces the time spent matching image features in direct extraction and improves the efficiency of extracting the image data of the speaker or of the speaker's target part.
  • the target part mentioned here can be the face or upper body.
  • the image position of the speaker in the second image can be mapped in real time according to the four coordinate systems involved in the imaging process of the camera.
  • the four coordinate systems mentioned here are the world coordinate system, the camera coordinate system, the image coordinate system, and the pixel coordinate system.
  • the position can be mapped to the image position in the second image according to the mapping relationships from the world coordinate system to the camera coordinate system and from the camera coordinate system to the image coordinate system.
  • the position can be mapped to the image position in the second image according to the mapping relationships from the world coordinate system to the camera coordinate system, from the camera coordinate system to the image coordinate system, and from the image coordinate system to the pixel coordinate system.
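The chain of coordinate systems can be illustrated with the standard pinhole camera model. The source names the four coordinate systems but gives no formulas, so the model, the parameter names (`rotation`, `translation`, `focal_length_mm`, `pixel_size_mm`, `principal_point_px`), and the convention that camera coordinates equal rotation times world coordinates plus translation are all assumptions of this sketch.

```python
from typing import Sequence, Tuple


def world_to_pixel(point_world: Sequence[float],
                   rotation: Sequence[Sequence[float]],
                   translation: Sequence[float],
                   focal_length_mm: float,
                   pixel_size_mm: Tuple[float, float],
                   principal_point_px: Tuple[float, float]) -> Tuple[float, float]:
    """Map a world point to pixel coordinates with a pinhole model."""
    # World -> camera: rigid transform p_c = R * p_w + t.
    xc, yc, zc = (
        sum(rotation[i][j] * point_world[j] for j in range(3)) + translation[i]
        for i in range(3)
    )
    if zc <= 0:
        raise ValueError("point is not in front of the camera")
    # Camera -> image plane: perspective division (millimetres).
    x_img = focal_length_mm * xc / zc
    y_img = focal_length_mm * yc / zc
    # Image plane -> pixel: scale by pixel size and shift to the principal point.
    u = x_img / pixel_size_mm[0] + principal_point_px[0]
    v = y_img / pixel_size_mm[1] + principal_point_px[1]
    return u, v
```

Lens distortion is ignored here; a real close-up camera would usually need a distortion correction step as well.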
  • the correspondence between each image region of the panoramic image and each scene region in the real scene can be determined in advance.
  • the embodiment of the present invention can obtain the correspondence between each area in the conference room and each image area in the panoramic image, as shown in FIG. 4B.
  • the included angle between a subject in a given scene area of the conference room 100 and the center line of the screen along the y direction of the display 111 corresponds to a particular image area of the panoramic image. Here the panoramic image is divided into 6 image areas which, from left to right, correspond to the scene areas 0 to 30 degrees, 30 to 60 degrees, 60 to 90 degrees, 90 to 120 degrees, 120 to 150 degrees, and 150 to 180 degrees.
  • the angle ⁇ 1 between the speaker and the center line of the screen distributed in the y-direction of the display screen 111 is 65 degrees
  • the mapped image area is an image area corresponding to 60 degrees to 90 degrees (the shaded area in FIG. 4B).
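The mapping from the speaker's angle to one of the six image areas can be sketched as an integer division. The function name and the zero-based, left-to-right index are illustrative; equal-width areas covering 0 to 180 degrees follow the example in the text.

```python
from typing import Tuple


def image_region_for_angle(angle_deg: float, region_count: int = 6,
                           total_angle_deg: float = 180.0) -> Tuple[int, Tuple[float, float]]:
    """Return the zero-based region index and its angle range."""
    if not 0.0 <= angle_deg <= total_angle_deg:
        raise ValueError("angle is outside the panoramic field")
    width = total_angle_deg / region_count
    # The upper end of the field belongs to the last region.
    index = min(int(angle_deg // width), region_count - 1)
    return index, (index * width, (index + 1) * width)
```

For the 65-degree example, the function returns index 2, the third area from the left, covering 60 to 90 degrees.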
  • the area in the second image can be obtained through feature matching, and then the image position in the second image can be obtained. Specifically, the image position of the speaker in the second image can be mapped by the following operations:
  • the image region of the first image to which the position belongs is located.
  • the features of the speaker in the located image region of the first image are matched with the image features of the second image.
  • the designer of the present invention may also pre-generate position information, in the real scene, for each image area of the image captured by each close-up camera; the position information may be the scene area in the real scene that corresponds to each image area.
  • the determined position of the speaker is the relative angle between the speaker and the local display screen.
  • the relative angle mentioned here is the included angle between the speaker or the subject in the scene area and the center line of the screen distributed along the y direction in the display screen 111 shown in FIG. 1A.
  • the image position of the speaker in the second image can be obtained.
  • the following operations can be used to map the image position of the speaker in the second image:
  • the coordinates of the obtained image area are used as the mapped image position.
  • the position is the relative angle between the speaker and the local display;
  • the position information is the scene area corresponding to each image area in the real scene.
  • the step of generating the position information may include:
  • the relative angle between the scene area of the close-up camera and the close-up camera is calibrated.
  • from the calibrated relative angle and the relative position parameters of the close-up camera and the local display, the relative angle between the scene area of the close-up camera and the local display is calculated.
  • for example, the horizontal angle-of-view parameter of the close-up camera (such as 160 degrees) is used to calibrate the image taken by the close-up camera: the leftmost, middle, and rightmost image areas of a photo are at 10 degrees, 90 degrees, and 160 degrees relative to the camera, respectively. From these, the relative angle between a position in the image and the screen center line along the y direction of the display screen can be calculated.
  • after mapping the image position, if a close-up of the speaker's target part is required, then after the image data at the mapped position in the second image is extracted, the image data of the speaker's target part can be identified in and cropped from the extracted image data, and the cropped image data is determined as the close-up image.
  • a close-up camera corresponding to the position of the speaker can be selected from a plurality of close-up cameras to shoot the speaker, and the image captured by the panoramic camera and the extracted image data are then displayed. A close-up of the speaker, or of the speaker's target part, is thus shown together with the panoramic image, without a human director performing additional operations such as panning, tilting, or pushing in and pulling out on the close-up camera. Compared with related technologies that require a human director to perform such operations on the camera, this effectively improves close-up efficiency.
  • the speaker may move his position with time.
  • participants at different positions may alternately speak as speakers.
  • to keep the close-up of the speaker accurate, after the close-up is implemented the position of the speaker needs to be re-determined, and the close-up camera is then selected again to achieve a close-up of the speaker at the new position.
  • the method shown in FIG. 5 may include steps S501-S508:
  • Step S501 Acquire an image captured by a panoramic camera as a first image.
  • Step S502 Determine the position of the speaker.
  • Step S503 Select a close-up camera corresponding to the position from the two or more close-up cameras to capture the image of the speaker as the second image.
  • Step S504 Display the first image and the second image.
  • Step S505 Re-determine the position of the speaker.
  • Step S506 Determine whether the position change amount of the speaker is less than a predetermined change amount according to the re-determined position and the last determined position; if it is less, perform step S504. If it is not smaller, step S507 is performed.
  • Step S507 From the two or more close-up cameras, select the close-up camera corresponding to the re-determined position to capture an image of the speaker as a third image.
  • Step S508 Display the first image and the third image.
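The decision in steps S505-S507 can be sketched as a threshold test on the position change. The sketch assumes the position is a single relative angle in degrees and that `select_camera` is a caller-supplied function; both, along with the function names, are illustrative and not fixed by the source.

```python
from typing import Callable


def needs_reselection(last_position_deg: float, new_position_deg: float,
                      predetermined_change_deg: float) -> bool:
    """S506: true when the position change is not smaller than the threshold."""
    return abs(new_position_deg - last_position_deg) >= predetermined_change_deg


def update_closeup_camera(last_position_deg: float, new_position_deg: float,
                          current_camera: str,
                          select_camera: Callable[[float], str],
                          predetermined_change_deg: float) -> str:
    """Keep the current camera (back to S504) or select a new one (S507)."""
    if needs_reselection(last_position_deg, new_position_deg,
                         predetermined_change_deg):
        return select_camera(new_position_deg)   # S507
    return current_camera                        # S504 with the same camera
```

Skipping the selection for small movements avoids switching cameras every time the speaker shifts slightly.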
  • the designer of the present invention may predetermine conditions for re-determining the position of the speaker, such as periodicity, timing, and user triggering.
  • the embodiment of the present invention may predetermine a change amount, which is determined by the actual application scenario and the target venue of that scenario. For example, in a conference scenario, it can be set to the spacing between two participants.
  • the selected close-up camera remains the same whether the speaker has changed or the speaker's position has changed. In this case there is no need to frequently repeat the camera selection; the image taken by the previously selected close-up camera is used directly. That image may be the second image, or image data extracted from the second image.
  • before selecting, from two or more close-up cameras, the close-up camera corresponding to the re-determined position to capture an image of the speaker, it may be determined whether the re-determined position and the last determined position fall within the same predetermined position range. If yes, the operation of displaying the first image and the second image is performed; if not, the close-up camera corresponding to the re-determined position is selected from the two or more close-up cameras to shoot the speaker and obtain the third image.
  • for example, the position is the relative angle between the speaker and the center line of the screen along the y direction of the display screen 111. It can be determined whether the newly determined position and the last determined position are both between 75 degrees and 105 degrees; if so, the operation of displaying the first image and the second image is performed.
  • the present invention also provides an embodiment of the device.
  • FIG. 6 is a block diagram of an image display device according to an exemplary embodiment of the present invention.
  • the device may be applied to the image display system in the foregoing embodiment, and may include a first image acquisition module 610, a speaking position determination module 620, a second image acquisition module 630, and an image display module 640.
  • the first image acquisition module 610 is configured to acquire an image captured by a panoramic camera as a first image.
  • the speaking position determining module 620 is configured to determine the position of the speaker.
  • the second image acquisition module 630 is configured to select, from the two or more close-up cameras, a close-up camera corresponding to the position to capture an image of the speaker as a second image.
  • the image display module 640 is configured to display the first image and the second image.
  • the position includes a relative position parameter of the speaker and a local display screen;
  • the second image acquisition module 630 may include:
  • a position parameter acquisition module configured to retrieve the relative position parameters of each close-up camera and the local display
  • a relative position calculation module configured to calculate the relative position parameters of the speaker and each close-up camera based on the position and the retrieved relative position parameters
  • a camera selection module configured to select, from the two or more close-up cameras and according to the calculated relative position parameters, the close-up camera with the smallest relative position parameter to the speaker
  • a first acquisition submodule configured to acquire the image obtained by the selected close-up camera capturing the speaker.
  • the second image acquisition module 630 may include:
  • a predetermined relationship acquisition module configured to obtain a predetermined correspondence between each scene position and each close-up camera in a real scene; the close-up camera corresponding to each scene position is specifically used to close-up the speaker at the scene position;
  • a target camera selection module configured to select a close-up camera corresponding to the position as a target camera from the two or more close-up cameras according to the predetermined corresponding relationship
  • a second acquisition submodule is configured to acquire an image obtained by the target camera shooting the speaker.
  • the position is a relative angle between the speaker and the local display screen
  • the relative position parameter is a relative angle
  • the image display module 640 may include:
  • An image position mapping module configured to map an image position of the speaker in the second image according to the position
  • a close-up image extraction module configured to extract image data at a mapped image position in the second image to obtain a close-up image of the speaker
  • An image display submodule configured to display the close-up image and the first image.
  • the image display device may further include a target extraction module, configured to:
  • the cropped image data is determined as the close-up image.
  • the image position mapping module is configured to:
  • the image position mapping module is configured to:
  • the coordinates of the obtained image area are used as the mapped image position.
  • the position is the relative angle between the speaker and the local display screen
  • from the calibrated relative angle and the relative position parameters of the close-up camera and the local display, the relative angle between the scene area of the close-up camera and the local display is calculated.
  • FIG. 7 is a hardware structure diagram of the image display device of the present invention. In addition to the processor, memory, network interface, and non-volatile memory shown in FIG. 7, the image display device in which the apparatus of the embodiment is located may generally include other hardware according to the actual function of the device, and details are not described herein again.
  • the memory and the non-volatile memory are device-readable memory. The memory of the image display device may store program instructions executable by the processor; the processor may be coupled to the memory to read the program instructions stored in it and, in response, perform the operations of the image display method described above.
  • an embodiment of the present invention also provides a machine-readable storage medium (storage device / peripheral device / receiver device memory). The readable storage medium stores program instructions, which include instructions for each step of the foregoing method. When executed by one or more processors, the instructions cause the image display device to perform the operations of the corresponding method above.
  • Embodiments of the present invention may take the form of a computer program product implemented on one or more readable storage media (including but not limited to disk storage, CD-ROM, optical storage, etc.) containing program code.
  • Computer-readable storage media includes permanent and non-permanent, removable and non-removable media, and information storage can be accomplished by any method or technology.
  • Information may be computer-readable instructions, data structures, modules of a program, or other data.
  • machine-readable storage media include, but are not limited to: phase change memory (PRAM), static random access memory (SRAM), dynamic random access memory (DRAM), other types of random access memory (RAM), read-only memory (ROM), electrically erasable programmable read-only memory (EEPROM), flash memory or other memory technologies, compact disc read-only memory (CD-ROM), digital versatile disc (DVD) or other optical storage, magnetic tape cartridges, magnetic tape storage or other magnetic storage devices, or any other non-transmission media that can be used to store information accessible by computing devices.

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Studio Devices (AREA)

Abstract

Disclosed in embodiments of the present invention are an image display method, apparatus, system and device, and a readable storage medium. The method comprises: obtaining an image photographed by a panoramic camera as a first image; determining the position of a speaker; selecting a close-up camera corresponding to the position from two or more close-up cameras to photograph an image of the speaker as a second image; and displaying the first image and the second image. By implementing the embodiments of the present invention, the close-up camera corresponding to the position of the speaker can be quickly selected from a plurality of close-up cameras to photograph the speaker, and the image photographed by the panoramic camera and the image photographed by the selected close-up camera are then displayed. The close-up of the speaker is thus achieved without a human director performing additional operations on the close-up camera, which effectively improves close-up efficiency.

Description

Image display method, apparatus, system and device, and readable storage medium
Technical field
The present invention relates to the field of image processing technologies, and in particular, to an image display method, apparatus, system and device, and a readable storage medium.
Background
In a conference scenario, so that participants can see the speaker clearly, related technologies may set up multiple cameras at the venue and have a human director adjust at least some of them to take a close-up of the speaker, for example a frontal image of the speaker.
However, when a human director adjusts a camera, a series of operations such as panning, tilting, and pushing in and pulling out must be performed on it, which takes a long time and reduces close-up efficiency.
Summary of the Invention
The present invention provides a close-up image determination method, apparatus, system and device, and a readable storage medium, which address the problem in the related art that adjustment of camera equipment by a human director takes a long time and reduces close-up efficiency.
According to a first aspect of the embodiments of the present invention, an image display method is provided, including the following steps:
acquiring an image captured by a panoramic camera as a first image;
determining the position of a speaker;
selecting, from two or more close-up cameras, a close-up camera corresponding to the position to capture an image of the speaker as a second image;
displaying the first image and the second image.
In one embodiment, determining the position of the speaker includes:
acquiring a positioning result of an associated acoustic positioning device for the speaker;
determining the position of the speaker according to the positioning result.
在一个实施例中,所述位置包括所述发言者与本端显示屏的相对位置参数;从两个以上的特写摄像头中,选取与所述位置对应的特写摄像头拍摄所述发言者的图像,包括:In one embodiment, the position includes a relative position parameter of the speaker and a local display screen; and from more than two close-up cameras, selecting a close-up camera corresponding to the position to capture an image of the speaker, include:
调取各特写摄像头与本端显示屏的相对位置参数;Recall the relative position parameters of each close-up camera and the local display;
基于所述位置及调取的相对位置参数,计算所述发言者与各特写摄像头的相对位置参数;Calculating the relative position parameters of the speaker and each close-up camera based on the position and the retrieved relative position parameters;
根据计算出的相对位置参数,从两个以上的特写摄像头中,选取与所述发言者的相对位置参数最小的特写摄像头;Selecting a close-up camera with a minimum relative position parameter to the speaker from the two or more close-up cameras according to the calculated relative position parameters;
获取所选取的特写摄像头拍摄所述发言者得到的图像。An image obtained by the selected close-up camera to capture the speaker is acquired.
In one embodiment, selecting, from two or more close-up cameras, the close-up camera corresponding to the position to capture an image of the speaker includes:
acquiring a predetermined correspondence between scene positions in the real scene and the close-up cameras, where the close-up camera corresponding to a scene position is dedicated to taking close-ups of a speaker at that scene position;
selecting, from the two or more close-up cameras and according to the predetermined correspondence, the close-up camera corresponding to the position as the target camera; and
acquiring the image of the speaker captured by the target camera.
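A predetermined correspondence of this kind can be pictured as a lookup table. The sketch below assumes that scene positions are expressed as ranges of the speaker's horizontal angle relative to the display; the ranges, the camera names, and the function `target_camera` are hypothetical and serve only to illustrate the lookup.

```python
# Hypothetical correspondence table: each entry maps a half-open angular
# range [low, high) in degrees to the close-up camera dedicated to it.
CORRESPONDENCE = [
    (0.0, 90.0, "close_up_a"),
    (90.0, 180.0, "close_up_b"),
]


def target_camera(speaker_angle_deg, table=CORRESPONDENCE):
    """Return the camera dedicated to the scene position, or None."""
    for low, high, camera_id in table:
        if low <= speaker_angle_deg < high:
            return camera_id
    return None  # the position is not covered by any close-up camera
```

Unlike the minimum-parameter selection, this variant needs no calculation at run time, at the cost of configuring the table for each room.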
In one embodiment, the relative position parameter includes a relative angle and/or a relative distance.
In one embodiment, the position is the relative angle between the speaker and the local display screen, and the relative position parameter is a relative angle.
In one embodiment, the relative angle is a relative angle in the horizontal direction.
In one embodiment, the position includes the coordinates of the speaker in a predetermined coordinate system, and selecting, from two or more close-up cameras, the close-up camera corresponding to the position to capture an image of the speaker includes:
acquiring the predetermined coordinates of the center of the local display screen and the predetermined coordinates of each close-up camera;
calculating, according to the acquired predetermined coordinates and the position, the included angle that the center of the local display screen and each close-up camera form with respect to the speaker, this included angle being the relative angle between that close-up camera and the speaker; and
selecting, according to the calculated relative angles, the close-up camera with the smallest relative angle to the speaker as the target camera.
In one embodiment, if two or more close-up cameras are selected, the method further includes:
calculating the relative distance between each selected close-up camera and the speaker according to the predetermined coordinates of that close-up camera and the position; and
selecting, from the selected close-up cameras and according to the calculated relative distances, the close-up camera with the smallest relative distance to the speaker as the target camera.
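The coordinate-based selection and the distance tie-break can be sketched as follows. The sketch rests on assumptions that are not stated in the text: the included angle is read as the angle at the speaker between the direction to the display center and the direction to the camera, two angles are treated as equal when they differ by less than a small tolerance, and all function names and coordinates are hypothetical.

```python
import math


def _sub(a, b):
    """Component-wise difference a - b of two coordinate tuples."""
    return tuple(x - y for x, y in zip(a, b))


def _norm(v):
    """Euclidean length of a vector."""
    return math.sqrt(sum(x * x for x in v))


def included_angle_deg(speaker, display_center, camera):
    """Angle at the speaker between the display center and a camera."""
    to_center = _sub(display_center, speaker)
    to_camera = _sub(camera, speaker)
    n1, n2 = _norm(to_center), _norm(to_camera)
    if n1 == 0 or n2 == 0:
        raise ValueError("speaker coincides with the display center or a camera")
    cos_value = sum(a * b for a, b in zip(to_center, to_camera)) / (n1 * n2)
    # Clamp to [-1, 1] to guard against rounding before calling acos.
    return math.degrees(math.acos(max(-1.0, min(1.0, cos_value))))


def pick_target_camera(speaker, display_center, cameras, tol_deg=1e-6):
    """Smallest included angle wins; ties are broken by smallest distance."""
    angles = {
        cam_id: included_angle_deg(speaker, display_center, pos)
        for cam_id, pos in cameras.items()
    }
    smallest = min(angles.values())
    candidates = [c for c, a in angles.items() if a - smallest <= tol_deg]
    return min(candidates, key=lambda c: _norm(_sub(cameras[c], speaker)))
```

The tie-break only matters when several cameras share the smallest angle; with a single candidate the second `min` simply returns it.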
In one embodiment, if two or more close-up cameras are selected, the method further includes:
calculating the relative angle in the horizontal direction between each selected close-up camera and the speaker; and
selecting, from the selected close-up cameras, the close-up camera with the smallest relative angle to the speaker in the horizontal direction as the target camera.
In one embodiment, the method further includes the following steps:
re-determining the position of the speaker;
judging, according to the re-determined position and the previously determined position, whether the change in the speaker's position is less than a predetermined amount;
if it is less, performing the step of displaying the first image and the second image;
if it is not less, selecting, from the two or more close-up cameras, the close-up camera corresponding to the re-determined position, and taking the image of the speaker captured by that camera as a third image; and
displaying the first image and the third image.
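The re-determination logic amounts to a threshold on the position change. A minimal sketch follows, assuming that positions are horizontal angles in degrees and that the predetermined change amount is 10 degrees; the threshold value and all names are hypothetical.

```python
def update_close_up_camera(previous_angle_deg, new_angle_deg, current_camera,
                           camera_angles_deg, threshold_deg=10.0):
    """Keep the current camera for small movements, re-select otherwise."""
    change = abs(new_angle_deg - previous_angle_deg)
    if change < threshold_deg:
        # Small change: keep displaying the first and second images.
        return current_camera
    # Large change: re-select for the new position (source of the third image).
    return min(
        camera_angles_deg,
        key=lambda cam_id: abs(camera_angles_deg[cam_id] - new_angle_deg),
    )
```

The threshold keeps the displayed close-up from switching cameras when the speaker merely shifts in place.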
In one embodiment, displaying the first image and the second image includes:
mapping the position to the image position of the speaker in the second image;
extracting the image data at the mapped image position in the second image to obtain a close-up image of the speaker; and
displaying the close-up image and the first image.
In one embodiment, after the image data at the mapped position in the second image is extracted, the method includes:
identifying and cropping, from the extracted image data, the image data of a target part of the speaker; and
determining the cropped image data as the close-up image.
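The extraction and cropping steps can be illustrated with a plain rectangular crop. In this sketch the image is assumed to be a list of pixel rows, and the bounding box is assumed to come either from the mapped image position or from a detector that identifies the target part; the detector itself is outside the scope of the sketch, and the function name `crop` is hypothetical.

```python
def crop(image, left, top, right, bottom):
    """Return the pixels inside the half-open box [left, right) x [top, bottom).

    image is a list of rows, each row a list of pixel values. The box is
    clamped to the image bounds so that a box partly outside the image
    still yields the overlapping part.
    """
    height = len(image)
    width = len(image[0]) if height else 0
    left, right = max(0, left), min(width, right)
    top, bottom = max(0, top), min(height, bottom)
    return [row[left:right] for row in image[top:bottom]]


# A 3 x 4 test image whose pixel value encodes its row and column.
image = [[row * 10 + col for col in range(4)] for row in range(3)]
close_up = crop(image, 1, 0, 3, 2)
```

Applying `crop` twice, first with the mapped image position and then with the box of the target part, mirrors the two-stage extraction described above.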
In one embodiment, mapping the position to the image position of the speaker in the second image includes:
acquiring the correspondence between the image areas of the panoramic image and the scene areas in the real scene;
locating, based on the correspondence, the image area of the first image to which the position belongs;
finding, through feature matching, the image area of the second image whose features match those of the located image area; and
taking the coordinates of the matched image area as the mapped image position.
In one embodiment, mapping the position to the image position of the speaker in the second image includes:
retrieving position information that describes where each image area of each close-up camera lies in the real scene;
matching the retrieved position information against the position;
obtaining, according to the matching result, the image area of the second image to which the speaker belongs; and
taking the coordinates of the obtained image area as the mapped image position.
In one embodiment, the position is the relative angle between the speaker and the local display screen;
the position information is the relative angle between the local display screen and the scene area that corresponds to each image area in the real scene.
In one embodiment, the step of pre-generating the position information includes:
calibrating, according to the lens angle of view of each close-up camera, the relative angle between the scene area of that close-up camera and the close-up camera; and
calculating, according to the calibrated relative angle and the relative position parameter between that close-up camera and the local display screen, the relative angle between the scene area of that close-up camera and the local display screen.
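One way to picture the pre-generated position information and the later matching is the sketch below. It makes several simplifying assumptions that the text does not state: the image is divided into equal vertical strips, the viewing angle varies linearly across the image (a real lens does not map angles exactly linearly), the offset between the camera and the display center is ignored, and the camera's relative position parameter is its optical-axis angle relative to the display. All names and numbers are hypothetical.

```python
def region_angles(axis_angle_deg, fov_deg, n_regions):
    """Pre-generate (low, high) display-relative angles for each image strip.

    axis_angle_deg: angle of the camera's optical axis relative to the display.
    fov_deg: horizontal angle of view of the lens.
    """
    step = fov_deg / n_regions
    start = axis_angle_deg - fov_deg / 2.0  # camera-relative -fov/2, shifted
    return [(start + i * step, start + (i + 1) * step) for i in range(n_regions)]


def locate_region(speaker_angle_deg, regions):
    """Return the index of the strip containing the speaker, or None."""
    for index, (low, high) in enumerate(regions):
        if low <= speaker_angle_deg <= high:
            return index
    return None


# Hypothetical camera: axis at 15 degrees, 60 degree lens, four strips.
regions = region_angles(15.0, 60.0, 4)
region_index = locate_region(20.0, regions)
```

The returned index identifies the strip whose pixel coordinates would serve as the mapped image position.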
According to a second aspect of the embodiments of the present invention, an image display system is provided, including an image display device, a panoramic camera, and two or more close-up cameras, the image display device including:
a display screen;
a processor; and
a memory storing processor-executable instructions;
where the processor is coupled to the memory and is configured to read the program instructions stored in the memory and, in response, perform the operations of the method described above.
In one embodiment, the panoramic camera and each close-up camera are installed on the image display device.
In one embodiment, the close-up cameras are at different relative angles to the display screen.
In one embodiment, the panoramic camera is installed at the upper frame of the image display device, a first close-up camera is installed at the left frame of the image display device, and a second close-up camera is installed at the right frame of the image display device.
In one embodiment, the relative angle between the first close-up camera and the display screen is between 10 degrees and 50 degrees;
the relative angle between the second close-up camera and the display screen is between 130 degrees and 170 degrees.
In one embodiment, the image display device further includes an acoustic positioning device for locating the speaker.
In one embodiment, the acoustic positioning device includes a microphone array.
In one embodiment, the image display device is a conference interactive device.
In one embodiment, the conference interactive device is a smart interactive tablet.
According to a third aspect of the embodiments of the present invention, an image display device is provided, including:
a display screen;
a processor; and
a memory storing processor-executable instructions;
where the processor is coupled to the memory and is configured to read the program instructions stored in the memory and, in response, perform the operations of the method described above.
In one embodiment, the image display device of the embodiments of the present invention is associated with a panoramic camera and at least two close-up cameras.
In one embodiment, the close-up cameras are at different relative angles to the display screen.
In one embodiment, the relative angle between one close-up camera and the display screen is between 10 degrees and 50 degrees, and the relative angle between another close-up camera and the display screen is between 130 degrees and 170 degrees.
In one embodiment, the image display device further includes an acoustic positioning device for locating the speaker.
In one embodiment, the acoustic positioning device includes a microphone array.
In one embodiment, the image display device is a conference interactive device.
In one embodiment, the conference interactive device is a smart interactive tablet.
According to a fourth aspect of the embodiments of the present invention, one or more machine-readable storage media are provided, on which instructions are stored that, when executed by one or more processors, perform the operations of the method described above.
According to a fifth aspect of the embodiments of the present invention, an image display apparatus is provided, including:
a first image acquisition module, configured to acquire an image captured by a panoramic camera as a first image;
a speaking position determination module, configured to determine the position of a speaker;
a second image acquisition module, configured to select, from two or more close-up cameras, the close-up camera corresponding to the position, and to take the image of the speaker captured by that camera as a second image; and
an image display module, configured to display the first image and the second image.
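The cooperation of the four modules can be sketched as a small class. This is a hedged illustration only: each module is modeled as a plain callable, positions are assumed to be horizontal angles in degrees, the camera is chosen by smallest angular difference, and the class name `ImageDisplayApparatus` and its fields are hypothetical.

```python
from dataclasses import dataclass
from typing import Callable, Dict, Tuple


@dataclass
class ImageDisplayApparatus:
    capture_panorama: Callable[[], str]          # first image acquisition
    locate_speaker: Callable[[], float]          # speaking position determination
    # camera id -> (angle relative to the display, capture function)
    close_up_cameras: Dict[str, Tuple[float, Callable[[], str]]]
    show: Callable[[str, str], None]             # image display

    def run_once(self) -> str:
        """Acquire both images, display them, and return the chosen camera id."""
        first_image = self.capture_panorama()
        angle = self.locate_speaker()
        # Second image acquisition: pick the camera closest in angle.
        camera_id = min(
            self.close_up_cameras,
            key=lambda c: abs(self.close_up_cameras[c][0] - angle),
        )
        second_image = self.close_up_cameras[camera_id][1]()
        self.show(first_image, second_image)
        return camera_id
```

Modeling the modules as injected callables keeps the sketch independent of any particular camera or display interface.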
In one embodiment, the position includes a relative position parameter between the speaker and the local display screen, and the second image acquisition module includes:
a position parameter retrieval module, configured to retrieve the relative position parameter between each close-up camera and the local display screen;
a relative position calculation module, configured to calculate the relative position parameter between the speaker and each close-up camera based on the position and the retrieved relative position parameters;
a camera selection module, configured to select, from the two or more close-up cameras and according to the calculated relative position parameters, the close-up camera whose relative position parameter with respect to the speaker is smallest; and
a first acquisition submodule, configured to acquire the image of the speaker captured by the selected close-up camera.
In one embodiment, the second image acquisition module includes:
a predetermined relationship acquisition module, configured to acquire a predetermined correspondence between scene positions in the real scene and the close-up cameras, where the close-up camera corresponding to a scene position is dedicated to taking close-ups of a speaker at that scene position;
a target camera selection module, configured to select, from the two or more close-up cameras and according to the predetermined correspondence, the close-up camera corresponding to the position as the target camera; and
a second acquisition submodule, configured to acquire the image of the speaker captured by the target camera.
In one embodiment, the position is the relative angle between the speaker and the local display screen, and the relative position parameter is a relative angle.
In one embodiment, the image display module includes:
an image position mapping module, configured to map the position to the image position of the speaker in the second image;
a close-up image extraction module, configured to extract the image data at the mapped image position in the second image to obtain a close-up image of the speaker; and
an image display submodule, configured to display the close-up image and the first image.
In one embodiment, the apparatus further includes a target extraction module, configured to:
identify and crop, from the extracted image data, the image data of a target part of the speaker; and
determine the cropped image data as the close-up image.
In one embodiment, the image position mapping module is configured to:
acquire the correspondence between the image areas of the panoramic image and the scene areas in the real scene;
locate, based on the correspondence, the image area of the first image to which the position belongs;
find, through feature matching, the image area of the second image whose features match those of the located image area; and
take the coordinates of the matched image area as the mapped image position.
In one embodiment, the image position mapping module is configured to:
retrieve position information that describes where each image area of each close-up camera lies in the real scene;
match the retrieved position information against the position;
obtain, according to the matching result, the image area of the second image to which the speaker belongs; and
take the coordinates of the obtained image area as the mapped image position.
In one embodiment, the position is the relative angle between the speaker and the local display screen;
the position information is the relative angle between the local display screen and the scene area that corresponds to each image area in the real scene.
In one embodiment, the module for pre-generating the position information is configured to:
calibrate, according to the lens angle of view of each close-up camera, the relative angle between the scene area of that close-up camera and the close-up camera; and
calculate, according to the calibrated relative angle and the relative position parameter between that close-up camera and the local display screen, the relative angle between the scene area of that close-up camera and the local display screen.
According to a sixth aspect of the embodiments of the present invention, a smart interactive tablet is provided, including a panoramic camera, a first close-up camera, and a second close-up camera, where the panoramic camera, the first close-up camera, and the second close-up camera are arranged on the frame of the smart interactive tablet, and the optical axes of the first close-up camera and the second close-up camera are inclined to the display plane of the smart interactive tablet.
In one embodiment, the smart interactive tablet is further configured to:
acquire an image captured by the panoramic camera as a first image;
determine the position of a speaker;
select, from two or more close-up cameras, the close-up camera corresponding to the position, and take the image of the speaker captured by that camera as a second image; and
display the first image and the second image.
In one embodiment, the relative angle between the optical axis of the first close-up camera and the display screen is between 10 degrees and 50 degrees;
the relative angle between the optical axis of the second close-up camera and the display screen is between 130 degrees and 170 degrees.
In the embodiments of the present invention, determining the position of the speaker makes it possible to quickly select, from a plurality of close-up cameras, the close-up camera corresponding to that position and to capture the speaker with it; the image captured by the panoramic camera and the image captured by the selected close-up camera are then displayed. A close-up of the speaker is thus obtained without a human director performing a series of additional operations on the close-up camera, such as panning, tilting, and zooming in and out. Compared with related technologies that require a human director to perform such a series of additional operations on the camera, this effectively improves the efficiency of producing close-ups.
BRIEF DESCRIPTION OF THE DRAWINGS
FIG. 1A is a schematic diagram of an image display system according to an exemplary embodiment of the present invention;
FIG. 1B is a schematic diagram of an image display system according to another exemplary embodiment of the present invention;
FIG. 1C is a schematic diagram of an image display system according to another exemplary embodiment of the present invention;
FIG. 2A is a processing logic diagram of an image display system according to another exemplary embodiment of the present invention;
FIG. 2B is an interaction diagram of an image display system according to another exemplary embodiment of the present invention;
FIG. 2C shows a picture displayed on a display screen according to an exemplary embodiment of the present invention;
FIG. 3 is a schematic diagram of an image display method according to an exemplary embodiment of the present invention;
FIG. 4A is a schematic diagram of an image display method according to another exemplary embodiment of the present invention;
FIG. 4B is a schematic diagram of the division of a panoramic image according to an exemplary embodiment of the present invention;
FIG. 5 is a schematic diagram of an image display method according to another exemplary embodiment of the present invention;
FIG. 6 is a block diagram of an image display apparatus according to an exemplary embodiment of the present invention;
FIG. 7 is a hardware structure diagram of an image display device according to an exemplary embodiment of the present invention.
DETAILED DESCRIPTION
Exemplary embodiments are described in detail here, and examples of them are shown in the accompanying drawings. Where the following description refers to the drawings, the same numerals in different drawings denote the same or similar elements unless otherwise indicated. The implementations described in the following exemplary embodiments do not represent all implementations consistent with the present invention. Rather, they are merely examples of apparatuses and methods consistent with some aspects of the invention as detailed in the appended claims.
The terminology used in the present invention is for the purpose of describing particular embodiments only and is not intended to limit the present invention. The singular forms "a", "said", and "the" used in the present invention and the appended claims are also intended to include the plural forms, unless the context clearly indicates otherwise. It should also be understood that the term "and/or" as used herein refers to and includes any or all possible combinations of one or more of the associated listed items.
It should be understood that although the terms first, second, third, and so on may be used in the present invention to describe various kinds of information, the information should not be limited by these terms. The terms are used only to distinguish information of the same type from one another. For example, without departing from the scope of the present invention, first information may also be called second information and, similarly, second information may also be called first information. Depending on the context, the word "if" as used herein may be interpreted as "at the time of", "when", or "in response to determining".
To improve the efficiency of producing close-ups, the embodiments of the present invention provide an image display method and an image display system for implementing the method. The image display system may include an image display device, a panoramic camera, and at least two close-up cameras.
The image display device may include a memory, a processor, and a display screen for displaying images. The memory stores program instructions executable by the processor; the processor is coupled to the memory and is configured to read the program instructions stored in the memory and, in response, perform the operations that implement the image display method of the embodiments of the present invention.
The display screen mentioned here may be a stand-alone display screen, such as an LED display screen, or the screen of an image display device with interactive capability, such as a touch display, a smart interactive tablet, or another computer device with interactive capability.
The panoramic camera is used to shoot the real scene of the target place. Compared with a close-up camera, it has a larger angle of view and a wider shooting range; in some examples, the panoramic camera may be a wide-angle camera.
The close-up cameras are used to take close-ups of the speaker in the target place. Depending on the close-up requirements of the actual application scenario, a designer may preset different close-up cameras to shoot different predetermined scene areas of the target place, and/or preset different close-up cameras to shoot the same predetermined scene area with different relative position parameters. The relative position parameter mentioned here may include the relative angle and/or the relative distance between the close-up camera and the subject in the predetermined scene area. During the actual close-up process, a human director usually does not need to adjust the close-up cameras as the speaker changes.
It should be noted that the target place and the predetermined scene areas of the close-up cameras may differ from one application scenario to another. For example, the application scenario may be a conference scenario, a smart education scenario, a live broadcast scenario, or any other scenario that requires close-ups of a speaker; correspondingly, the target place may be a conference room, a classroom, a live broadcast room, or any other place where close-ups of a speaker are required. The predetermined scene area may be the area where a conference speaker may appear, the area where a speaker may appear during a lesson, the area where a broadcast host may appear, or the area where a speaker may appear in other scenarios.
In addition, when the close-up requirements of the application scenario differ, the predetermined scene area of a close-up camera, and/or the relative position parameter between the close-up camera and the predetermined scene area, may also differ. In a conference scenario, for example: if the close-up is of the main presenter, who usually sits at the end of the conference table near the conference interactive device, the predetermined scene area may be the conference table area near that device; if the close-up is of any participant who may speak, the predetermined scene area may be the area where the participants are located; if a frontal close-up of the speaker is wanted, the relative angle between the close-up camera and the predetermined scene area is a front angle; and if an oblique close-up of the speaker is wanted, the relative angle is an oblique angle. Here, the front angle is the angle perpendicular to the front of the subject; the side angle is the angle perpendicular to the side of the subject; and an oblique angle is any angle passed through when moving away from the front angle, to the left or to the right around the subject in the predetermined scene area, toward the side angle.
Furthermore, depending on the application scenario and/or the close-up requirements, the panoramic camera and the close-up cameras may be accessories of the image display device and installed on it, or may exist independently of the image display device and be installed in the space outside it. The image display system of the embodiments of the present invention is described in detail below with reference to the drawings, taking a conference scenario as the application scenario and a conference room as the target place.
Refer to FIG. 1A, which is a schematic diagram of an image display system according to an exemplary embodiment of the present invention.
The image display system shown in FIG. 1A is arranged in a conference room 100 and may include an image display device 110 and, arranged on the image display device 110, a display screen 111, a panoramic camera 112, and at least two close-up cameras 113 and 114.
The image display device 110 may be a conference display device with a display function, or a conference interactive device that also has interactive capability, such as a smart interactive tablet. The panoramic camera 112, the close-up camera 113, and the close-up camera 114 are arranged on the frame of the smart interactive tablet, and the optical axes of the close-up camera 113 and the close-up camera 114 are both inclined to the display plane of the smart interactive tablet.
The conference room 100 may also include a conference table 120. In an actual conference, the speaker and participants A, B, C, D, E, and F are seated on the two sides of the conference table 120.
In one example, so that as much as possible of the real scene around the conference table 120 is captured, the panoramic camera 112 is installed at the upper frame of the image display device 110.
In one example, in order to take close-ups of speakers in different scene areas while the real scene of the conference room 100 is shot in panorama, different close-up cameras may be placed at different positions on the image display device 110, for example at different frames of the image display device 110, so that different close-up cameras shoot different scene areas.
To distinguish the close-up cameras that shoot different scene areas, the embodiments of the present invention may identify each close-up camera by its relative angle to the display screen 111; close-up cameras that shoot different scene areas are at different relative angles to the display screen 111. The relative angle here may be the included angle between the optical axis of the close-up camera and the display plane of the display screen 111, and its specific value may be determined by the scene area and the installation position of the close-up camera.
For example, taking the center line of the display screen 111 as the reference, the included angle θ2 between the optical axis of the close-up camera 113 and the center line of the display screen 111 running along the y direction is the relative angle between the close-up camera 113 and the display screen 111, and the included angle θ3 between the optical axis of the close-up camera 114 and the same center line is the relative angle between the close-up camera 114 and the display screen 111. In other embodiments, the relative angle between each close-up camera and the display screen 111 may also be calculated against other references, which are not described here.
Furthermore, to capture the front or side of the speaker in close-up as far as possible, at least one close-up camera 113 is installed at the left bezel of the image display device 110, and at least one close-up camera 114 is installed at the right bezel of the image display device 110.
Further, to capture the front of the speaker in close-up as far as possible, the close-up cameras 113 and 114 are installed with reference to the three-dimensional x-y-z coordinate system shown in FIG. 1A, in which the display screen 111 lies in the y-z plane and the center point of the display screen 111 is the coordinate origin, which may also serve as the point where the speaker's line of sight lands.
Once installed, the close-up camera 113 at the left bezel of the image display device 110 has a relative angle to the display screen 111 of between 10 degrees and 50 degrees (example values of θ2), and the close-up camera 114 at the right bezel of the image display device 110 has a relative angle to the display screen 111 of between 130 degrees and 170 degrees (example values of θ3). In conference rooms of certain sizes, the relative angle between the close-up camera 113 and the display screen 111 is 15 degrees and the relative angle between the close-up camera 114 and the display screen 111 is 165 degrees; installing the close-up cameras at these relative angles gives comparatively good close-ups of the speaker.
The speaker in the embodiments of the present invention may refer to an object that emits audio, such as a participant who speaks in the conference scene of this embodiment. To capture the speaker accurately in close-up, the position of the speaker is determined first, and the close-up camera corresponding to the determined position is then selected so as to obtain the image captured by the selected camera. The close-up camera mentioned here may be the camera that captures the scene area where the speaker is located, or the camera that captures the speaker in that scene area from the front.
To distinguish the image captured by the panoramic camera 112 from the image captured by the selected close-up camera 113 or 114, the embodiment of the present invention refers to the image captured by the panoramic camera 112 as the first image and to the image captured by the selected close-up camera 113 or 114 as the second image. After the first image and the second image are acquired, they are displayed on the display screen 111.
In addition, the audio emitted by the speaker differs from ambient audio. To locate the speaker accurately, the embodiment of the present invention may determine the position of the speaker from the positioning result of an acoustic positioning device 115 used to locate the speaker.
The acoustic positioning device 115 mentioned here may be mounted on the image display device 110, for example as a microphone array of the image display device 110. In other embodiments, the acoustic positioning device 115 may also exist independently of the image display device 110. The present invention may also determine the position of the speaker by means other than acoustic positioning, which is not described further here.
The specific way in which the position of the speaker is determined through the acoustic positioning device 115 may depend on the positioning principle of the acoustic positioning device 115. In one example, the acoustic positioning device 115 may include vertically arranged microphones and horizontally arranged microphones; from the time difference with which two microphones pick up the audio signal, combined with the spatial positions of the microphones, the relative position parameters of the speaker with respect to the acoustic positioning device 115 are obtained. The relative position parameters mentioned here include a relative angle and/or a relative distance.
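The time-difference principle described above can be illustrated with a minimal far-field sketch for a single microphone pair. The embodiment does not specify a particular algorithm; the function name, the far-field approximation, and the 343 m/s speed of sound are assumptions made only for illustration.

```python
import math

SPEED_OF_SOUND = 343.0  # m/s, assumed value for air at room temperature


def arrival_angle_deg(time_diff_s: float, mic_spacing_m: float) -> float:
    """Estimate the angle of arrival relative to the broadside of a microphone pair.

    Far-field assumption: path difference = mic_spacing * sin(angle).
    """
    ratio = SPEED_OF_SOUND * time_diff_s / mic_spacing_m
    ratio = max(-1.0, min(1.0, ratio))  # clamp values pushed out of range by noise
    return math.degrees(math.asin(ratio))
```

A horizontally arranged pair would give a horizontal angle in this way and a vertically arranged pair a vertical angle; estimating a relative distance would need more microphones or a different method.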
In another example, with reference to a predetermined three-dimensional coordinate system, the acoustic positioning device 115 or the processor of the image display device 110 may calculate the coordinates of the speaker in the three-dimensional coordinate system from the relative position parameters of the speaker with respect to the acoustic positioning device 115 and the three-dimensional coordinates of the acoustic positioning device 115.
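As a rough sketch of this conversion, the relative distance and angles can be treated as spherical coordinates centered on the acoustic positioning device. The angle convention (azimuth measured in the x-y plane from the +x axis, elevation measured from the x-y plane) and the function name are assumptions for illustration, not taken from the embodiment.

```python
import math


def speaker_coordinates(device_xyz, distance, azimuth_deg, elevation_deg=0.0):
    """Convert a position relative to the device into predetermined-system coordinates."""
    az = math.radians(azimuth_deg)
    el = math.radians(elevation_deg)
    # Offset of the speaker from the device, in the assumed convention.
    dx = distance * math.cos(el) * math.cos(az)
    dy = distance * math.cos(el) * math.sin(az)
    dz = distance * math.sin(el)
    x0, y0, z0 = device_xyz
    return (x0 + dx, y0 + dy, z0 + dz)
```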
In other examples, the acoustic positioning device 115 or the processor of the image display device 110 may pre-store the relative position parameters of the display screen 111 with respect to the acoustic positioning device 115, and then calculate the relative position parameters of the speaker with respect to the display screen 111 from the relative position parameters of the speaker with respect to the acoustic positioning device 115 and the pre-stored relative position parameters. The relative position parameters mentioned here include a relative angle and a relative distance.
For example, in FIG. 1A, taking the screen center line of the display screen 111 running along the y direction as the reference, the length of the vector pointing from the speaker to the center of the display screen 111 is the relative distance between the speaker and the display screen 111, and the angle between that vector and the screen center line along the y direction is the relative angle between the speaker and the display screen 111; this angle and θ1 are vertical angles and are therefore equal.
Based on the position of the speaker determined as in the foregoing embodiments, the close-up camera corresponding to the determined position may be selected, and the image captured by that close-up camera is used as the second image.
As can be seen from the above, in the image display system shown in FIG. 1A the panoramic camera 112 and the at least two close-up cameras 113 and 114 are installed on the image display device 110. In other embodiments, the panoramic camera and the at least two close-up cameras may also exist independently of the image display device 110; see FIG. 1B for details.
Refer to FIG. 1B, which is a schematic diagram of an image display system according to another exemplary embodiment of the present invention.
The image display system shown in FIG. 1B is arranged in the conference room 100 and may include an image display device 110; a memory (not shown), a processor (not shown), and a display screen 111 provided on the image display device 110; and a panoramic camera 131 and at least two close-up cameras 132 and 133 arranged in the three-dimensional space outside the image display device 110.
For the technical content of this embodiment, refer to the foregoing embodiments, which is not repeated here. The difference is that the panoramic camera 131 and the at least two close-up cameras 132 and 133 are arranged in the three-dimensional space outside the image display device 110.
Accordingly, the relative distance between the image display device 110 and the panoramic camera 131 and the at least two close-up cameras 132 and 133 may increase along at least one coordinate axis.
In addition, as can be seen from the foregoing embodiments, if the speaker's line of sight lands on the center of the display screen 111 or on the screen center line running along the y direction, and the optical axis of each close-up camera is parallel to the horizontal plane containing that center line, then the relative angles of the speaker and of each close-up camera to the display screen 111 are relative angles in the horizontal direction (in the x-y plane), and θ1, θ2, θ3 shown in FIG. 1A and FIG. 1B are equal to θ1p, θ2p, θ3p shown in FIG. 1C.
If the speaker's line of sight lands above or below the center of the display screen 111, θ1p is the projection of θ1 on the horizontal plane. If the optical axis of a close-up camera is inclined out of the horizontal plane, θ2p and θ3p are the projections of θ2 and θ3 on the horizontal plane, respectively. FIG. 1C is the projection on the horizontal plane of the display system shown in FIG. 1A; for the technical content involved, refer to the embodiment of FIG. 1A, which is not repeated here.
The processing logic by which the image display system of the embodiment of the present invention carries out the image display process that realizes close-ups is described below with reference to FIG. 2A and FIG. 2B.
The image display system shown in FIG. 2A may include an image display device 210, a panoramic camera 221, and at least two close-up cameras 222 and 223. The image display device 210 may include a processor 211, a display screen 212, a memory 213, a non-volatile memory 214, and a device interface 215 connected through an internal bus. The panoramic camera 221 and the at least two close-up cameras 222 and 223 are connected to the processor 211 through the device interface 215. The specific form of the device interface 215 may match the interfaces of the panoramic camera 221 and the at least two close-up cameras 222 and 223, for example a USB interface.
In addition, the image display system may further include a positioning device for determining the position of the speaker, such as an acoustic positioning device. The acoustic positioning device may be associated with the image display device 210 while existing independently of the image display device; it may also be a microphone array installed on the image display device 210 and connected to the processor 211 through the internal bus.
A designer may store the program instructions that realize close-ups (the program instructions corresponding to the processing logic 213a) in the non-volatile memory 214. During actual image display, the processor 211 reads the program instructions into the memory 213 and runs them, and in response performs the operations shown in the processing logic 213a: acquiring the image captured by the panoramic camera 221 as the first image; determining the position of the speaker; selecting, from the two or more close-up cameras 222 and 223, the image of the speaker captured by the close-up camera corresponding to the position as the second image; and displaying the first image and the second image on the display screen 212.
The processing logic 213a may be implemented through interaction between the devices in the image display system; see FIG. 2B for the specific interaction process.
Referring to FIG. 2B, in an actual application scenario the panoramic camera 221 and the two or more close-up cameras 222 and 223 capture their respective shooting areas at a set frequency (step S201) and send the captured images to the processor 211 through the device interface 215 (step S202). So that the processor 211 can tell the images apart, the panoramic camera 221 and the two or more close-up cameras 222 and 223 may each send their own identifier, or their relative position to the display screen, to the processor 211 together with the image.
The processor 211 may select from these the panoramic image captured by the panoramic camera 221 as the first image (step S203). If a speaker in the target scene speaks before the first image is sent to the display screen, the positioning device 230 may determine the position of the speaker (step S204) and send it to the processor 211 (step S205). The processor 211 then selects, according to the position, the image sent by the close-up camera corresponding to the position from the images sent by the two or more close-up cameras 222 and 223 as the second image (step S206) and sends the first image and the second image to the display screen 212 (step S207), and the display screen 212 displays the first image and the second image (step S208).
In one example, in the conference scene shown in FIG. 1A, the first image and the second image displayed on the display screen 212 are as shown in FIG. 2C. The second image may be superimposed on the first image or may float over the first image; the specific display mode may be set by the relevant personnel according to actual needs.
If no speaker in the target scene speaks before the first image is sent to the display screen, the processor 211 sends the first image to the display screen 212, and the display screen 212 displays the first image.
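The processor-side selection in steps S203 to S207, including the case in which nobody is speaking, can be sketched as follows. This is only an illustration under assumptions: frames are represented as a dictionary keyed by a camera identifier, and `pick_camera` is a hypothetical callback standing in for whichever position-to-camera rule is used.

```python
from typing import Callable, Dict, List, Optional


def frames_to_display(
    frames: Dict[str, object],
    panoramic_id: str,
    speaker_position: Optional[object],
    pick_camera: Callable[[object], str],
) -> List[object]:
    """Return the images to send to the display screen for one capture cycle."""
    first_image = frames[panoramic_id]           # S203: panoramic image
    if speaker_position is None:                 # nobody is speaking
        return [first_image]
    close_up_id = pick_camera(speaker_position)  # S206: choose close-up camera
    return [first_image, frames[close_up_id]]    # S207: first and second image
```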
In addition, in some application scenarios, when the image display device of FIG. 1A to FIG. 2B is a smart interactive tablet, the smart interactive tablet may be an all-in-one device that integrates one or more functions such as a projector, an electronic whiteboard, a projection screen, a sound system, a television, and a video conference terminal.
The smart interactive tablet may also establish a data connection with at least one external device. External devices include, but are not limited to, smartphones, USB flash drives, laptop computers, desktop computers, tablet computers, and personal digital assistants (PDAs).
The communication methods for the data connection between the external device and the smart interactive tablet include, but are not limited to, USB connection, the Internet, a local area network, Bluetooth, Wi-Fi, and ZigBee, which is not limited in the embodiments of the present invention.
Further, when the smart interactive tablet exchanges data with at least one external device, the external device sends screen projection data to the smart interactive tablet so that the smart interactive tablet displays the projected content of that data. The external device acts as a screen projection client; in general there may be one or more screen projection clients, set according to the specific application scenario, which is not limited in the embodiments of the present invention.
The image display method of the embodiments of the present invention is described in detail below with reference to the accompanying drawings.
Refer to FIG. 3, which is a flowchart of an image display method according to an exemplary embodiment of the present invention. This embodiment may be applied to an image display system to capture a speaker in close-up, and includes the following steps S301 to S304:
Step S301: acquire the image captured by the panoramic camera as the first image.
Step S302: determine the position of the speaker.
Step S303: from two or more close-up cameras, select the image of the speaker captured by the close-up camera corresponding to the position as the second image.
Step S304: display the first image and the second image.
For the technical content of this embodiment, refer to the foregoing embodiments, which is not repeated here. The method of this embodiment may be implemented by an image display system.
If the image display system includes a single image display device as described with reference to FIG. 1A to FIG. 2B, the method of this embodiment may be applied to and executed by that single image display device.
If the image display system includes multiple image display devices, for example with an image display device set against each of three walls of the conference room shown in FIG. 1B and a panoramic camera and at least two close-up cameras associated with and arranged around each image display device, the method may be executed by each image display device or by a master control device of the image display devices. The positioning device determines which image display device the speaker is facing, the position of the speaker is then determined with that image display device as the reference, and the corresponding close-up camera is selected from the close-up cameras associated with the image display devices.
In practice, after the position of the speaker has been determined, the specific way in which the image captured by a close-up camera is selected may depend on the specific form of the determined position or on preparatory work done in advance. Several cases are listed below:
Case 1: the speaker speaks facing the image display device, and an acoustic positioning device is used to locate the speaker. The acoustic positioning device may include vertically arranged microphones and horizontally arranged microphones; from the time difference with which two microphones pick up the audio signal, combined with the spatial positions of the microphones, the relative position parameters of the speaker with respect to the acoustic positioning device are obtained. These relative position parameters are as described above and are not repeated here.
The position determined in this embodiment is the relative position parameters of the speaker with respect to the acoustic positioning device. When a close-up camera is selected according to the position of the speaker, the relative position parameters of each close-up camera with respect to the acoustic positioning device can be obtained from the position of each close-up camera and the position of the acoustic positioning device. From the determined position of the speaker and the relative position parameters of each close-up camera with respect to the acoustic positioning device, the relative position parameters of each close-up camera with respect to the speaker are calculated, and the close-up camera is then selected with reference to the calculated relative position parameters. When a frontal close-up is needed, the close-up camera with the smallest relative angle is selected; when a clearer, larger close-up is needed, the close-up camera with the smaller relative distance is selected.
It should be noted that the relative distance in the relative position parameters may refer to the straight-line distance between the position of the speaker and the position of the close-up camera, such as the dashed line between the speaker and the close-up camera 114 in FIG. 1A, or to its projection in a certain direction. The relative angle in the relative position parameters is the angle between the vector pointing from the speaker to the close-up camera and the direction the speaker is facing, or the projection of that angle in a certain direction or on a certain plane.
For example, when the speaker's line of sight lands on the center of the display screen of the image display device, the relative angle is the angle between the vector pointing from the speaker to the close-up camera and the vector pointing from the speaker to the center of the display screen, such as θ4 shown in FIG. 1A, or the projection of that angle in a certain direction or on a certain plane. The plane mentioned here may be the horizontal plane.
Case 2: the acoustic positioning device pre-stores its own position coordinates with reference to a predetermined coordinate system. After locating the relative position of the speaker with respect to the acoustic positioning device, it can then locate the position coordinates of the speaker in the predetermined coordinate system.
When a close-up camera is selected according to the position of the speaker, if the speaker's line of sight lands on the coordinate origin, the embodiment of the present invention may calculate the relative position parameters of each close-up camera with respect to the speaker directly from the position coordinates of each camera in the predetermined coordinate system and the position coordinates of the speaker, and then select the close-up camera with reference to the calculated relative position parameters.
In one example, to capture the front of the speaker in close-up as far as possible, when the position includes the coordinates of the speaker in the predetermined coordinate system, the embodiment of the present invention may select, from two or more close-up cameras, the image of the speaker captured by the close-up camera corresponding to the position through the following operations:
Acquire the predetermined coordinates of the center of the local display screen and the predetermined coordinates of each close-up camera; the local display screen is so called in contrast to a remote display screen.
From the acquired predetermined coordinates and the position, calculate for each close-up camera the angle subtended at the speaker by the center of the local display screen and that close-up camera; this is the relative angle between that close-up camera and the speaker.
From the calculated relative angles, select the close-up camera with the smallest relative angle to the speaker as the target camera.
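The three operations above can be sketched as follows, assuming all coordinates are already expressed in the same predetermined coordinate system. The function names and the camera identifiers in the usage example are hypothetical, and the sketch assumes the speaker does not coincide with the screen center or with a camera.

```python
import math


def angle_at_speaker(speaker, screen_center, camera):
    """Angle in degrees between speaker->screen_center and speaker->camera."""
    u = [c - s for c, s in zip(screen_center, speaker)]
    v = [c - s for c, s in zip(camera, speaker)]
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    cos_angle = max(-1.0, min(1.0, dot / norm))  # guard against rounding error
    return math.degrees(math.acos(cos_angle))


def select_target_camera(speaker, screen_center, cameras):
    """Return the id of the close-up camera with the smallest relative angle."""
    return min(
        cameras,
        key=lambda cid: angle_at_speaker(speaker, screen_center, cameras[cid]),
    )
```

For instance, `select_target_camera((3.0, 1.0, 0.0), (0.0, 0.0, 0.0), {"cam_a": (0.0, -2.0, 0.0), "cam_b": (0.0, 0.5, 0.0)})` picks `"cam_b"`, whose direction from the speaker deviates least from the speaker's line of sight to the screen center.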
The predetermined coordinate system may be the three-dimensional coordinate system shown in FIG. 1A, which is not described further here.
In addition, in the scene shown in FIG. 1A, when the speaker is at the position shown in FIG. 1A, the relative angle between the speaker and the close-up camera 114 is smaller than the relative angle between the speaker and the close-up camera 113. The close-up camera 114 is therefore selected as the target camera.
As can be seen from the foregoing image display system, in some scenes where many close-up cameras are arranged in the three-dimensional space, two or more close-up cameras may share the smallest relative angle to the speaker. The embodiment of the present invention may display the images captured by all of the selected close-up cameras as second images. However, to reduce the extent to which superfluous second images cover the first image, in one example the close-up camera may be further selected according to relative distance, through the following operations:
From the predetermined coordinates of each selected close-up camera and the position, calculate the relative distance between that close-up camera and the speaker.
From the calculated relative distances, select from the selected close-up cameras the one with the smallest relative distance to the speaker as the target camera.
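A minimal sketch of this distance-based tie-break is given below. It assumes the relative angles have already been computed; the tolerance used to decide that two angles are "equal", the function name, and the camera identifiers are illustrative assumptions.

```python
import math


def nearest_of_tied(speaker, positions, angles, tolerance_deg=1e-6):
    """Among cameras sharing the smallest relative angle, return the nearest one.

    positions: camera id -> (x, y, z); angles: camera id -> relative angle in degrees.
    """
    best = min(angles.values())
    tied = [cid for cid, angle in angles.items() if angle - best <= tolerance_deg]

    def distance(cid):
        return math.sqrt(sum((c - s) ** 2 for c, s in zip(positions[cid], speaker)))

    return min(tied, key=distance)
```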
In another example, the relative angle may be projected onto the two-dimensional horizontal plane to obtain the relative angle in the horizontal direction, and the close-up camera is then selected on that basis; for the projection, see FIG. 1A and FIG. 1C. The selection may continue through the following operations:
Calculate the relative angle in the horizontal direction between each selected close-up camera and the speaker.
From the selected close-up cameras, select the one with the smallest relative angle to the speaker in the horizontal direction as the target camera.
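The horizontal relative angle can be sketched by dropping the vertical component before measuring the angle. This assumes, as in FIG. 1A, that the x-y plane is horizontal and z is vertical; the function name is hypothetical.

```python
import math


def horizontal_angle_deg(speaker, screen_center, camera):
    """Relative angle at the speaker after projecting both sight lines onto the x-y plane."""
    ux, uy = screen_center[0] - speaker[0], screen_center[1] - speaker[1]
    vx, vy = camera[0] - speaker[0], camera[1] - speaker[1]
    # atan2 of the 2-D cross and dot products gives the signed angle; keep its magnitude.
    angle = math.atan2(ux * vy - uy * vx, ux * vx + uy * vy)
    return abs(math.degrees(angle))
```

A camera mounted directly above the screen center then has a horizontal relative angle of zero, even though its three-dimensional relative angle is not zero.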
In other embodiments, the coordinate origin of the predetermined three-dimensional coordinate system is not the center of the display screen 111 as in FIG. 1A, while the speaker's line of sight lands on the center of the display screen 111. The embodiment of the present invention may acquire in advance the predetermined coordinates of the center of the display screen 111 (its coordinates in the three-dimensional coordinate system), then calculate the relative position of each close-up camera with respect to the speaker from the predetermined coordinates of the center of the display screen 111, the position coordinates of the speaker, and the position coordinates of each close-up camera, and select the close-up camera with reference to the calculated relative positions.
Case 3: since the embodiment of the present invention realizes close-ups without a human director adjusting the installation position and shooting angle of each close-up camera, close-up efficiency can be further improved by determining in advance the relative position parameters of each close-up camera with respect to the display screen of the image display device (for example, in FIG. 1A, the angle between the close-up camera and the center line of the display screen along the y direction). After the position of the speaker is determined, the predetermined relative position parameters are retrieved, the relative position of each close-up camera with respect to the speaker is calculated, and the close-up camera is selected with reference to the calculated relative positions.
In one example, the position includes the relative position parameters of the speaker with respect to the local display screen. The embodiment of the present invention may select, from two or more close-up cameras, the image of the speaker captured by the close-up camera corresponding to the position through the following operations:
Retrieve the relative position parameters of each close-up camera with respect to the local display screen.
From the position and the retrieved relative position parameters, calculate the relative position parameters of the speaker with respect to each close-up camera.
From the calculated relative position parameters, select from the two or more close-up cameras the one whose relative position parameter with respect to the speaker is smallest.
Acquire the image of the speaker captured by the selected close-up camera.
In other embodiments, the close-up effect of each close-up camera on speakers in different scene areas may be estimated from the relative angle between each camera and the local display screen and the relative angle between a speaker in each scene area and the local display screen. Each camera is then associated with a range of relative angles between the speaker and the local display screen, and once the position of the speaker, or the relative angle between the speaker and the local display screen, has been determined, the close-up camera is selected according to this correspondence.
For example, in the scene shown in FIG. 1A, the relative angle between the close-up camera 114 and the local display screen 111 is 165 degrees and the relative angle between the close-up camera 113 and the local display screen 111 is 15 degrees. A relative angle between the speaker and the local display screen 111 of 0 to 90 degrees corresponds to the close-up camera 114, and a relative angle of 90 to 180 degrees corresponds to the close-up camera 113.
Case 4: Because different close-up cameras can be dedicated to close-ups of speakers in different predetermined scene areas, the correspondence between each scene position in the real scene and each close-up camera can be determined in advance to further improve close-up efficiency. The close-up camera corresponding to each scene position is dedicated to close-ups of a speaker at that scene position. For example, the relative position parameter between the close-up camera corresponding to a scene position and a speaker at that scene position is smaller than that of the other close-up cameras.
After the speaker's position is determined, the close-up camera corresponding to the position is selected according to the predetermined correspondence. In one example, the following operations may be used to select, from two or more close-up cameras, the close-up camera corresponding to the position to capture an image of the speaker:
Acquire the predetermined correspondence between each scene position in the real scene and each close-up camera.
According to the predetermined correspondence, select, from the two or more close-up cameras, the close-up camera corresponding to the position as the target camera.
Acquire the image of the speaker captured by the target camera.
With the above embodiments, once the first image and the second image are obtained, they are displayed directly, so that a close-up of the speaker is presented while the panoramic image is displayed. No human director is needed to pan, tilt, or push and pull the close-up camera, so the close-up can be produced quickly. Compared with related technologies in which a human director must perform a series of additional operations on the camera, close-up efficiency is effectively improved.
For example, in the scene shown in FIG. 1A, the close-up camera 114 may be dedicated to close-ups of speakers whose relative angle to the local display screen 111 is in the range of 0 to 90 degrees, and the close-up camera 113 may be dedicated to close-ups of speakers whose relative angle to the local display screen 111 is in the range of 90 to 180 degrees.
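The predetermined correspondence of Case 4 amounts to a lookup table. The sketch below encodes the FIG. 1A example as angle ranges. The names `ANGLE_RANGES` and `camera_for_angle` are hypothetical, and assigning the shared boundary of exactly 90 degrees to camera 113 is an assumption, since the text lists 90 degrees in both ranges.

```python
# (lower bound inclusive, upper bound exclusive, camera id), from the FIG. 1A example.
ANGLE_RANGES = [
    (0.0, 90.0, 114),
    (90.0, 180.0, 113),
]


def camera_for_angle(angle_deg):
    """Look up the close-up camera dedicated to the speaker's relative angle."""
    if not 0.0 <= angle_deg <= 180.0:
        raise ValueError("relative angle must lie between 0 and 180 degrees")
    for lower, upper, camera_id in ANGLE_RANGES:
        if lower <= angle_deg < upper:
            return camera_id
    # Only angle_deg == 180.0 reaches this point; it belongs to the last range.
    return ANGLE_RANGES[-1][2]
```

Because the table is fixed in advance, selecting a camera at run time needs no geometric calculation, which matches the efficiency argument made for this case.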
For the above cases, in some embodiments it may first be determined whether the speaker is within the scene area that each close-up camera can capture (its shooting area). The close-up camera corresponding to the speaker's position is then selected from among the close-up cameras whose shooting areas contain the speaker.
In other application scenarios, the first image and the second image are not displayed directly after they are obtained; instead, only the speaker, or a target part of the speaker, is shown in close-up. The image of the speaker or the target part is first extracted from the second image, and then the first image and the extracted image are displayed. For the specific implementation, refer to FIG. 4A. The method shown in FIG. 4A may include steps S401-S406:
Step S401: Acquire an image captured by the panoramic camera as the first image.
Step S402: Determine the position of the speaker.
Step S403: From two or more close-up cameras, select the close-up camera corresponding to the position to capture an image of the speaker as the second image.
Step S404: According to the position, map the image position of the speaker in the second image.
Step S405: Extract the image data at the mapped image position in the second image to obtain a close-up image of the speaker.
Step S406: Display the close-up image and the first image.
Steps S401, S402, and S403 of this embodiment correspond to the foregoing embodiments and are not described again here.
In step S404, the speaker's image position in the second image is mapped so that the location of the image data to be extracted is first roughly determined and the extraction is then performed there. This reduces the time spent matching image features compared with direct extraction, and improves the efficiency of extracting the image data of the speaker or of the speaker's target part. The target part mentioned here may be the face or the upper body.
When the position is actually mapped to a position in the second image, the image position of the speaker in the second image can be mapped in real time according to the four coordinate systems involved in the imaging process of the camera: the world coordinate system, the camera coordinate system, the image coordinate system, and the pixel coordinate system.
If the second image uses the image coordinate system to describe the coordinates of points in the image, the position can be mapped to the image position in the second image according to the mappings from the world coordinate system to the camera coordinate system and from the camera coordinate system to the image coordinate system.
If the second image uses the pixel coordinate system to describe the coordinates of pixels in the image, the position can be mapped to the image position in the second image according to the mappings from the world coordinate system to the camera coordinate system, from the camera coordinate system to the image coordinate system, and from the image coordinate system to the pixel coordinate system.
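The chain of coordinate mappings described above can be illustrated with the standard pinhole camera model. The sketch assumes an ideal camera without lens distortion. The extrinsic parameters (`rotation`, `translation`) and intrinsic parameters (`focal_length`, pixel sizes `dx` and `dy`, principal point `u0`, `v0`) are hypothetical calibration values that the text does not specify.

```python
def world_to_camera(point_w, rotation, translation):
    """World -> camera coordinates: X_c = R * X_w + t (rigid transform)."""
    return tuple(
        sum(rotation[i][j] * point_w[j] for j in range(3)) + translation[i]
        for i in range(3)
    )


def camera_to_image(point_c, focal_length):
    """Camera -> image-plane coordinates by perspective projection."""
    x, y, z = point_c
    if z <= 0:
        raise ValueError("point is not in front of the camera")
    return (focal_length * x / z, focal_length * y / z)


def image_to_pixel(point_i, dx, dy, u0, v0):
    """Image-plane -> pixel coordinates using pixel size and principal point."""
    x, y = point_i
    return (x / dx + u0, y / dy + v0)


def world_to_pixel(point_w, rotation, translation, focal_length, dx, dy, u0, v0):
    """Apply the three mappings in sequence."""
    point_c = world_to_camera(point_w, rotation, translation)
    point_i = camera_to_image(point_c, focal_length)
    return image_to_pixel(point_i, dx, dy, u0, v0)
```

If the second image is described in image coordinates rather than pixel coordinates, the last mapping is simply omitted, as the two paragraphs above distinguish.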
In addition, because the panoramic image includes the images of subjects at most scene positions in the target scene, the embodiments of the present invention may determine in advance the correspondence between each image region of the panoramic image and each scene area in the real scene. For the conference scene shown in FIG. 1A, for example, the correspondence between each area of the conference room and each image region of the panoramic image can be obtained, as shown in FIG. 4B.
In FIG. 4B, subjects in different scene areas of the conference room 100 form different angles with the screen midline distributed along the y direction of the display screen 111, and these angles correspond to different image regions of the panoramic image. The panoramic image is divided into 6 image regions which, from left to right, correspond to the scene areas of 0 to 30 degrees, 30 to 60 degrees, 60 to 90 degrees, 90 to 120 degrees, 120 to 150 degrees, and 150 to 180 degrees. When the angle θ1 between the speaker and the screen midline distributed along the y direction of the display screen 111 is 65 degrees, the mapped image region is the one corresponding to 60 to 90 degrees (the shaded area in FIG. 4B).
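The FIG. 4B correspondence can be expressed as a small function. This is a sketch that assumes the regions have equal angular width, as in the example; the function name `panorama_region` and its parameters are hypothetical.

```python
def panorama_region(angle_deg, num_regions=6, total_angle=180.0):
    """Map a speaker angle to the index and angular bounds of a panorama region.

    Regions are numbered from 0, left to right, and are assumed to have equal
    angular width (30 degrees each for 6 regions covering 180 degrees).
    """
    if not 0.0 <= angle_deg <= total_angle:
        raise ValueError("angle is outside the range covered by the panorama")
    width = total_angle / num_regions
    # An angle equal to total_angle is assigned to the last region.
    index = min(int(angle_deg // width), num_regions - 1)
    return index, (index * width, (index + 1) * width)
```

For the 65-degree speaker in the example, `panorama_region(65.0)` yields the third region from the left, which covers 60 to 90 degrees.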
After the image region in the panoramic image has been mapped, feature matching can be used to find the region in the second image that corresponds to the position, and thus the image position in the second image. Specifically, the following operations may be used to map the image position of the speaker in the second image:
Acquire the correspondence between each image region of the panoramic image and each scene area in the real scene.
Based on the correspondence, locate the image region in the first image to which the position belongs.
Through feature matching, find the image region in the second image whose features match those of the located image region.
Use the coordinates of the matched image region as the mapped image position.
During feature matching, the features of the speaker within the located image region of the first image are matched against the image features of the second image.
In other embodiments, the designer may also generate in advance the position information, in the real scene, of each image region of the images captured by each close-up camera. This position information may be the relative angle between the local display screen and the scene area that corresponds to each image region in the real scene. The determined position of the speaker is the relative angle between the speaker and the local display screen. The relative angle mentioned here is the angle between the speaker, or a subject in the scene area, and the screen midline distributed along the y direction of the display screen 111 shown in FIG. 1A.
The image position of the speaker in the second image can then be obtained. In one example, the following operations may be used to map the image position of the speaker in the second image:
Retrieve the position information, in the real scene, of each image region of each close-up camera.
Match the retrieved position information against the position.
According to the matching result, obtain the image region in the second image to which the speaker belongs.
Use the coordinates of the obtained image region as the mapped image position.
In this example, where the position is the relative angle between the speaker and the local display screen, and the position information is the relative angle between the local display screen and the scene area that corresponds to each image region in the real scene, the step of generating the position information in advance may include:
According to the lens angle of view of each close-up camera, calibrate the relative angle between the scene areas of that close-up camera and the close-up camera itself.
According to the calibrated relative angle and the relative position parameters between that close-up camera and the local display screen, calculate the relative angle between the scene areas of that close-up camera and the local display screen.
For example, the image captured by a close-up camera is calibrated using the camera's horizontal shooting angle parameter (such as 160 degrees): in a photograph, the leftmost, middle, and rightmost image regions are at angles of 10 degrees, 90 degrees, and 160 degrees relative to the camera, respectively. On this basis, with reference to the camera's distance from the center of the display screen shown in FIG. 1A and its angle relative to the screen midline distributed along the y direction of the display screen, the relative angle between any position in the image and that screen midline can be calculated.
After the image position has been mapped, if the close-up is to show a target part of the speaker, the image data at the mapped position in the second image may first be extracted; the image data of the speaker's target part is then identified in, and cropped from, the extracted image data, and the cropped image data is taken as the close-up image.
With the above embodiments, the close-up camera corresponding to the speaker's position can be quickly selected from multiple close-up cameras to capture the speaker, and the image captured by the panoramic camera is then displayed together with the extracted image data. A close-up of the speaker, or of the speaker's target part, is thus presented while the panoramic image is displayed. No human director is needed to pan, tilt, or push and pull the close-up camera. Compared with related technologies in which a human director must perform a series of additional operations on the camera, close-up efficiency is effectively improved.
In some application scenarios, the speaker may move over time; in others, people at different positions may take turns speaking. In these scenarios, to keep the close-up on the speaker accurate, the speaker's position needs to be re-determined after a close-up has been produced, and a close-up camera selected again so that the speaker at the new position is shown in close-up. For details, refer to FIG. 5. The method shown in FIG. 5 may include steps S501-S508:
Step S501: Acquire an image captured by the panoramic camera as the first image.
Step S502: Determine the position of the speaker.
Step S503: From two or more close-up cameras, select the close-up camera corresponding to the position to capture an image of the speaker as the second image.
Step S504: Display the first image and the second image.
Step S505: Re-determine the position of the speaker.
Step S506: Based on the re-determined position and the previously determined position, determine whether the change in the speaker's position is smaller than a predetermined change amount. If it is smaller, perform step S504; otherwise, perform step S507.
Step S507: From the two or more close-up cameras, select the close-up camera corresponding to the re-determined position to capture an image of the speaker as the third image.
Step S508: Display the first image and the third image.
The technical content of this embodiment corresponds to the foregoing embodiments and is not described again here.
The designer may predetermine the condition under which the speaker's position is re-determined, such as periodically, at set times, or when triggered by a user.
To avoid repeatedly performing the close-up camera selection when the speaker's position changes only slightly and the selected close-up camera would be the same as before, the embodiments of the present invention may predetermine a change amount. This change amount is determined by the actual application scenario and the target venue of that scenario; in a conference scenario, for example, it may be the spacing between two participants.
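The check in step S506 can be sketched as a small stateful helper. This is an illustration under assumptions: positions are taken to be relative angles in degrees, and the class name `CloseUpTracker`, the `select_camera` callable, and `min_change` are hypothetical names standing in for the camera selection operation and the predetermined change amount.

```python
class CloseUpTracker:
    """Keeps the current close-up camera until the speaker moves far enough."""

    def __init__(self, select_camera, min_change):
        self.select_camera = select_camera  # callable: position -> camera id
        self.min_change = min_change        # predetermined change amount
        self.last_position = None
        self.camera = None

    def update(self, position):
        """Return the camera to use after the position has been (re-)determined."""
        first_time = self.last_position is None
        # Step S506: reselect only if the change is NOT smaller than the threshold.
        if first_time or abs(position - self.last_position) >= self.min_change:
            self.camera = self.select_camera(position)  # step S503 or S507
        # The comparison is always against the last determined position.
        self.last_position = position
        return self.camera
```

With a threshold of 10 degrees, a speaker who shifts from 60 to 65 degrees keeps the same camera without the selection operation being run again.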
In addition, within some scene areas the selected close-up camera is the same regardless of whether the speaker changes or the speaker's position changes. In that case there is no need to perform the close-up camera selection frequently; the image captured by the previously selected close-up camera can be used directly as the second image, or the image data can be extracted from that second image.
In one example, before the close-up camera corresponding to the re-determined position is selected from the two or more close-up cameras to capture an image of the speaker, it may be determined whether the re-determined position and the previously determined position are both within a predetermined position range. If so, the operation of displaying the first image and the second image is performed. If not, the operation of selecting, from the two or more close-up cameras, the close-up camera corresponding to the re-determined position to capture the speaker and obtain the third image is performed.
For example, in the conference scene shown in FIG. 1A, where the position is the relative angle between the speaker and the screen midline distributed along the y direction of the display screen 111, it may be determined whether the re-determined position and the previously determined position are both between 75 degrees and 105 degrees; if so, the operation of displaying the first image and the second image is performed.
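The range check in this example reduces to a single predicate. The sketch uses the 75 to 105 degree range from the FIG. 1A example as default values; the function name `both_in_range` is hypothetical, and treating the range bounds as inclusive is an assumption.

```python
def both_in_range(previous_angle, new_angle, lower=75.0, upper=105.0):
    """Return True if both positions fall within the predetermined range.

    When True, the previously selected close-up camera is kept and the first
    and second images continue to be displayed; when False, a close-up camera
    is selected again to obtain the third image.
    """
    return lower <= previous_angle <= upper and lower <= new_angle <= upper
```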
Corresponding to the foregoing method embodiments, the present invention also provides apparatus embodiments.
Referring to FIG. 6, FIG. 6 is a block diagram of an image display apparatus according to an exemplary embodiment of the present invention. The apparatus may be applied to the image display system of the foregoing embodiments and may include a first image acquisition module 610, a speaking position determination module 620, a second image acquisition module 630, and an image display module 640.
The first image acquisition module 610 is configured to acquire an image captured by the panoramic camera as the first image.
The speaking position determination module 620 is configured to determine the position of the speaker.
The second image acquisition module 630 is configured to select, from two or more close-up cameras, the close-up camera corresponding to the position to capture an image of the speaker as the second image.
The image display module 640 is configured to display the first image and the second image.
In one example, the position includes a relative position parameter between the speaker and the local display screen, and the second image acquisition module 630 may include:
a position parameter retrieval module, configured to retrieve the relative position parameters between each close-up camera and the local display screen;
a relative position calculation module, configured to calculate, based on the position and the retrieved relative position parameters, the relative position parameter between the speaker and each close-up camera;
a camera selection module, configured to select, from the two or more close-up cameras and according to the calculated relative position parameters, the close-up camera whose relative position parameter with respect to the speaker is the smallest;
a first acquisition submodule, configured to acquire the image of the speaker captured by the selected close-up camera.
In another example, the second image acquisition module 630 may include:
a predetermined relationship acquisition module, configured to acquire the predetermined correspondence between each scene position in the real scene and each close-up camera, where the close-up camera corresponding to each scene position is dedicated to close-ups of a speaker at that scene position;
a target camera selection module, configured to select, from the two or more close-up cameras and according to the predetermined correspondence, the close-up camera corresponding to the position as the target camera;
a second acquisition submodule, configured to acquire the image of the speaker captured by the target camera.
As an example, the position is the relative angle between the speaker and the local display screen, and the relative position parameter is a relative angle.
In another example, the image display module 640 may include:
an image position mapping module, configured to map, according to the position, the image position of the speaker in the second image;
a close-up image extraction module, configured to extract the image data at the mapped image position in the second image to obtain a close-up image of the speaker;
an image display submodule, configured to display the close-up image and the first image.
As an example, the image display apparatus of the embodiments of the present invention may further include a target extraction module, configured to:
identify, in the extracted image data, the image data of the speaker's target part and crop it out;
take the cropped image data as the close-up image.
As an example, the image position mapping module is configured to:
acquire the correspondence between each image region of the panoramic image and each scene area in the real scene;
locate, based on the correspondence, the image region in the first image to which the position belongs;
find, through feature matching, the image region in the second image whose features match those of the located image region;
use the coordinates of the matched image region as the mapped image position.
As an example, the image position mapping module is configured to:
retrieve the position information, in the real scene, of each image region of each close-up camera;
match the retrieved position information against the position;
obtain, according to the matching result, the image region in the second image to which the speaker belongs;
use the coordinates of the obtained image region as the mapped image position.
As an example, the position is the relative angle between the speaker and the local display screen;
the position information is the relative angle between the local display screen and the scene area that corresponds to each image region in the real scene.
As an example, the module that generates the position information in advance is configured to:
calibrate, according to the lens angle of view of each close-up camera, the relative angle between the scene areas of that close-up camera and the close-up camera itself;
calculate, according to the calibrated relative angle and the relative position parameters between that close-up camera and the local display screen, the relative angle between the scene areas of that close-up camera and the local display screen.
In other embodiments, at the hardware level, FIG. 7 shows a hardware structure diagram of the image display device of the present invention. In addition to the processor, memory, network interface, and non-volatile storage shown in FIG. 7, the image display device in which the apparatus of the embodiments is located may include other hardware according to the actual functions of the device, which is not described further here. The memory and the non-volatile storage are device-readable storage, and the storage of the image display device may store program instructions executable by the processor. The processor may be coupled to the storage to read the stored program instructions and, in response, perform the operations of the image display method described above.
In other embodiments, for the operations performed by the processor, refer to the related descriptions in the method embodiments above; they are not repeated here.
In addition, the embodiments of the present invention also provide a machine-readable storage medium (the memory of a sending-end device, a peripheral device, or a receiving-end device). The readable storage medium stores program instructions, which include instructions corresponding to each step of the methods described above. When executed by one or more processors, the instructions cause the image display device to perform the operations of the corresponding methods above.
The embodiments of the present invention may take the form of a computer program product implemented on one or more readable storage media (including but not limited to magnetic disk storage, CD-ROM, and optical storage) that contain program code. Computer-usable readable storage media include permanent and non-permanent, removable and non-removable media, and may store information by any method or technology. The information may be computer-readable instructions, data structures, program modules, or other data. Examples of machine-readable storage media include, but are not limited to: phase-change memory (PRAM), static random access memory (SRAM), dynamic random access memory (DRAM), other types of random access memory (RAM), read-only memory (ROM), electrically erasable programmable read-only memory (EEPROM), flash memory or other memory technologies, compact disc read-only memory (CD-ROM), digital versatile disc (DVD) or other optical storage, magnetic cassettes, magnetic tape or magnetic disk storage or other magnetic storage devices, or any other non-transmission medium that can be used to store information accessible to a computing device.
The above are only preferred embodiments of the present invention and are not intended to limit it. Any modification, equivalent replacement, or improvement made within the spirit and principles of the present invention shall fall within the scope of protection of the present invention.

Claims (48)

  1. An image display method, characterized by comprising the following steps:
    acquiring an image captured by a panoramic camera as a first image;
    determining the position of a speaker;
    selecting, from two or more close-up cameras, the close-up camera corresponding to the position to capture an image of the speaker as a second image;
    displaying the first image and the second image.
  2. The method according to claim 1, characterized in that determining the position of the speaker comprises:
    acquiring a positioning result for the speaker from an associated acoustic positioning device;
    determining the position of the speaker according to the positioning result.
  3. The method according to claim 1, characterized in that the position includes a relative position parameter between the speaker and a local display screen, and selecting, from two or more close-up cameras, the close-up camera corresponding to the position to capture an image of the speaker comprises:
    retrieving the relative position parameters between each close-up camera and the local display screen;
    calculating, based on the position and the retrieved relative position parameters, the relative position parameter between the speaker and each close-up camera;
    selecting, from the two or more close-up cameras and according to the calculated relative position parameters, the close-up camera whose relative position parameter with respect to the speaker is the smallest;
    acquiring the image of the speaker captured by the selected close-up camera.
  4. The method according to claim 1, wherein selecting, from two or more close-up cameras, the close-up camera corresponding to the position to capture the image of the speaker comprises:
    acquiring a predetermined correspondence between scene positions in the real scene and the close-up cameras, wherein the close-up camera corresponding to each scene position is dedicated to capturing close-ups of a speaker at that scene position;
    selecting, from the two or more close-up cameras and according to the predetermined correspondence, the close-up camera corresponding to the position as a target camera;
    acquiring the image of the speaker captured by the target camera.
  5. The method according to claim 3 or 4, wherein the relative position parameter comprises a relative angle and/or a relative distance.
  6. The method according to claim 5, wherein the position is the relative angle between the speaker and the local display screen, and the relative position parameter is a relative angle.
  7. The method according to claim 6, wherein the relative angle is a relative angle in the horizontal direction.
  8. The method according to claim 1, wherein the position comprises coordinates of the speaker in a predetermined coordinate system, and selecting, from two or more close-up cameras, the close-up camera corresponding to the position to capture the image of the speaker comprises:
    acquiring predetermined coordinates of the center of the local display screen and predetermined coordinates of each close-up camera;
    calculating, according to the acquired predetermined coordinates and the position, the included angle, with respect to the speaker, between the center of the local display screen and each close-up camera, as the relative angle between that close-up camera and the speaker;
    selecting, according to the calculated relative angles, the close-up camera with the smallest relative angle to the speaker as a target camera.
  9. The method according to claim 8, wherein, if two or more close-up cameras are selected, the method further comprises:
    calculating, according to the predetermined coordinates of each selected close-up camera and the position, the relative distance between that close-up camera and the speaker;
    selecting, from the selected target cameras and according to the calculated relative distances, the close-up camera with the smallest relative distance to the speaker as the target camera.
  10. The method according to claim 8, wherein, if two or more close-up cameras are selected, the method further comprises:
    calculating the relative angle in the horizontal direction between each selected close-up camera and the speaker;
    selecting, from the selected close-up cameras, the close-up camera with the smallest relative angle to the speaker in the horizontal direction as the target camera.
  11. The method according to claim 1, further comprising the following steps:
    re-determining the position of the speaker;
    judging, according to the re-determined position and the previously determined position, whether the change in the speaker's position is less than a predetermined amount;
    if it is less, performing the step of displaying the first image and the second image;
    if it is not less, selecting, from the two or more close-up cameras, the close-up camera corresponding to the re-determined position, and taking the image of the speaker captured by that camera as a third image;
    displaying the first image and the third image.
  12. The method according to claim 1, wherein displaying the first image and the second image comprises:
    mapping, according to the position, the image position of the speaker in the second image;
    extracting the image data at the mapped image position in the second image to obtain a close-up image of the speaker;
    displaying the close-up image and the first image.
  13. The method according to claim 12, wherein, after extracting the image data at the mapped position in the second image, the method further comprises:
    identifying and cropping, from the extracted image data, the image data of a target part of the speaker;
    determining the cropped image data as the close-up image.
  14. The method according to claim 12, wherein mapping, according to the position, the image position of the speaker in the second image comprises:
    acquiring the correspondence between the image regions of the panoramic image and the scene areas in the real scene;
    locating, based on the correspondence, the image region of the first image to which the position belongs;
    finding, by feature matching, the image region in the second image whose features match the located image region;
    taking the coordinates of the matched image region as the mapped image position.
  15. The method according to claim 12, wherein mapping, according to the position, the image position of the speaker in the second image comprises:
    retrieving position information, in the real scene, of the image regions of each close-up camera;
    matching the retrieved position information against the position;
    obtaining, according to the matching result, the image region of the second image to which the speaker belongs;
    taking the coordinates of the obtained image region as the mapped image position.
  16. The method according to claim 15, wherein the position is the relative angle between the speaker and the local display screen;
    the position information is the relative angle between the local display screen and the scene area that corresponds, in the real scene, to each image region.
  17. The method according to claim 16, wherein the step of pre-generating the position information comprises:
    calibrating, according to the lens angle of view of each close-up camera, the relative angle between the scene areas of that close-up camera and the close-up camera itself;
    calculating, according to the calibrated relative angle and the relative position parameters between that close-up camera and the local display screen, the relative angle between the scene areas of that close-up camera and the local display screen.
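A minimal Python sketch of the selection logic recited in claims 8 and 9, assuming 2D coordinates in a shared predetermined coordinate system and assuming the included angle is measured with the speaker at its vertex. The names `Camera`, `included_angle`, `distance`, `select_target_camera` and the tolerance `tol_deg` are hypothetical and do not come from the application, which does not prescribe how the angle is computed.

```python
import math
from dataclasses import dataclass
from typing import Sequence, Tuple

Point = Tuple[float, float]


@dataclass(frozen=True)
class Camera:
    """A close-up camera at predetermined coordinates (hypothetical type)."""
    name: str
    x: float
    y: float


def included_angle(speaker: Point, screen_center: Point, camera: Camera) -> float:
    """Angle in degrees, at the speaker, between the directions to the
    screen center and to the camera (assumed reading of claim 8)."""
    ax, ay = screen_center[0] - speaker[0], screen_center[1] - speaker[1]
    bx, by = camera.x - speaker[0], camera.y - speaker[1]
    # atan2(cross, dot) gives the signed angle between the two vectors.
    return abs(math.degrees(math.atan2(ax * by - ay * bx, ax * bx + ay * by)))


def distance(speaker: Point, camera: Camera) -> float:
    """Euclidean distance between the speaker and the camera."""
    return math.hypot(camera.x - speaker[0], camera.y - speaker[1])


def select_target_camera(
    speaker: Point,
    screen_center: Point,
    cameras: Sequence[Camera],
    tol_deg: float = 1e-6,  # assumed tolerance for treating angles as tied
) -> Camera:
    """Pick the camera with the smallest included angle (claim 8) and break
    ties by the smallest distance to the speaker (claim 9)."""
    if not cameras:
        raise ValueError("at least one close-up camera is required")
    angles = [(included_angle(speaker, screen_center, c), c) for c in cameras]
    smallest = min(a for a, _ in angles)
    candidates = [c for a, c in angles if a - smallest <= tol_deg]
    return min(candidates, key=lambda c: distance(speaker, c))
```

Because the angle is measured at the speaker, the smallest included angle does not necessarily pick the camera physically nearest to the speaker; in this sketch the distance comparison is applied only when several cameras tie on angle.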
  18. An image display system, comprising an image display device, a panoramic camera, and two or more close-up cameras, the image display device comprising:
    a display screen;
    a processor;
    a memory storing processor-executable instructions;
    wherein the processor is coupled to the memory and is configured to read the program instructions stored in the memory and, in response, perform the operations of the method according to any one of claims 1 to 17.
  19. The system according to claim 18, wherein the panoramic camera and each close-up camera are mounted on the image display device.
  20. The system according to claim 18, wherein each close-up camera has a different relative angle to the display screen.
  21. The system according to claim 20, wherein the panoramic camera is mounted on the upper frame of the image display device, a first close-up camera is mounted on the left frame of the image display device, and a second close-up camera is mounted on the right frame of the image display device.
  22. The system according to claim 21, wherein the relative angle between the first close-up camera and the display screen is between 10 degrees and 50 degrees;
    the relative angle between the second close-up camera and the display screen is between 130 degrees and 170 degrees.
  23. The system according to claim 18, wherein the image display device further comprises an acoustic positioning device for locating the speaker.
  24. The system according to claim 23, wherein the acoustic positioning device comprises a microphone array.
  25. The system according to any one of claims 18 to 23, wherein the image display device is a conference interaction device.
  26. The system according to claim 25, wherein the conference interaction device is a smart interactive tablet.
  27. An image display device, comprising:
    a display screen;
    a processor;
    a memory storing processor-executable instructions;
    wherein the processor is coupled to the memory and is configured to read the program instructions stored in the memory and, in response, perform the operations of the method according to any one of claims 1 to 17.
  28. The device according to claim 27, wherein the device is associated with a panoramic camera and at least two close-up cameras.
  29. The device according to claim 28, wherein each close-up camera has a different relative angle to the display screen.
  30. The device according to claim 29, wherein the relative angle between one close-up camera and the display screen is between 10 degrees and 50 degrees, and the relative angle between another close-up camera and the display screen is between 130 degrees and 170 degrees.
  31. The device according to claim 27, further comprising an acoustic positioning device for locating the speaker.
  32. The device according to claim 31, wherein the acoustic positioning device comprises a microphone array.
  33. The device according to any one of claims 27 to 32, wherein the device is a conference interaction device.
  34. The device according to claim 33, wherein the conference interaction device is a smart interactive tablet.
  35. One or more machine-readable storage media having instructions stored thereon which, when executed by one or more processors, perform the operations of the method according to any one of claims 1 to 17.
  36. An image display apparatus, comprising:
    a first image acquisition module, configured to acquire an image captured by a panoramic camera as a first image;
    a speaking position determination module, configured to determine the position of a speaker;
    a second image acquisition module, configured to select, from two or more close-up cameras, the close-up camera corresponding to the position, and to take the image of the speaker captured by that camera as a second image;
    an image display module, configured to display the first image and the second image.
  37. The apparatus according to claim 36, wherein the position comprises a relative position parameter between the speaker and the local display screen, and the second image acquisition module comprises:
    a position parameter retrieval module, configured to retrieve the relative position parameters between each close-up camera and the local display screen;
    a relative position calculation module, configured to calculate, based on the position and the retrieved relative position parameters, the relative position parameter between the speaker and each close-up camera;
    a camera selection module, configured to select, from the two or more close-up cameras and according to the calculated relative position parameters, the close-up camera whose relative position parameter with respect to the speaker is smallest;
    a first acquisition submodule, configured to acquire the image of the speaker captured by the selected close-up camera.
  38. The apparatus according to claim 36, wherein the second image acquisition module comprises:
    a predetermined relationship acquisition module, configured to acquire a predetermined correspondence between scene positions in the real scene and the close-up cameras, wherein the close-up camera corresponding to each scene position is dedicated to capturing close-ups of a speaker at that scene position;
    a target camera selection module, configured to select, from the two or more close-up cameras and according to the predetermined correspondence, the close-up camera corresponding to the position as a target camera;
    a second acquisition submodule, configured to acquire the image of the speaker captured by the target camera.
  39. The apparatus according to claim 37 or 38, wherein the position is the relative angle between the speaker and the local display screen, and the relative position parameter is a relative angle.
  40. The apparatus according to claim 36, wherein the image display module comprises:
    an image position mapping module, configured to map, according to the position, the image position of the speaker in the second image;
    a close-up image extraction module, configured to extract the image data at the mapped image position in the second image to obtain a close-up image of the speaker;
    an image display submodule, configured to display the close-up image and the first image.
  41. The apparatus according to claim 40, further comprising a target extraction module, configured to:
    identify and crop, from the extracted image data, the image data of a target part of the speaker;
    determine the cropped image data as the close-up image.
  42. The apparatus according to claim 40, wherein the image position mapping module is configured to:
    acquire the correspondence between the image regions of the panoramic image and the scene areas in the real scene;
    locate, based on the correspondence, the image region of the first image to which the position belongs;
    find, by feature matching, the image region in the second image whose features match the located image region;
    take the coordinates of the matched image region as the mapped image position.
  43. The apparatus according to claim 40, wherein the image position mapping module is configured to:
    retrieve position information, in the real scene, of the image regions of each close-up camera;
    match the retrieved position information against the position;
    obtain, according to the matching result, the image region of the second image to which the speaker belongs;
    take the coordinates of the obtained image region as the mapped image position.
  44. The apparatus according to claim 43, wherein the position is the relative angle between the speaker and the local display screen;
    the position information is the relative angle between the local display screen and the scene area that corresponds, in the real scene, to each image region.
  45. The apparatus according to claim 44, wherein the module that pre-generates the position information is configured to:
    calibrate, according to the lens angle of view of each close-up camera, the relative angle between the scene areas of that close-up camera and the close-up camera itself;
    calculate, according to the calibrated relative angle and the relative position parameters between that close-up camera and the local display screen, the relative angle between the scene areas of that close-up camera and the local display screen.
  46. A smart interactive tablet, comprising a panoramic camera, a first close-up camera, and a second close-up camera, wherein the panoramic camera, the first close-up camera, and the second close-up camera are arranged on the frame of the smart interactive tablet, and the optical axes of the first close-up camera and the second close-up camera are inclined relative to the display plane of the smart interactive tablet.
  47. The smart interactive tablet according to claim 46, wherein the smart interactive tablet is further configured to:
    acquire an image captured by the panoramic camera as a first image;
    determine the position of a speaker;
    select, from two or more close-up cameras, the close-up camera corresponding to the position, and take the image of the speaker captured by that camera as a second image;
    display the first image and the second image.
  48. The smart interactive tablet according to claim 46 or 47, wherein the relative angle between the optical axis of the first close-up camera and the display screen is between 10 degrees and 50 degrees, and the relative angle between the optical axis of the second close-up camera and the display screen is between 130 degrees and 170 degrees.
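A minimal Python sketch of how a tablet with two close-up cameras might combine the angle-based selection of claims 3 and 6 with the position-change check of claim 11. The class `CloseUpSelector`, the optical-axis angles of 30 and 150 degrees (picked from within the ranges in claim 48), and the 15-degree threshold are illustrative assumptions, as is the choice to compare each new angle against the immediately preceding determination.

```python
from dataclasses import dataclass, field
from typing import Dict, Optional


@dataclass
class CloseUpSelector:
    """Chooses a close-up camera from the speaker's relative angle to the
    display screen and keeps that choice while the speaker barely moves."""
    camera_angles_deg: Dict[str, float]  # camera name -> relative angle to the screen
    min_change_deg: float = 15.0         # assumed "predetermined change amount"
    _last_angle: Optional[float] = field(default=None, init=False)
    _current: Optional[str] = field(default=None, init=False)

    def _nearest(self, speaker_angle_deg: float) -> str:
        # Smallest relative angle between the speaker and each camera.
        return min(
            self.camera_angles_deg,
            key=lambda name: abs(speaker_angle_deg - self.camera_angles_deg[name]),
        )

    def update(self, speaker_angle_deg: float) -> str:
        """Return the camera to display for the newly determined angle."""
        moved = (
            self._last_angle is None
            or abs(speaker_angle_deg - self._last_angle) >= self.min_change_deg
        )
        if moved:
            # Change is not less than the threshold: reselect the camera.
            self._current = self._nearest(speaker_angle_deg)
        # Assumption: every determination becomes the new reference position.
        self._last_angle = speaker_angle_deg
        return self._current


selector = CloseUpSelector({"first": 30.0, "second": 150.0})
camera_for_first_fix = selector.update(40.0)
```

Keeping the previous camera for small movements avoids flipping the close-up view back and forth when the acoustic positioning result jitters around the midpoint between the two optical axes.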
PCT/CN2018/114074 2018-06-20 2018-11-06 Image display method, apparatus, system and device, and readable storage medium WO2019242207A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201810639563.3 2018-06-20
CN201810639563.3A CN108900787B (en) 2018-06-20 2018-06-20 Image display method, device, system and equipment, readable storage medium

Publications (1)

Publication Number Publication Date
WO2019242207A1 true WO2019242207A1 (en) 2019-12-26

Family

ID=64345223

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2018/114074 WO2019242207A1 (en) 2018-06-20 2018-11-06 Image display method, apparatus, system and device, and readable storage medium

Country Status (2)

Country Link
CN (1) CN108900787B (en)
WO (1) WO2019242207A1 (en)

Families Citing this family (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109743511B (en) * 2019-01-03 2021-04-20 苏州佳世达光电有限公司 Method and system for automatically adjusting display direction of playing picture
CN110072134B (en) * 2019-04-30 2021-09-28 深圳Tcl新技术有限公司 Horizontal and vertical screen switching method of frame television and frame television
CN110312100A (en) * 2019-06-06 2019-10-08 西安中易建科技有限公司 Security-protecting and monitoring method and device
CN113411487B (en) * 2020-03-17 2023-08-01 中国电信股份有限公司 Method, apparatus, system and computer readable storage medium for controlling device
CN111464746B (en) * 2020-04-20 2022-01-28 维沃移动通信(杭州)有限公司 Photographing method and electronic equipment
CN111586341A (en) * 2020-05-20 2020-08-25 深圳随锐云网科技有限公司 Shooting method and picture display method of video conference shooting device
CN111711845B (en) * 2020-06-29 2022-07-08 广州视源电子科技股份有限公司 Signal processing method, device, system and storage medium
TWI798867B (en) * 2021-06-27 2023-04-11 瑞昱半導體股份有限公司 Video processing method and associated system on chip
CN115396602A (en) * 2022-07-30 2022-11-25 珠海视熙科技有限公司 Scene shooting control method, device and system based on three-camera system

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102256098A (en) * 2010-05-18 2011-11-23 宝利通公司 Videoconferencing endpoint having multiple voice-tracking cameras
CN102368816A (en) * 2011-12-01 2012-03-07 中科芯集成电路股份有限公司 Intelligent front end system of video conference
CN105657329A (en) * 2016-02-26 2016-06-08 苏州科达科技股份有限公司 Video conference system, processing device and video conference method
CN106254781A (en) * 2016-09-14 2016-12-21 江苏开放大学 Fully automatic recording and live-broadcast all-in-one machine based on three cameras
CN206251236U (en) * 2016-12-13 2017-06-13 广州视源电子科技股份有限公司 Camera system and interactive smart tablet all-in-one machine
CN107277427A (en) * 2017-05-16 2017-10-20 广州视源电子科技股份有限公司 Method and device for automatically selecting camera picture and audio/video system
US20180098029A1 (en) * 2016-10-04 2018-04-05 Avaya Inc. Multi-mode video conferencing system

Family Cites Families (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101072332A (en) * 2007-06-04 2007-11-14 深圳市融合视讯科技有限公司 Automatic mobile target tracking and shooting method
CN100531373C (en) * 2007-06-05 2009-08-19 西安理工大学 Video frequency motion target close-up trace monitoring method based on double-camera head linkage structure
CN104125433A (en) * 2014-07-30 2014-10-29 西安冉科信息技术有限公司 Moving object video surveillance method based on multi-PTZ (pan-tilt-zoom)-camera linkage structure
CN106060470B (en) * 2016-06-24 2022-12-23 邵文超 Video monitoring method and system
CN107734244B (en) * 2016-08-10 2019-07-05 深圳看到科技有限公司 Panorama movie playback method and playing device

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115174949A (en) * 2022-06-30 2022-10-11 广州汇才创新科技有限公司 Projection-based remote live broadcast interaction method and system
CN115174949B (en) * 2022-06-30 2024-02-23 广州汇才创新科技有限公司 Remote live broadcast interaction method and system based on projection

Also Published As

Publication number Publication date
CN108900787A (en) 2018-11-27
CN108900787B (en) 2021-06-04

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 18923359

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

32PN Ep: public notification in the ep bulletin as address of the adressee cannot be established

Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC (EPO FORM 1205A DATED 25.03.2021)

122 Ep: pct application non-entry in european phase

Ref document number: 18923359

Country of ref document: EP

Kind code of ref document: A1