CN115550599A - Audio and video output method for presence meeting place, electronic equipment and storage medium - Google Patents

Audio and video output method for presence meeting place, electronic equipment and storage medium

Info

Publication number
CN115550599A
Authority
CN
China
Prior art keywords
meeting place
display screen
meeting
speaking
image data
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202211178189.4A
Other languages
Chinese (zh)
Inventor
翟小刚
罗东礼
刁磊
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Suzhou Keda Technology Co Ltd
Original Assignee
Suzhou Keda Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Suzhou Keda Technology Co Ltd
Priority to CN202211178189.4A
Publication of CN115550599A
Legal status: Pending

Classifications

    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N7/00 Television systems
    • H04N7/14 Systems for two-way working
    • H04N7/15 Conference systems
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00 Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40 Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43 Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/4302 Content synchronisation processes, e.g. decoder synchronisation
    • H04N21/4307 Synchronising the rendering of multiple content streams or additional data on devices, e.g. synchronisation of audio on a mobile phone with the video output on the TV screen
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00 Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40 Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/47 End-user applications
    • H04N21/485 End-user interface for client configuration
    • H04N21/4852 End-user interface for client configuration for modifying audio parameters, e.g. switching between mono and stereo
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00 Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40 Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/47 End-user applications
    • H04N21/485 End-user interface for client configuration
    • H04N21/4854 End-user interface for client configuration for modifying image parameters, e.g. image brightness, contrast
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00 Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40 Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/47 End-user applications
    • H04N21/485 End-user interface for client configuration
    • H04N21/4858 End-user interface for client configuration for modifying screen layout parameters, e.g. fonts, size of the windows

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Human Computer Interaction (AREA)
  • Information Transfer Between Computers (AREA)
  • Two-Way Televisions, Distribution Of Moving Picture Or The Like (AREA)

Abstract

The invention relates to the technical field of video conferencing, and in particular to an audio and video output method for a telepresence meeting place, an electronic device, and a storage medium. The method comprises: acquiring image data and audio data of participating meeting places, wherein the meeting place types of the participating meeting places include the telepresence meeting place; determining the target display screen corresponding to each participating meeting place based on the correspondence between participating meeting places and display screens in the current telepresence meeting place; displaying the image data of the participating meeting place on the target display screen in a first size; when a participating meeting place is the speaking meeting place and the number of participating meeting places is greater than or equal to the number of display screens, determining a preset display screen of the current telepresence meeting place based on the meeting place type of the speaking meeting place, and displaying the image data of the speaking meeting place on the preset display screen in a second size, the second size being larger than the first size; and outputting the audio data of the speaking meeting place from the audio channel of the preset display screen. The method ensures that sound localization, that is, identifying a speaker's position by sound, is achieved in the current telepresence meeting place.

Description

Audio and video output method for presence meeting place, electronic equipment and storage medium
Technical Field
The invention relates to the technical field of video conferencing, and in particular to an audio and video output method for a telepresence meeting place, an electronic device, and a storage medium.
Background
The telepresence conference system integrates video communication, communication experience and conference control, and is a type of video conference system that has emerged in recent years. The technology features multi-channel audio and video, life-size (1:1) display of participants, and sound localization (identifying a speaker's position by sound). It focuses on a face-to-face communication effect, can provide a remote communication experience close to eye-to-eye contact, and is generally used in high-end video conference applications that require high-fidelity communication, such as remote case handling and remote meetings. To distinguish between different types of meeting places, a conference room using this technology is commonly referred to as a telepresence meeting place, and a conventional conference room with a single audio and video stream is called a single-screen meeting place.
The main video of the telepresence system carries a real-time, panoramic, high-fidelity video image of the conference room, and an additional dual-stream image carries auxiliary electronic content such as documents. The sound of the telepresence system is output on the playback device corresponding to the main video, so as to achieve sound localization.
When a multi-party video conference is held, many parties participate, so the far-end videos can only be displayed one-to-one on the local display devices, or be composited into a single picture by the platform and displayed on one display device. If this approach is applied to a telepresence meeting place, some of its characteristics, such as life-size (1:1) display, are lost.
Disclosure of Invention
In view of this, embodiments of the present invention provide an audio and video output method for a telepresence meeting place, an electronic device, and a storage medium, so as to solve the problem of audio and video output in a telepresence meeting place.
According to a first aspect, an embodiment of the present invention provides an audio and video output method for a telepresence meeting place, including:
acquiring image data and audio data of participating meeting places, wherein the meeting place types of the participating meeting places include the telepresence meeting place;
determining a target display screen corresponding to each participating meeting place based on the correspondence between participating meeting places and display screens in the current telepresence meeting place;
displaying the image data of the participating meeting place on the target display screen in a first size;
when a participating meeting place is the speaking meeting place and the number of participating meeting places is greater than or equal to the number of display screens, determining a preset display screen of the current telepresence meeting place based on the meeting place type of the speaking meeting place, and displaying the image data of the speaking meeting place on the preset display screen in a second size, wherein the second size is larger than the first size;
and outputting the audio data of the speaking meeting place from the audio channel of the preset display screen.
According to the audio and video output method for a telepresence meeting place provided by the embodiment of the invention, the image data of each participating meeting place is displayed in the first size on the target display screen corresponding to that meeting place. For the speaking meeting place among the participating meeting places, the image data is displayed on the target display screen in the first size and is also displayed on the preset display screen in the second size, so that the image data of every participating meeting place is displayed in order and the speaking meeting place is shown life-size (1:1). At the same time, the audio data of the speaking meeting place corresponds to its image data and is output from the audio channel of the preset display screen, which ensures sound localization in the current telepresence meeting place.
In some embodiments, the determining a preset display screen of the current telepresence meeting place based on the meeting place type of the speaking meeting place, and displaying the image data of the speaking meeting place on the preset display screen in a second size, includes:
when the meeting place type of the speaking meeting place is the telepresence meeting place and the number of participating meeting places is greater than the number of display screens, determining that the preset display screens are the middle N display screens of the current telepresence meeting place, wherein N is the number of image channels of the speaking meeting place;
and displaying the N channels of image data of the speaking meeting place on the middle N display screens respectively, in the second size.
According to the audio and video output method for a telepresence meeting place provided by the embodiment of the invention, when the speaking meeting place is a telepresence meeting place and there are many participating meeting places, the N channels of image data of the speaking meeting place are displayed on the middle N display screens respectively, so that characteristics such as life-size (1:1) display of participants, sound localization and eye contact are not lost, and communication efficiency is improved.
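As an illustrative sketch only, the selection of the middle N display screens might be expressed as follows in Python. The function name `middle_screens`, the zero-based screen indices ordered from left to right, and the rounding toward the left when the block cannot be centred exactly are assumptions and are not specified by the embodiment.

```python
from typing import List


def middle_screens(screen_count: int, n: int) -> List[int]:
    """Return the indices of the N centre-most screens, ordered left to right."""
    if not 0 < n <= screen_count:
        raise ValueError("n must be between 1 and the number of display screens")
    # Assumption: when (screen_count - n) is odd, the block sits one screen
    # to the left of the exact centre.
    start = (screen_count - n) // 2
    return list(range(start, start + n))


# A 3-channel telepresence speaker on a 3-screen site occupies every screen.
print(middle_screens(3, 3))
# A single-channel image on the same site lands on the middle screen only.
print(middle_screens(3, 1))
```

With 3 display screens and a 3-channel speaking meeting place, the sketch selects all three screens, which matches the example configuration used later in the description.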
In some embodiments, the determining a preset display screen of the current telepresence meeting place based on the meeting place type of the speaking meeting place, and displaying the image data of the speaking meeting place on the preset display screen in a second size, includes:
when the number of participating meeting places is equal to the number of display screens, determining that the preset display screen is the corresponding target display screen;
acquiring image data of the speaking meeting place, wherein, when the speaking meeting place is a telepresence meeting place, the image data is the image of the channel in which the speaker is located, and when the speaking meeting place is a single-screen meeting place, the image data is the meeting place image of the speaking meeting place;
and displaying the image data of the speaking meeting place on the target display screen in the second size, and displaying the most recent speaking image data of the other participating meeting places, in the second size, on the target display screens corresponding to those meeting places.
According to the audio and video output method for a telepresence meeting place provided by the embodiment of the invention, when the number of participating meeting places is equal to the number of display screens, the participating meeting places correspond to the display screens one to one, and the speaking image of the speaking meeting place is displayed on its corresponding target display screen in the second size. Switching of the speaking meeting place is thereby avoided, the amount of data processing is reduced, and real-time display of the image data is guaranteed.
In some embodiments, the determining a preset display screen of the current telepresence meeting place based on the meeting place type of the speaking meeting place, and displaying the image data of the speaking meeting place on the preset display screen in a second size, includes:
when the meeting place type of the speaking meeting place is the single-screen meeting place and the number of participating meeting places is greater than the number of display screens, determining that the preset display screen is the middle display screen of the current telepresence meeting place;
and displaying the meeting place image of the speaking meeting place on the middle display screen in the second size, and displaying preset image data on the other display screens of the current telepresence meeting place in the second size.
According to the audio and video output method for a telepresence meeting place provided by the embodiment of the invention, when the speaking meeting place is a single-screen meeting place, its image data is displayed on the middle display screen and preset image data is displayed on the other display screens, so that the single-screen meeting place is shown life-size (1:1).
In some embodiments, when the speaking meeting place is not the meeting place corresponding to the current image displayed in the second size on the middle display screen, the displaying the meeting place image of the speaking meeting place on the middle display screen in the second size, and displaying preset image data on the other display screens of the current telepresence meeting place in the second size, includes:
storing the current image displayed in the second size on the middle display screen;
replacing the current image with the second-size image data of the speaking meeting place;
and replacing, with the stored current image, the image data on the other display screens whose speech was the earliest.
According to the audio and video output method for a telepresence meeting place provided by the embodiment of the invention, for a single-screen meeting place, the current image of the middle display screen replaces the image data whose speech was the earliest. That is, the preset image data on the other display screens is updated according to how long ago each meeting place spoke, so that the other display screens show the image data of the most recent speakers.
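The three replacement steps above can be sketched as a small state update. This is a hedged illustration under stated assumptions, not the implementation of the embodiment: the class name `LargeImageRotation`, the screen names `left` and `right`, the integer logical timestamps, and the handling of a new speaker that is already shown on a side screen (its slot simply receives the previous middle image) are all hypothetical. Side screens that still show preset image data are modelled as `None` and are treated as the earliest.

```python
from dataclasses import dataclass, field
from typing import Dict, Optional


@dataclass
class LargeImageRotation:
    """Tracks which meeting place is shown in the second (large) size per screen."""
    middle: Optional[str] = None
    sides: Dict[str, Optional[str]] = field(
        default_factory=lambda: {"left": None, "right": None})
    last_spoke: Dict[str, int] = field(default_factory=dict)

    def on_speaker(self, site: str, now: int) -> None:
        self.last_spoke[site] = now
        if site == self.middle:
            return  # the speaker is already on the middle screen
        previous = self.middle   # step 1: store the current middle image
        self.middle = site       # step 2: show the new speaker in the middle
        if previous is None:
            return
        # Assumption: if the new speaker was on a side screen, reuse that slot.
        for name, shown in self.sides.items():
            if shown == site:
                self.sides[name] = previous
                return
        # Step 3: overwrite the side image whose speech was the earliest.
        oldest = min(self.sides,
                     key=lambda name: self.last_spoke.get(self.sides[name], -1))
        self.sides[oldest] = previous


rotation = LargeImageRotation()
for tick, speaker in enumerate(["A", "B", "C", "D"], start=1):
    rotation.on_speaker(speaker, tick)
print(rotation.middle, rotation.sides)
```

After meeting places A, B, C and D speak in turn, the sketch keeps D on the middle screen and the two most recent earlier speakers on the side screens.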
In some embodiments, the outputting the audio data of the speaking meeting place from the audio channel of the preset display screen includes:
outputting the audio data of the speaking meeting place from the audio channel of the preset display screen on which the second-size image data is displayed, based on the correspondence between the audio data of the speaking meeting place and the second-size image data;
and outputting the audio data of the other participating meeting places from the audio channels of the target display screens corresponding to those meeting places.
The audio and video output method for a telepresence meeting place provided by the embodiment of the invention outputs audio data by using the correspondence between the audio data of the speaking meeting place and the second-size image data, thereby achieving sound localization.
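A minimal sketch of this routing rule is given below. The function name `route_audio`, the use of screen names to stand for their audio channels, and the example identifiers are assumptions made for illustration; the embodiment does not prescribe a data structure.

```python
from typing import Dict, List


def route_audio(target_screen: Dict[str, str], speaking_site: str,
                preset_screens: List[str]) -> Dict[str, List[str]]:
    """Map each meeting place to the screen(s) whose audio channel plays its sound."""
    routes: Dict[str, List[str]] = {}
    for site, screen in target_screen.items():
        if site == speaking_site:
            # The speaker's audio follows its second-size image.
            routes[site] = list(preset_screens)
        else:
            # Every other meeting place is heard from its own target screen.
            routes[site] = [screen]
    return routes


targets = {"site1": "left", "site2": "left", "site3": "middle", "site4": "right"}
# Hypothetical case: single-screen site4 speaks and is enlarged on the middle screen.
print(route_audio(targets, "site4", ["middle"]))
```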
In some embodiments, the method further comprises:
acquiring the number of display screens of the current telepresence meeting place and the number of participating meeting places;
and allocating all the participating meeting places based on the number of display screens and the number of participating meeting places, and determining the correspondence between the participating meeting places and the display screens of the current telepresence meeting place.
According to the audio and video output method for a telepresence meeting place provided by the embodiment of the invention, the participating meeting places are allocated based on the number of display screens of the current telepresence meeting place. That is, allocation is performed per participating meeting place rather than per channel of image data within a meeting place, so that the image data of the same participating meeting place is allocated to the same display screen for display.
In some embodiments, when the number of participating meeting places is greater than the number of display screens, the allocating all the participating meeting places based on the number of display screens and the number of participating meeting places, and determining the correspondence between the participating meeting places and the display screens of the current telepresence meeting place, includes:
acquiring the number of first-size images that can be displayed on each display screen;
allocating the participating meeting places whose meeting place type is the telepresence meeting place, so that the first-size image data of each such meeting place is displayed on the same display screen of the current telepresence meeting place;
determining the remaining number of first-size images that can be displayed on the display screens, based on the allocation result and the number of images that can be displayed;
and allocating the participating meeting places whose meeting place type is the single-screen meeting place by using the remaining number of images.
According to the audio and video output method for a telepresence meeting place provided by the embodiment of the invention, telepresence meeting places are allocated first. A telepresence meeting place has N channels of image data and occupies more image display area, so the telepresence meeting places are allocated first and the single-screen meeting places then fill the remaining image display slots, which ensures that the image data of the same participating meeting place is displayed on the same display screen.
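The allocation order described above can be sketched as a two-pass placement. This is only one possible reading: the function name `allocate`, the first-fit choice of screen, the per-screen capacity of four first-size images used in the example, and the treatment of meeting places that do not fit as onlookers are assumptions. Each meeting place is described by its number of image channels (more than 1 for a telepresence meeting place, 1 for a single-screen meeting place).

```python
from typing import Dict, List, Tuple


def allocate(sites: Dict[str, int], screens: List[str],
             capacity: int) -> Tuple[Dict[str, str], List[str]]:
    """Assign whole meeting places to screens: telepresence first, then single-screen."""
    remaining = {screen: capacity for screen in screens}
    assignment: Dict[str, str] = {}
    onlookers: List[str] = []

    def place(site: str, channels: int) -> None:
        # First fit: all channels of one meeting place stay on the same screen.
        for screen in screens:
            if remaining[screen] >= channels:
                assignment[site] = screen
                remaining[screen] -= channels
                return
        onlookers.append(site)  # no screen has enough free first-size slots

    for site, channels in sites.items():      # pass 1: telepresence meeting places
        if channels > 1:
            place(site, channels)
    for site, channels in sites.items():      # pass 2: single-screen meeting places
        if channels == 1:
            place(site, channels)
    return assignment, onlookers


sites = {"A": 3, "B": 1, "C": 3, "D": 1, "E": 1}
print(allocate(sites, ["left", "middle", "right"], capacity=4))
```

Placing the multi-channel meeting places first avoids the situation where single-screen meeting places fragment the free slots so that no screen can hold all channels of a telepresence meeting place.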
According to a second aspect, an embodiment of the present invention further provides an audio and video output apparatus for a telepresence meeting place, including:
an acquisition module, configured to acquire image data and audio data of participating meeting places, wherein the meeting place types of the participating meeting places include the telepresence meeting place;
a determining module, configured to determine a target display screen corresponding to each participating meeting place based on the correspondence between participating meeting places and display screens in the current telepresence meeting place;
a first display module, configured to display the image data of the participating meeting place on the target display screen in a first size;
a second display module, configured to, when a participating meeting place is the speaking meeting place and the number of participating meeting places is greater than or equal to the number of display screens, determine a preset display screen of the current telepresence meeting place based on the meeting place type of the speaking meeting place, and display the image data of the speaking meeting place on the preset display screen in a second size, wherein the second size is larger than the first size;
and an audio output module, configured to output the audio data of the speaking meeting place from the audio channel of the preset display screen.
According to a third aspect, an embodiment of the present invention provides an electronic device, including a memory and a processor that are communicatively connected to each other, wherein computer instructions are stored in the memory, and the processor executes the computer instructions so as to perform the audio and video output method for a telepresence meeting place according to the first aspect or any implementation of the first aspect.
According to a fourth aspect, an embodiment of the present invention provides a computer-readable storage medium storing computer instructions for causing a computer to perform the audio and video output method for a telepresence meeting place according to the first aspect or any implementation of the first aspect.
It should be noted that, for the beneficial effects of the audio and video output apparatus, the electronic device, and the computer-readable storage medium in the embodiments of the present invention, reference is made to the description of the corresponding beneficial effects of the audio and video output method for a telepresence meeting place, which is not repeated here.
Drawings
In order to more clearly illustrate the embodiments of the present invention or the technical solutions in the prior art, the drawings used in the description of the embodiments or the prior art will be briefly described below, and it is obvious that the drawings in the following description are some embodiments of the present invention, and other drawings can be obtained by those skilled in the art without creative efforts.
Fig. 1 is a flowchart of an audio and video output method for a telepresence meeting place according to an embodiment of the present invention;
Fig. 2 is a schematic diagram of the display of image data of the first size and the second size according to an embodiment of the present invention;
Fig. 3 is a flowchart of an audio and video output method for a telepresence meeting place according to an embodiment of the present invention;
Fig. 4 is a schematic diagram of image data display for a telepresence meeting place according to an embodiment of the present invention;
Fig. 5 is a schematic diagram of image data display for a telepresence meeting place according to an embodiment of the present invention;
Fig. 6 is a schematic diagram of image data display for a telepresence meeting place according to an embodiment of the present invention;
Figs. 7a and 7b are schematic diagrams of image data display for a telepresence meeting place according to an embodiment of the present invention;
Fig. 8 is a schematic diagram of image data display for a single-screen meeting place according to an embodiment of the present invention;
Fig. 9 is a block diagram of an audio and video output apparatus for a telepresence meeting place according to an embodiment of the present invention;
Fig. 10 is a schematic diagram of the hardware structure of an electronic device according to an embodiment of the present invention.
Detailed Description
In order to make the objects, technical solutions and advantages of the embodiments of the present invention clearer, the technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are some, but not all, embodiments of the present invention. All other embodiments, which can be obtained by a person skilled in the art without making any creative effort based on the embodiments in the present invention, belong to the protection scope of the present invention.
The audio and video output method for a telepresence meeting place provided by the embodiment of the invention can be used in a conference platform or a conference terminal of a video conference, hereinafter referred to as the electronic device. Before the video conference starts, the electronic device determines the display strategy of each participating meeting place according to the number of participating meeting places and the number of display screens of each telepresence meeting place. During the video conference, the electronic device controls the audio and video output of each participating meeting place according to the corresponding display strategy.
In a multi-party conference, for example when the number of participating meeting places is greater than the number of display screens of a telepresence meeting place, the image data of the participating meeting places is displayed in the telepresence meeting place in two sizes, a first size and a second size. The image data of the participating meeting places corresponding to a display screen is displayed in the first size, and the image data of the speaking meeting place is displayed in the second size.
The number of display screens in a telepresence meeting place is set according to actual requirements, for example 3 or more. The following embodiments take 3 display screens as an example for detailed description. For a telepresence meeting place, the image data comprises 3 channels of main video, namely 3 channels of image data.
The specific audio and video output manner is described in detail below. It should be noted that the method below takes one telepresence meeting place as an example, that is, it describes how audio and video are output in that telepresence meeting place.
In accordance with an embodiment of the present invention, an embodiment of an audio and video output method for a telepresence meeting place is provided. It should be noted that the steps illustrated in the flowcharts of the drawings may be performed in a computer system, such as a set of computer-executable instructions, and that, although a logical order is illustrated in the flowcharts, in some cases the steps illustrated or described may be performed in an order different from that shown here.
In this embodiment, an audio and video output method for a telepresence meeting place is provided, which can be used in an electronic device such as a conference platform. Fig. 1 is a flowchart of the method according to an embodiment of the present invention. As shown in Fig. 1, the flow includes the following steps:
S11, acquiring image data and audio data of the participating meeting places.
The meeting place types of the participating meeting places include the telepresence meeting place.
The image data of a participating meeting place is its video and the audio data is its audio; the image data and audio data of each participating meeting place are sent to the electronic device. The image data and the audio data correspond to each other, so as to keep audio and video consistent.
Before the conference starts, each participating meeting place registers with the electronic device, so the electronic device knows the meeting place type of each participating meeting place.
S12, determining the target display screen corresponding to each participating meeting place based on the correspondence between participating meeting places and display screens in the current telepresence meeting place.
As described above, before the video conference starts, the electronic device determines the display strategy of each participating meeting place. That is, for a telepresence meeting place, the electronic device determines the correspondence between each display screen in that meeting place and the participating meeting places. For example, if the telepresence meeting place includes 3 display screens and there are 9 participating meeting places, the electronic device may divide the participating meeting places equally and allocate the 9 meeting places among the 3 display screens. Other allocation schemes may also be adopted and are not limited here, provided that the image data of the same participating meeting place is displayed on the same display screen. For a single-screen meeting place, the electronic device may determine that the image data of all participating meeting places is displayed in the single-screen meeting place, or determine which participating meeting places' image data needs to be displayed, and so on.
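The equal division mentioned in this example could, for instance, be sketched as follows. The function name `split_evenly`, the contiguous grouping of meeting places, and the rule that any remainder goes to the leftmost screens are assumptions; the description leaves the allocation scheme open.

```python
from typing import Dict, List


def split_evenly(site_ids: List[str], screens: List[str]) -> Dict[str, str]:
    """Divide meeting places into contiguous groups, one group per display screen."""
    base, extra = divmod(len(site_ids), len(screens))
    mapping: Dict[str, str] = {}
    index = 0
    for position, screen in enumerate(screens):
        # Assumption: the first `extra` screens take one additional meeting place.
        count = base + (1 if position < extra else 0)
        for site in site_ids[index:index + count]:
            mapping[site] = screen
        index += count
    return mapping


sites = [f"site{i}" for i in range(1, 10)]   # 9 participating meeting places
print(split_evenly(sites, ["left", "middle", "right"]))
```

Because each meeting place receives exactly one screen, all of its image data stays on the same display screen, which is the only constraint the description imposes.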
When the electronic device stores the correspondence between the participating meeting places and the display screens in the current telepresence meeting place, it uses a unique identifier to represent each participating meeting place. The electronic device queries the correspondence by the identifier of a participating meeting place and can thus determine the target display screen corresponding to that meeting place.
For example, the current telepresence meeting place includes 3 display screens: a left display screen, a middle display screen and a right display screen. There are 4 participating meeting places, namely meeting place 1 to meeting place 4. The correspondence between the participating meeting places and the display screens is as follows: meeting places 1 and 2 correspond to the left display screen, meeting place 3 corresponds to the middle display screen, and meeting place 4 corresponds to the right display screen. The electronic device can determine the corresponding target display screen by using the identifier of the participating meeting place.
S13, displaying the image data of the participating meeting place on the target display screen in the first size.
Each target display screen can show at most a certain number of first-size images. The electronic device needs to take this maximum display number into account when determining the correspondence. If there are too many participating meeting places, the electronic device determines, according to actual requirements, which participating meeting places' image data needs to be displayed, and the participating meeting places that are not displayed are treated as onlooker meeting places. For these onlooker meeting places, image data and audio data are not output.
The first size is smaller than the second size described below; to express this relationship more clearly, the first size may also be called the small size and the second size the large size. After the electronic device determines the target display screen corresponding to a participating meeting place, the image data of that meeting place is displayed on the target display screen in the first size.
Continuing the example above, meeting places 1 and 2 correspond to the left display screen, meeting place 3 corresponds to the middle display screen, and meeting place 4 corresponds to the right display screen. The image data of meeting places 1 and 2 is then displayed on the left display screen in the first size, the image data of meeting place 3 is displayed on the middle display screen in the first size, and the image data of meeting place 4 is displayed on the right display screen in the first size.
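The lookup by identifier and the resulting first-size layout in this example can be sketched as below. The identifiers `site1` to `site4`, the screen names, and the helper names `target_screen` and `small_images_by_screen` are hypothetical and chosen only for illustration.

```python
from collections import defaultdict
from typing import Dict, List

# Correspondence from the example: meeting place identifier -> display screen.
SITE_TO_SCREEN: Dict[str, str] = {
    "site1": "left",
    "site2": "left",
    "site3": "middle",
    "site4": "right",
}


def target_screen(site_id: str) -> str:
    """S12: query the stored correspondence by the meeting place identifier."""
    return SITE_TO_SCREEN[site_id]


def small_images_by_screen(mapping: Dict[str, str]) -> Dict[str, List[str]]:
    """S13: list, per screen, the meeting places shown there in the first size."""
    layout: Dict[str, List[str]] = defaultdict(list)
    for site_id, screen in mapping.items():
        layout[screen].append(site_id)
    return dict(layout)


print(target_screen("site3"))
print(small_images_by_screen(SITE_TO_SCREEN))
```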
And S14, when the participant site is a speaking site and the number of the participant sites is greater than or equal to that of the display screens, determining a preset display screen of the current site based on the site type of the speaking site, and displaying the image data of the speaking site on the preset display screen in a second size.
Wherein the second size is greater than the first size.
Meeting places of different meeting place types provide different numbers of paths of image data; for example, a single-screen meeting place includes 1 path of image data, and a web presence meeting place includes 3 paths of image data. Therefore, the display mode of the image data differs for speaking meeting places of different meeting place types. The electronic equipment uses the meeting place type of the speaking meeting place to determine a preset display screen among the display screens of the current web presence meeting place. The preset display screen may be the same as the target display screen, or may be a plurality of display screens including the target display screen, and so on; it is specifically set according to actual requirements.
After the preset display screen is determined, the electronic equipment displays the image data of the speaking meeting place on the preset display screen in the second size. The image data of the speaking meeting place may be a meeting place image of the speaking meeting place, image data of the image channel where the speaker is located, image data of the speaker, and the like. If the same display screen includes both image data of the first size and image data of the second size, the display mode may be as shown in fig. 2. For example, in fig. 2, A_L, A_M and A_R are the first-size image data of the left, middle and right paths of web presence meeting place A; similarly, B_L, B_M and B_R are the first-size image data of the left, middle and right paths of meeting place B, and C_L, C_M and C_R are the first-size image data of the left, middle and right paths of meeting place C; X_y is the second-size image data of the speaking meeting place.
And S15, outputting the audio data of the speaking meeting place from an audio channel of a preset display screen.
For audio data, the electronic device may output only the audio data of the speaking meeting place. In that case, the audio data of the speaking meeting place is output from the audio channel of the preset display screen, so that the image data and the audio data are output from the same position. Alternatively, the electronic device may also output the audio data of the other participating meeting places, that is, the audio data of each other participating meeting place is output from the audio channel of the target display screen corresponding to that meeting place. To prevent the audio data of the other participating meeting places from interfering with the audio data of the speaking meeting place, the volume of the audio of the other participating meeting places may be adjusted.
According to the audio and video output method of the web presence meeting place, the image data of each participating meeting place is displayed in the first size on the target display screen corresponding to that meeting place; for the speaking meeting place among the participating meeting places, the image data is displayed on the target display screen in the first size and is also displayed on the preset display screen in the second size. In this way the image data of every participating meeting place is displayed in order, and the speaking meeting place is shown at life size (1:1). At the same time, the audio data of the speaking meeting place corresponds to its image data and is output from the audio channel of the preset display screen, which preserves the effect of locating a speaker by sound in the current web presence meeting place.
In this embodiment, an audio/video output method for a presence meeting place is provided, which can be used for electronic devices, such as a meeting platform, and fig. 3 is a flowchart of the audio/video output method for a presence meeting place according to an embodiment of the present invention, and as shown in fig. 3, the flowchart includes the following steps:
and S21, acquiring image data and audio data of the meeting place.
Wherein the meeting place type of the meeting place comprises a net presenting meeting place.
Please refer to S11 in fig. 1 for details, which are not described herein again.
And S22, determining a target display screen corresponding to the meeting place based on the corresponding relation between the meeting place and the display screen in the meeting place of the current network.
Please refer to S12 in fig. 1, which is not repeated herein.
And S23, displaying the image data of the meeting place on the target display screen in a first size.
Please refer to S13 in fig. 1, which is not repeated herein.
And S24, when the conference places are speaking conference places and the number of the conference places is larger than or equal to that of the display screens, determining the preset display screen of the current conference place based on the conference place type of the speaking conference place, and displaying the image data of the speaking conference place on the preset display screen in a second size.
Wherein the second size is greater than the first size.
For the speaking meeting place, the meeting place type can be a net presenting meeting place or a single-screen meeting place. Based on this, the following description is made for a web presence site and a single screen site, respectively.
In some embodiments, the S24 includes:
and S241, when the meeting place type of the speaking meeting place is a meeting place with a network and the number of the meeting places is larger than that of the display screens, determining that the preset display screens are N display screens in the middle of the meeting place with the current network.
Wherein N is the number of image channels of the speaking meeting place.
For the web presence meeting place, since the web presence meeting place includes N paths of image data, the N paths of image data need to be respectively displayed in different display screens in the second size, and therefore, N display screens need to be determined from the current web presence meeting place as the preset display screens. Further, in order to achieve a better display effect, the preset display screen is determined as the middle N display screens of the current meeting place. For example, the speaking place includes 3 paths of image data, and then the middle 3 display screens of the current network presenting place are taken as the preset display screens. And if the number of the display screens included in the current network-presenting meeting place is 3, determining all the display screens of the current network-presenting meeting place as preset display screens.
And S242, respectively displaying the N paths of image data of the speaking meeting place on the middle N display screens in a second size.
For example, fig. 4 shows 3 display screens of a current web presentation venue, in which image data of a first size and image data of a second size are displayed, respectively. Wherein, 3 display screens from left to right respectively display the image data of the left, middle and right 3 ways of the speaking place. The splicing of the image data of the second size of the 3 display screens is a complete image picture of the speaking meeting place, so that when the image data of the second size is displayed on the preset display screen, the image data of the second size needs to be displayed according to a certain sequence, and then the image data of the second size can be spliced in sequence visually to obtain the complete image data of the speaking meeting place.
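Steps S241 and S242 can be sketched as follows. This is a minimal sketch, not the embodiment's implementation: screens are assumed to be indexed from 0 at the far left, the function names are hypothetical, and when the middle N screens cannot be centered exactly the sketch leans toward the left, which is an assumption the text does not specify.

```python
def middle_screens(total_screens, n_channels):
    """Return the indices (0 = leftmost) of the middle n_channels screens."""
    if n_channels > total_screens:
        raise ValueError("speaking meeting place has more image channels than screens")
    # Integer division leans left when exact centering is impossible (assumption).
    start = (total_screens - n_channels) // 2
    return list(range(start, start + n_channels))


def assign_channels(total_screens, channels):
    """Map each image channel, kept in left-to-right order, to one middle screen
    so that the second-size images can be stitched visually in sequence."""
    screens = middle_screens(total_screens, len(channels))
    return dict(zip(screens, channels))
```

For a current meeting place with 3 display screens and a speaking meeting place with 3 image channels, every display screen becomes a preset display screen, as in the example above.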
In other embodiments, S24 includes:
(1) And when the meeting place type of the speaking meeting place is a web presence meeting place and the number of the meeting places is equal to the number of the display screens, determining the preset display screen as the corresponding target display screen.
(2) And acquiring an image of a speaking passage of the speaking conference place.
(3) And displaying the speaking passage image on the target display screen in a second size, and displaying the latest speaking passage image data of other participating sites on the target display screens corresponding to the other participating sites in the second size.
The other conference places refer to other conference places except the speaking conference place. When the number of the meeting places is equal to that of the display screens, the meeting places are presented in the current network, and the meeting places correspond to the display screens one by one. That is, each display screen displays image data of one participating venue. For example, fig. 5 shows a display example of a current web presentation site of 3 display screens. Image data of meeting place a, meeting place B, and meeting place C are displayed from left to right, respectively. The conference places A to C are all network conference places, image data of a first size corresponding to the conference places are respectively displayed in each display screen, for example, in a left display screen, 3 paths of image data of the left, middle and right of the conference place A are displayed in the first size; in the middle display screen, displaying the left, middle and right 3-path image data of the meeting place B in a first size; in the right display screen, the 3-way image data of the left, middle, and right of the participating site C is displayed in the first size.
If the speaking meeting place is A, it is first determined which image channel of speaking meeting place A the speaker is in; this may be determined from the position of the speaker, etc., and is not limited herein. After the speaking channel corresponding to the speaker is determined, the speaking channel image is displayed in the second size on the left display screen. For example, if the speaking channel image is determined to be A_L, then A_L is displayed on the left display screen in the second size.
Meanwhile, as shown in fig. 5, the middle display screen and the right display screen display, in the second size, the speaking channel images corresponding to the most recent speakers of participating meeting places B and C. For example, if the speaking channel image corresponding to the most recent speaker in meeting place B is B_M and the speaking channel image corresponding to the most recent speaker in meeting place C is C_R, then the current image data of B_M is displayed in the second size on the middle display screen, and the current image data of C_R is displayed in the second size on the right display screen.
For example, as shown in fig. 5, there are 3 meeting places and 3 display screens of the web-based meeting place, and then each display screen of the web-based meeting place displays exactly one meeting place. And for the current network meeting place, displaying a large picture and a small picture of the network meeting place corresponding to the display screen: the image data of the second size is the image of the speaker with the largest voice energy in the meeting place, and the image data of the first size is a panoramic image formed by splicing the three images in the meeting place. For a single-screen meeting place, the only one path of image of the single-screen meeting place can be directly displayed on one screen.
In terms of functional division, the internal data processing logic of the electronic device may include an audio processing device, a member management module and an image management module. The audio processing device is responsible for detecting the volume value of each current participant and maintaining the mixing list; the member management module manages all the meeting places and is the main logic execution unit of this embodiment; the image management module is responsible for image styles and member management. On this basis, the "one large, three small" display of a web presence meeting place is generated as follows. First, the audio processing device detects the channel Ax with the maximum volume among the multiple audio channels of the web presence meeting place and notifies the member management module of the serial number of Ax; the member management module then notifies the image management module to update the image display mode. That is, the previous large picture is cancelled, the Ax image is taken as the large image, and the three image paths of the web presence meeting place are arranged side by side directly below the center of the large image, so that a four-picture image with one large picture and three small pictures is synthesized. The processing of the other two participating meeting places B and C is identical to that of A. The sounds of meeting places A, B and C are output from their corresponding display screens respectively. As shown in fig. 6, when all the participating meeting places are single-screen meeting places, the large picture is the image of the meeting place where the audio processing device detects the maximum volume, and the images of all three meeting places are displayed below it as small images.
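The flow just described (pick the loudest channel, then rebuild the picture with one large and three small images) can be sketched as below. The function names, the dictionary layout format and the volume values are all hypothetical; the sketch only illustrates the selection logic, not the actual image synthesis.

```python
def loudest_channel(volumes):
    """Return the name of the channel with the highest detected volume.
    On a tie, the channel listed first wins (an assumption of this sketch)."""
    return max(volumes, key=volumes.get)


def compose_layout(volumes, order=("L", "M", "R")):
    """Describe the picture for one display screen: the loudest channel is the
    large image, and all three channels sit side by side below it."""
    return {"large": loudest_channel(volumes), "small": list(order)}
```

For instance, if the middle channel of a meeting place is the loudest, compose_layout returns that channel as the large image while the small row keeps the left, middle, right order.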
When the conference place type of the speaking conference place is a conference place with a net and the number of the conference places is consistent with that of the display screens, the conference places correspond to the display screens one by one, and for the speaking conference place, the speaking passage image is displayed on the corresponding target display screen in the second size, so that the switching of the speaking conference place is avoided, the data processing amount is reduced, and the real-time display of the image data is ensured.
In other embodiments, S24 includes:
(1) And when the meeting place type of the speaking meeting place is a single-screen meeting place and the number of the meeting places is larger than that of the display screens, determining that the preset display screen is a middle display screen of the meeting place in the current network.
(2) And displaying the conference site image of the speaking conference site on the middle display screen in a second size, and displaying the preset image data on other display screens of the current network presentation conference site in the second size.
For a single-screen meeting place, the middle display screen of the meeting place with the current network is determined as a preset display screen. And if the number of the display screens of the meeting place of the current network is even, the middle display screen is any one of the two middle display screens. Regarding the selection strategy of the intermediate display in this case, the configuration may be performed before the conference starts, and so on. And displaying the conference site image of the speaking conference site in a second size on the middle display screen, and displaying preset image data in the second size on other display screens. The preset image data is a static image, and specific display content is set according to actual requirements.
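Choosing the middle display screen, including the even-count case that is configured before the conference starts, might look like the sketch below. The prefer_left flag is a hypothetical stand-in for that pre-conference configuration, and screens are assumed to be indexed from 0 at the far left.

```python
def middle_screen_index(total_screens, prefer_left=True):
    """Return the index (0 = leftmost) of the middle display screen.
    With an even number of screens either of the two central screens is
    valid; prefer_left models the pre-configured selection strategy."""
    if total_screens < 1:
        raise ValueError("at least one display screen is required")
    if total_screens % 2 == 1:
        return total_screens // 2
    right_of_center = total_screens // 2
    return right_of_center - 1 if prefer_left else right_of_center
```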
When the speaking meeting place is a single-screen meeting place, the meeting place image of the speaking meeting place is displayed on the middle display screen and preset image data is displayed on the other display screens, so that the single-screen meeting place is shown at life size (1:1).
As an optional implementation manner of this embodiment, when the meeting place type of the speaking meeting place is a single-screen meeting place and the speaking meeting place is not the meeting place corresponding to the current image displayed in the second size on the middle display screen, step (2) of S24 includes:
2.1 Store the current image displayed in the second size in the intermediate display screen.
2.2 Replace the current image with the second-size image data of the speaking meeting place.
2.3 Replace the oldest speaking image data in the other display screen with the current image.
When the speaking meeting place switches to a single-screen meeting place that is not the meeting place corresponding to the current image displayed in the second size on the middle display screen, as shown in fig. 6, the image of the speaking meeting place is recorded as MT_C. MT_C then replaces the image data of the current middle screen, and the image previously on the middle screen is recorded as MT_A. If MT_A is image data of a single-screen meeting place, MT_A needs to be moved to the left or right screen. Specifically, if the left screen is not showing a single-screen meeting place, MT_A replaces the left screen; otherwise the right screen is checked, and if the right screen is not showing a single-screen meeting place, MT_A replaces the right screen. If the left screen and the right screen are both showing single-screen meeting places, the preceding speaking meeting places were all single-screen meeting places, and the single-screen meeting place that spoke earliest is replaced. When single-screen meeting places speak consecutively, all three screens show images of single-screen meeting places, and the middle screen always shows the current speaker.
For a single-screen meeting place, the current image of the middle display screen replaces the image data that spoke earliest on the other display screens; that is, the preset image data on the other display screens is updated according to how long ago each meeting place spoke. The other display screens therefore show the image data of the most recent speakers, so that participants can see at a glance who has spoken most recently.
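The replacement rule for consecutive single-screen speakers can be sketched as follows. The Shown record, its spoke_at timestamp and the switch_speaker function are hypothetical names introduced for illustration; the sketch assumes three display screens and does not handle a meeting place that is already shown on a side screen speaking again.

```python
from dataclasses import dataclass


@dataclass
class Shown:
    site: str            # meeting place id, or a placeholder name for a preset image
    single_screen: bool  # True when the image comes from a single-screen meeting place
    spoke_at: float      # time the meeting place last spoke (hypothetical timestamp)


def switch_speaker(screens, speaker):
    """Update {'left', 'middle', 'right'} -> Shown in place for a new
    single-screen speaking meeting place and return the same dict."""
    previous = screens["middle"]
    screens["middle"] = speaker
    # Nothing to move if the speaker was already in the middle, or if the
    # previous middle image was not from a single-screen meeting place.
    if previous.site == speaker.site or not previous.single_screen:
        return screens
    # Prefer the left screen, then the right, if it is not a single-screen image.
    for side in ("left", "right"):
        if not screens[side].single_screen:
            screens[side] = previous
            return screens
    # Both sides are single-screen images: replace the one that spoke earliest.
    oldest = min(("left", "right"), key=lambda s: screens[s].spoke_at)
    screens[oldest] = previous
    return screens
```

After three different single-screen meeting places speak in a row, all three screens hold single-screen images and the middle screen holds the current speaker, as described above.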
In another embodiment, S24 includes:
(1) And when the meeting place type of the speaking meeting place is a single-screen meeting place and the number of the meeting places is equal to the number of the display screens, determining that the preset display screen is a corresponding target display screen.
(2) And acquiring a meeting place image of the speaking meeting place.
(3) And displaying the conference site images of the speaking conference sites on the target display screen in a second size, and displaying the latest speaking images of other conference sites on the corresponding target display screens in the second size.
When the number of the conference participating sites is equal to the number of the display screens, no matter the speaking site is a single-screen conference site or a network presentation conference site, the image data of the speaking site is displayed in a target display screen corresponding to the current network presentation conference site of the speaking site, namely, the image of the first size and the image of the second size are displayed on the same display screen. And when the conference place type of the speaking conference place is the single-screen conference place, displaying the conference place image of the single-screen conference place on the target display screen in a second size, and for other conference places, displaying the respective latest speaking image on the target display screens corresponding to the other conference places in the second size.
For example, suppose there are 3 participating meeting places and the current web presence meeting place has 3 display screens, namely a left display screen, a middle display screen and a right display screen; the left display screen is the target display screen of the single-screen meeting place, and the middle display screen and the right display screen correspond to the first web presence meeting place and the second web presence meeting place respectively. If the speaking meeting place is the single-screen meeting place, the meeting place image of the single-screen meeting place is displayed in the second size on the left display screen; the latest speaking channel image of the first web presence meeting place is displayed in the second size on the middle display screen, and the latest speaking channel image of the second web presence meeting place is displayed in the second size on the right display screen.
And S25, outputting the audio data of the speaking meeting place from an audio channel of a preset display screen.
Specifically, the above S25 includes:
and S251, outputting the audio data of the speaking meeting place from the audio channel of the preset display screen where the image data of the second size is located based on the corresponding relation between the audio data of the speaking meeting place and the image data of the second size.
For the speaking place, the image data and the audio data of the second size are correspondingly output. For example, image data of the second size is output from the intermediate display screen, and accordingly, audio data of the speaking place is output from the intermediate display screen.
The electronic equipment firstly determines a preset display screen where the image data of the second size of the speaking meeting place is located, and then outputs the audio data of the speaking meeting place from an audio channel of the preset display screen.
And S252, outputting the audio data of the other meeting places from the audio channels of the target display screens corresponding to the other meeting places.
And for the audio data of other meeting places, outputting the audio data from the audio channel of the target display screen corresponding to each meeting place. For example, if meeting place 1 corresponds to the left display screen, then the audio data of meeting place 1 is output from the audio channel of the left display screen; if the meeting place 2 corresponds to the middle display screen, the audio data of the meeting place 2 is output from the audio channel of the middle display screen.
For example, three video queues are first defined: V_L, V_M and V_R, corresponding to the left, middle and right display screens, and A_L, A_M and A_R are the three corresponding audio queues. Once the video queues are confirmed, the audio corresponding to V_L is inserted into A_L, the audio corresponding to V_M into A_M, and the audio corresponding to V_R into A_R. That is, after the image distribution is completed, the audio streams corresponding to V_L, V_M and V_R are added to A_L, A_M and A_R respectively. For a speaking web presence meeting place, the sound is not added to only one of A_L, A_M and A_R; instead, the left, middle and right audio of the speaking meeting place are added to the A_L, A_M and A_R queues respectively, which ensures that the person in the second-size image and the corresponding sound are output from the same display screen.
According to the audio and video output method for the web presence meeting place, when the meeting place type of the speaking meeting place is a web presence meeting place and the number of meeting places is greater than the number of display screens, the N paths of image data of the speaking meeting place are displayed on the middle N display screens respectively, so that characteristics such as life-size (1:1) display of participants, locating a speaker by sound, and eye contact are not lost, and communication efficiency is improved. The audio data is output according to the correspondence between the audio data of the speaking meeting place and the second-size image data, thereby realizing the effect of locating a speaker by sound.
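The queue handling in this example can be sketched as below. It assumes, for illustration only, that each non-speaking meeting place contributes one mixed audio stream (labelled "site:mix") and that the speaking meeting place is a web presence meeting place with separate left, middle and right audio; the function name and the string labels are hypothetical.

```python
SCREENS = ("L", "M", "R")


def build_audio_queues(video_queues, speaking_site=None):
    """video_queues maps a screen ('L', 'M', 'R') to the meeting places shown
    on it; the result maps each screen to the audio streams it should play."""
    audio = {screen: [] for screen in SCREENS}
    # Audio of each displayed meeting place follows its video queue.
    for screen in SCREENS:
        for site in video_queues.get(screen, []):
            if site != speaking_site:
                audio[screen].append(f"{site}:mix")
    # The speaking meeting place's left/middle/right audio is spread over all
    # three queues so sound and second-size image come from the same screen.
    if speaking_site is not None:
        for screen in SCREENS:
            audio[screen].append(f"{speaking_site}:{screen}")
    return audio
```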
In some embodiments, if the number of display screens of the current web presence meeting place is greater than the number of participating meeting places, the number of display screens used may be determined by the number of participating meeting places, and an initial image may be displayed on the unused display screens, or the unused display screens may be left disabled. If the current web presence meeting place has 3 display screens and there are 2 participating meeting places, both of which are web presence meeting places, the left, middle and right image data of the opposite meeting place are displayed on the 3 display screens of the current web presence meeting place respectively; similarly, the left, middle and right image data of the current web presence meeting place are displayed on the 3 display screens of the opposite meeting place.
For example, the video conference includes 2 meeting places, namely meeting place A and meeting place B, both of which are web presence meeting places. As shown in fig. 7a, the 3 paths of image data of meeting place B, namely B_L, B_M and B_R, are displayed on the 3 display screens of meeting place A respectively; the 3 paths of image data of meeting place A, namely A_L, A_M and A_R, are displayed on the 3 display screens of meeting place B respectively.
In some embodiments, the display screen of a single-screen meeting place is used to display image data of participating meeting places in the first size and image data of a meeting place in the second size. If the speaking meeting place is a web presence meeting place, the second-size image data is the speaking channel image of that web presence meeting place.
As shown in fig. 8, since the single-screen meeting place has only one display screen, the three image paths of the web presence meeting place need to be displayed together on one screen. The second-size image data is the image of the speaker in the meeting place; the large image may be specified manually, or it may be switched automatically among the left, middle and right images according to the volume of the multiple audio channels of the web presence meeting place. The three image paths B_L, B_M and B_R of the web presence meeting place are reduced and displayed side by side below it, forming a continuous panoramic image. Similarly, if the complete image of another web presence meeting place needs to be displayed on one screen of a web presence meeting place, the same display mode is used.
When the conference has only two participating meeting places, A and B, A and B each see the other's image. If A is a single-screen meeting place and B is a web presence meeting place, the middle display screen of meeting place B displays the image data of meeting place A, and the display screen of meeting place A displays the composite image of meeting place B with one large and three small pictures, for example as shown in fig. 8; if both are web presence meeting places, the three screens of meeting place A and meeting place B correspond one to one, as shown in fig. 7a and fig. 7b.
In some embodiments, the audio/video output method of the web presence site further includes:
(1) And acquiring the number of display screens of the current network-presented meeting places and the number of meeting places.
(2) And distributing all the meeting places based on the number of the display screens and the number of the meeting places, and determining the corresponding relation between the meeting places and the display screens of the meeting places on the current network.
Before the video conference starts, the display strategy of each participating meeting place needs to be determined; that is, the participating meeting places are allocated based on the number of display screens of the current web presence meeting place and the number of participating meeting places. When the participating meeting places are allocated, they may be distributed evenly across the display screens, or distributed according to the number of paths of image data of each participating meeting place, and so on. The allocation method is not limited, provided that the image data of the same meeting place is displayed on the same display screen. After the meeting places are allocated, the meeting places corresponding to each display screen of the current web presence meeting place can be determined.
The conference places are distributed based on the number of the display screens of the current network conference places, namely, the conference places are distributed by taking the conference places as units, and image data in the conference places are not taken as units, so that the image data of the same conference place can be distributed to the same display screen for display.
In some optional embodiments, when the number of meeting places is greater than the number of display screens, the step (2) further includes:
2.1 The number of displays of images of a first size in the display screen is acquired.
2.2 The meeting place with the meeting place type of the web-based meeting place is distributed, so that the image data with the first size of the web-based meeting place is displayed on the same display screen of the current web-based meeting place.
2.3 Based on the assignment result and the number of image displays, the number of remaining image displays of the first size in the display screen is determined.
2.4 The number of remaining image displays is used to allocate the conference site whose type is the single screen conference site.
For a web presence meeting place, the image style of each display screen is a "one large, N small" mode: the large image is the second-size image data of the most recent speaker, and the N small images are the first-size image data of the meeting places allocated to that display screen. For example, N is at most 9, so the three screens hold 27 small images in total.
When the electronic equipment allocates meeting places, it first determines the image display number of the first size for a display screen, that is, the maximum number of first-size images that the display screen can show. For audio, when more than three parties speak simultaneously the speech becomes hard to make out, so mixing in more voices has little value. Similarly, displaying too many videos at once makes each individual picture unclear. Taking the output characteristics of both audio and video into account, display scenes of at most 27 parties are defined; if more than 27 parties take part in the conference, the additional parties can only participate as audience and are not displayed to the other meeting places.
When the meeting place is distributed, the meeting places with the meeting place types of the network presenting meeting places are distributed firstly in combination with the image display quantity of the first size. And then supplementing the display quantity of the residual images with the first size by using the meeting place of the single-screen meeting place.
When the participating meeting places are allocated, web presence meeting places are allocated first. A web presence meeting place has N paths of image data and therefore occupies more image display area, so the web presence meeting places are allocated first and the single-screen meeting places are then used to fill the remaining image display number, which further ensures that the image data of the same participating meeting place is displayed on the same display screen.
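Steps 2.1 to 2.4 can be sketched as a small allocation routine. This is only one possible reading: the embodiment does not fix how a display screen is chosen, so the "most remaining room, leftmost on a tie" rule, the function name allocate_sites and the (site id, channel count) input format are all assumptions of this sketch.

```python
def allocate_sites(sites, num_screens=3, per_screen=9):
    """sites is a list of (site_id, n_channels); n_channels > 1 marks a web
    presence meeting place. Returns (assignment, onlookers, remaining), where
    assignment maps site_id -> screen index and remaining lists the free
    first-size image slots left on each screen."""
    remaining = [per_screen] * num_screens  # step 2.1: slots per screen
    assignment, onlookers = {}, []
    # Multi-channel meeting places first, single-screen ones afterwards;
    # sorted() is stable, so the original order is kept within each group.
    ordered = sorted(sites, key=lambda s: s[1] <= 1)
    for site_id, n_channels in ordered:
        # Pick the screen with the most free slots (leftmost on a tie).
        screen = max(range(num_screens), key=lambda i: remaining[i])
        if remaining[screen] >= n_channels:
            remaining[screen] -= n_channels  # all channels stay on one screen
            assignment[site_id] = screen
        else:
            onlookers.append(site_id)  # no room left: not displayed
    return assignment, onlookers, remaining
```

With the default of 3 screens and 9 slots per screen the routine displays at most 27 first-size images, and any meeting place that no longer fits is returned in the onlookers list.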
The audio and video output method for the network presence meeting place provided by the embodiment uniformly manages the display and playing of the audio and video in a mode of combining the image and the sound according to the network presence meeting place as a unit. For all meeting places participating in the meeting, the seen images are a whole, and the audios and the videos are in one-to-one correspondence. All important parties are displayed in a centralized way, so that the display effect of the network presence meeting place under the condition of participation of multiple parties is effectively improved.
This embodiment also provides an audio/video output device for a web presence meeting place, which is used to implement the foregoing embodiments and preferred embodiments; what has already been described is not repeated. As used below, the term "module" may be a combination of software and/or hardware that implements a predetermined function. Although the device described in the embodiments below is preferably implemented in software, an implementation in hardware, or in a combination of software and hardware, is also possible and contemplated.
The present embodiment provides an audio/video output device for a network meeting place, as shown in fig. 9, including:
an obtaining module 41, configured to obtain image data and audio data of a meeting place, where a meeting place type of the meeting place includes a web presence meeting place;
a determining module 42, configured to determine a target display screen corresponding to the meeting place based on a correspondence between the meeting place and a display screen in the meeting place of the current network;
a first display module 43, configured to display image data of the meeting place on the target display screen in a first size;
a second display module 44, configured to determine, when the participant site is a talk-spurt site and the number of the participant sites is greater than or equal to the number of the display screens, a preset display screen of the current web presence site based on a site type of the talk-spurt site, and display image data of the talk-spurt site on the preset display screen in a second size, where the second size is greater than the first size;
and an audio output module 45, configured to output the audio data of the speaking conference place from the audio channel of the preset display screen.
In some embodiments, the second display module 44 includes:
a first determining unit, configured to determine, when the meeting place type of the speaking meeting place is a telepresence meeting place and the number of participating meeting places is greater than the number of display screens, that the preset display screens are the middle N display screens of the current telepresence meeting place, where N is the number of image channels of the speaking meeting place;
and a first display unit, configured to display the N channels of image data of the speaking meeting place on the middle N display screens, respectively, in the second size.
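Choosing the middle N display screens can be sketched as below. The function names and the zero-based, left-to-right screen numbering are assumptions for illustration. The source does not say how to center N screens when the difference between the screen count and N is odd; this sketch rounds toward the left.

```python
def middle_screens(num_screens, num_channels):
    """Indices of the middle num_channels screens, numbered 0..num_screens-1 left to right."""
    if not 1 <= num_channels <= num_screens:
        raise ValueError("channel count must be between 1 and the number of screens")
    start = (num_screens - num_channels) // 2  # rounds toward the left on odd leftovers
    return list(range(start, start + num_channels))


def place_speaker(num_screens, channel_images):
    """Map each channel image of the speaking venue to one of the middle screens, in order."""
    screens = middle_screens(num_screens, len(channel_images))
    return dict(zip(screens, channel_images))


placement = place_speaker(5, ["left", "center", "right"])
```

With N = 1 the same helper returns the single middle display screen, which matches the case where the speaking meeting place is a single-screen meeting place.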
In some embodiments, the second display module 44 includes:
a second determining unit, configured to determine that the preset display screen is the corresponding target display screen when the number of participating meeting places is equal to the number of display screens;
a first obtaining unit, configured to obtain image data of the speaking meeting place, where the image data of the speaking meeting place is the image of the speaking channel of the speaking meeting place when the speaking meeting place is a telepresence meeting place, and is the meeting place image of the speaking meeting place when the speaking meeting place is a single-screen meeting place;
and a second display unit, configured to display the image data of the speaking meeting place on the target display screen in the second size, and to display, in the second size, the most recent speaking image data of each other participating meeting place on the target display screen corresponding to that meeting place.
In some embodiments, the second display module 44 includes:
a third determining unit, configured to determine that the preset display screen is the middle display screen of the current telepresence meeting place when the meeting place type of the speaking meeting place is a single-screen meeting place and the number of participating meeting places is greater than the number of display screens;
and a third display unit, configured to display the meeting place image of the speaking meeting place on the middle display screen in the second size, and to display preset image data on the other display screens of the current telepresence meeting place in the second size.
In some embodiments, when the speaking meeting place differs from the meeting place whose image is currently displayed in the second size on the middle display screen, the third display unit includes:
a storage subunit, configured to store the current image displayed in the second size on the middle display screen;
a first replacement subunit, configured to replace the current image with the second-size image data of the speaking meeting place;
and a second replacement subunit, configured to replace, with the stored current image, the image data on the other display screens whose speech is the earliest.
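The three subunits above amount to a small rotation: save the middle image, put the new speaker in the middle, and push the saved image onto the side screen whose occupant spoke longest ago. The sketch below is one possible reading. The `Shown` record, the `on_new_speaker` function, and the use of a numeric timestamp to order speeches are hypothetical. It also assumes the new speaker is not already shown on a side screen, a case the source does not address.

```python
from dataclasses import dataclass


@dataclass
class Shown:
    venue: str
    last_spoke: float  # hypothetical timestamp of the venue's most recent speech


def on_new_speaker(middle, others, speaker, now):
    """Return (new_middle, new_others) after `speaker` starts talking at time `now`."""
    if speaker == middle.venue:
        # Same venue is already in the middle: only refresh its timestamp.
        return Shown(speaker, now), list(others)
    saved = middle                      # storage subunit: keep the current middle image
    new_middle = Shown(speaker, now)    # first replacement subunit
    new_others = list(others)
    if new_others:
        # second replacement subunit: overwrite the side image that spoke earliest
        oldest = min(range(len(new_others)), key=lambda i: new_others[i].last_spoke)
        new_others[oldest] = saved
    return new_middle, new_others


middle, others = on_new_speaker(
    Shown("A", 10.0), [Shown("B", 5.0), Shown("C", 2.0)], speaker="D", now=20.0
)
```

Here venue C, which spoke earliest, leaves the wall; A moves from the middle to C's former screen, and D takes the middle.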
In some embodiments, audio output module 45 includes:
a first output unit, configured to output the audio data of the speaking meeting place from the audio channel of the preset display screen on which the second-size image data is displayed, based on the correspondence between the audio data of the speaking meeting place and the second-size image data;
and a second output unit, configured to output the audio data of the other participating meeting places from the audio channels of the target display screens corresponding to those meeting places.
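The routing rule of the two output units can be sketched as a lookup. The `route_audio` function and its arguments are hypothetical names. For simplicity the sketch assumes a single preset display screen for the speaker and one audio channel per display screen, identified by the screen index.

```python
def route_audio(speaker, preset_screen, target_screen_of):
    """Return {venue: screen index whose audio channel plays that venue's audio}."""
    # Other venues: audio follows the target screen that shows their image.
    routes = dict(target_screen_of)
    # Speaking venue: audio follows the preset screen showing its second-size image.
    routes[speaker] = preset_screen
    return routes


routes = route_audio("B", preset_screen=1, target_screen_of={"A": 0, "B": 0, "C": 2})
```

Keeping audio tied to the screen that shows the matching image is what gives the one-to-one correspondence between sound and picture described in this embodiment.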
In some embodiments, the apparatus further comprises:
an acquisition module, configured to acquire the number of display screens of the current telepresence meeting place and the number of participating meeting places;
and an allocation module, configured to allocate all the participating meeting places based on the number of display screens and the number of participating meeting places, and to determine the correspondence between the participating meeting places and the display screens of the current telepresence meeting place.
In some embodiments, when the number of participating meeting places is greater than the number of display screens, the allocation module includes:
a second acquisition unit, configured to acquire the number of first-size images that can be displayed on the display screen;
a first allocation unit, configured to allocate the participating meeting places whose meeting place type is a telepresence meeting place, so that the first-size image data of each telepresence meeting place is displayed on the same display screen of the current telepresence meeting place;
a fourth determining unit, configured to determine the remaining number of first-size image displays on the display screens based on the allocation result and the image display number;
and a second allocation unit, configured to allocate the participating meeting places whose meeting place type is a single-screen meeting place using the remaining image display number.
The audio and video output device for a telepresence meeting place in this embodiment is presented in the form of functional units, where a unit refers to an application-specific integrated circuit (ASIC), a processor and memory executing one or more software or firmware programs, and/or other devices that can provide the functions described above.
Further functional descriptions of the modules are the same as those of the corresponding method embodiments and are not repeated here.
An embodiment of the present invention further provides an electronic device, which includes the audio and video output device for a telepresence meeting place shown in fig. 9.
Fig. 10 is a schematic structural diagram of an electronic device according to an alternative embodiment of the present invention. As shown in fig. 10, the electronic device may include at least one processor 51, such as a CPU (Central Processing Unit), at least one communication interface 53, a memory 54, and at least one communication bus 52. The communication bus 52 is used to enable connection and communication between these components. The communication interface 53 may include a display and a keyboard, and optionally may also include a standard wired interface and a standard wireless interface. The memory 54 may be a high-speed RAM (volatile random access memory) or a non-volatile memory, such as at least one disk memory. The memory 54 may optionally be at least one storage device located remotely from the processor 51. The processor 51 may be combined with the device described in fig. 9, the memory 54 stores an application program, and the processor 51 calls the program code stored in the memory 54 to perform any of the above method steps.
The communication bus 52 may be a Peripheral Component Interconnect (PCI) bus or an Extended Industry Standard Architecture (EISA) bus. The communication bus 52 may be divided into an address bus, a data bus, a control bus, and the like. For ease of illustration, only one thick line is shown in FIG. 10, but this is not intended to represent only one bus or type of bus.
The memory 54 may include a volatile memory, such as a random-access memory (RAM); the memory may also include a non-volatile memory, such as a flash memory, a hard disk drive (HDD) or a solid-state drive (SSD); the memory 54 may also comprise a combination of the above types of memories.
The processor 51 may be a Central Processing Unit (CPU), a Network Processor (NP), or a combination of a CPU and an NP.
The processor 51 may further include a hardware chip. The hardware chip may be an application-specific integrated circuit (ASIC), a programmable logic device (PLD), or a combination thereof. The PLD may be a complex programmable logic device (CPLD), a field-programmable gate array (FPGA), generic array logic (GAL), or any combination thereof.
Optionally, the memory 54 is also used to store program instructions. The processor 51 may call the program instructions to implement the audio and video output method for a telepresence meeting place as shown in any embodiment of the present application.
An embodiment of the present invention also provides a non-transitory computer storage medium storing computer-executable instructions that can execute the audio and video output method for a telepresence meeting place in any of the above method embodiments. The storage medium may be a magnetic disk, an optical disk, a read-only memory (ROM), a random access memory (RAM), a flash memory, a hard disk drive (HDD), or a solid-state drive (SSD); the storage medium may also comprise a combination of memories of the kinds described above.
Although the embodiments of the present invention have been described in conjunction with the accompanying drawings, those skilled in the art can make various modifications and variations without departing from the spirit and scope of the invention, and such modifications and variations fall within the scope defined by the appended claims.

Claims (10)

1. An audio and video output method for a telepresence meeting place, characterized by comprising:
acquiring image data and audio data of a participating meeting place, wherein the meeting place types of the participating meeting places include a telepresence meeting place;
determining a target display screen corresponding to the participating meeting place based on the correspondence between the participating meeting place and the display screens of the current telepresence meeting place;
displaying the image data of the participating meeting place on the target display screen in a first size;
when the participating meeting place is a speaking meeting place and the number of participating meeting places is greater than or equal to the number of display screens, determining a preset display screen of the current telepresence meeting place based on the meeting place type of the speaking meeting place, and displaying the image data of the speaking meeting place on the preset display screen in a second size, wherein the second size is greater than the first size;
and outputting the audio data of the speaking meeting place from the audio channel of the preset display screen.
2. The method of claim 1, wherein the determining a preset display screen of the current telepresence meeting place based on the meeting place type of the speaking meeting place, and displaying the image data of the speaking meeting place on the preset display screen in a second size comprises:
when the meeting place type of the speaking meeting place is a telepresence meeting place and the number of participating meeting places is greater than the number of display screens, determining that the preset display screens are the middle N display screens of the current telepresence meeting place, wherein N is the number of image channels of the speaking meeting place;
and displaying the N channels of image data of the speaking meeting place on the middle N display screens, respectively, in the second size.
3. The method of claim 1, wherein the determining a preset display screen of the current telepresence meeting place based on the meeting place type of the speaking meeting place, and displaying the image data of the speaking meeting place on the preset display screen in a second size comprises:
when the number of participating meeting places is equal to the number of display screens, determining that the preset display screen is the corresponding target display screen;
acquiring image data of the speaking meeting place, wherein the image data of the speaking meeting place is the image of the speaking channel of the speaking meeting place when the speaking meeting place is a telepresence meeting place, and is the meeting place image of the speaking meeting place when the speaking meeting place is a single-screen meeting place;
and displaying the image data of the speaking meeting place on the target display screen in the second size, and displaying, in the second size, the most recent speaking image data of each other participating meeting place on the target display screen corresponding to that meeting place.
4. The method of claim 1, wherein the determining a preset display screen of the current telepresence meeting place based on the meeting place type of the speaking meeting place, and displaying the image data of the speaking meeting place on the preset display screen in a second size comprises:
when the meeting place type of the speaking meeting place is a single-screen meeting place and the number of participating meeting places is greater than the number of display screens, determining that the preset display screen is the middle display screen of the current telepresence meeting place;
and displaying the meeting place image of the speaking meeting place on the middle display screen in the second size, and displaying preset image data on the other display screens of the current telepresence meeting place in the second size.
5. The method of claim 4, wherein when the speaking meeting place differs from the meeting place whose image is currently displayed in the second size on the middle display screen, the displaying the meeting place image of the speaking meeting place on the middle display screen in the second size and displaying preset image data on the other display screens of the current telepresence meeting place in the second size comprises:
storing the current image displayed in the second size on the middle display screen;
replacing the current image with the second-size image data of the speaking meeting place;
and replacing, with the stored current image, the image data on the other display screens whose speech is the earliest.
6. The method of claim 1, wherein the outputting the audio data of the speaking meeting place from the audio channel of the preset display screen comprises:
outputting the audio data of the speaking meeting place from the audio channel of the preset display screen on which the second-size image data is displayed, based on the correspondence between the audio data of the speaking meeting place and the second-size image data;
and outputting the audio data of the other participating meeting places from the audio channels of the target display screens corresponding to those meeting places.
7. The method according to any one of claims 1-6, further comprising:
acquiring the number of display screens of the current telepresence meeting place and the number of participating meeting places;
and allocating all the participating meeting places based on the number of display screens and the number of participating meeting places, and determining the correspondence between the participating meeting places and the display screens of the current telepresence meeting place.
8. The method according to claim 7, wherein when the number of participating meeting places is greater than the number of display screens, the allocating all the participating meeting places based on the number of display screens and the number of participating meeting places, and determining the correspondence between the participating meeting places and the display screens of the current telepresence meeting place comprises:
acquiring the number of first-size images that can be displayed on the display screen;
allocating the participating meeting places whose meeting place type is a telepresence meeting place, so that the first-size image data of each telepresence meeting place is displayed on the same display screen of the current telepresence meeting place;
determining the remaining number of first-size image displays on the display screens based on the allocation result and the image display number;
and allocating the participating meeting places whose meeting place type is a single-screen meeting place using the remaining image display number.
9. An electronic device, comprising:
a memory and a processor communicatively connected to each other, wherein the memory stores computer instructions and the processor executes the computer instructions so as to perform the audio and video output method for a telepresence meeting place according to any one of claims 1 to 8.
10. A computer-readable storage medium storing computer instructions for causing a computer to execute the audio and video output method for a telepresence meeting place according to any one of claims 1 to 8.
CN202211178189.4A 2022-09-22 2022-09-22 Audio and video output method for presence meeting place, electronic equipment and storage medium Pending CN115550599A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202211178189.4A CN115550599A (en) 2022-09-22 2022-09-22 Audio and video output method for presence meeting place, electronic equipment and storage medium

Publications (1)

Publication Number Publication Date
CN115550599A true CN115550599A (en) 2022-12-30

Family

ID=84730213

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202211178189.4A Pending CN115550599A (en) 2022-09-22 2022-09-22 Audio and video output method for presence meeting place, electronic equipment and storage medium

Country Status (1)

Country Link
CN (1) CN115550599A (en)

Citations (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101534413A (en) * 2009-04-14 2009-09-16 深圳华为通信技术有限公司 System, method and apparatus for remote representation
CN101583011A (en) * 2009-05-27 2009-11-18 深圳华为通信技术有限公司 Video conference control method and system, video conference network equipment and conference places
WO2010033036A1 (en) * 2008-09-17 2010-03-25 Tandberg Telecom As A control system for a local telepresence videoconferencing system and a method for establishing a video conference call
CN101860715A (en) * 2010-05-14 2010-10-13 中兴通讯股份有限公司 Multi-picture synthesis method and system and media processing device
CN101877643A (en) * 2010-06-29 2010-11-03 中兴通讯股份有限公司 Multipoint sound-mixing distant view presenting method, device and system
CN102265613A (en) * 2008-12-23 2011-11-30 坦德伯格电信公司 Method, device and computer program for processing images in conference between plurality of video conferencing terminals
CN102469295A (en) * 2010-10-29 2012-05-23 华为终端有限公司 Conference control method, related equipment and system
US20120218373A1 (en) * 2011-02-28 2012-08-30 Cisco Technology, Inc. System and method for selection of video data in a video conference environment
CN102655584A (en) * 2011-03-04 2012-09-05 中兴通讯股份有限公司 Media data transmitting and playing method and system in tele-presence technology
CN113259314A (en) * 2021-03-30 2021-08-13 海南视联通信技术有限公司 Split screen display method and device, terminal equipment and storage medium

Patent Citations (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2010033036A1 (en) * 2008-09-17 2010-03-25 Tandberg Telecom As A control system for a local telepresence videoconferencing system and a method for establishing a video conference call
CN102217310A (en) * 2008-09-17 2011-10-12 坦德伯格电信公司 A control system for a local telepresence videoconferencing system and a method for establishing a video conference call
CN102265613A (en) * 2008-12-23 2011-11-30 坦德伯格电信公司 Method, device and computer program for processing images in conference between plurality of video conferencing terminals
CN101534413A (en) * 2009-04-14 2009-09-16 深圳华为通信技术有限公司 System, method and apparatus for remote representation
CN101583011A (en) * 2009-05-27 2009-11-18 深圳华为通信技术有限公司 Video conference control method and system, video conference network equipment and conference places
CN101860715A (en) * 2010-05-14 2010-10-13 中兴通讯股份有限公司 Multi-picture synthesis method and system and media processing device
CN101877643A (en) * 2010-06-29 2010-11-03 中兴通讯股份有限公司 Multipoint sound-mixing distant view presenting method, device and system
CN102469295A (en) * 2010-10-29 2012-05-23 华为终端有限公司 Conference control method, related equipment and system
US20120218373A1 (en) * 2011-02-28 2012-08-30 Cisco Technology, Inc. System and method for selection of video data in a video conference environment
CN102655584A (en) * 2011-03-04 2012-09-05 中兴通讯股份有限公司 Media data transmitting and playing method and system in tele-presence technology
CN113259314A (en) * 2021-03-30 2021-08-13 海南视联通信技术有限公司 Split screen display method and device, terminal equipment and storage medium

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination