WO2016101623A1 - Remote interaction method and device in multipoint audio and video communication - Google Patents
Remote interaction method and device in multipoint audio and video communication Download PDFInfo
- Publication number
- WO2016101623A1 WO2016101623A1 PCT/CN2015/086271 CN2015086271W WO2016101623A1 WO 2016101623 A1 WO2016101623 A1 WO 2016101623A1 CN 2015086271 W CN2015086271 W CN 2015086271W WO 2016101623 A1 WO2016101623 A1 WO 2016101623A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- audio
- video
- information
- terminal
- interaction
- Prior art date
Links
Images
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N7/00—Television systems
- H04N7/14—Systems for two-way working
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/20—Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
- H04N21/25—Management operations performed by the server for facilitating the content distribution or administrating data related to end-users or client devices, e.g. end-user or client device authentication, learning user preferences for recommending movies
Definitions
- the present invention relates to the field of communications, and in particular to a method and device for remote interaction in multi-point audio video communication.
- a multipoint audio video communication system includes: a multipoint audio video communication service management system 1 and a multipoint audio and video communication multipoint control process.
- Unit 2 In the system shown in FIG. 1, a multipoint audio and video communication multipoint control processing unit 2 is connected to a plurality of multipoint audio and video communication terminals distributed around the country to realize interaction and communication between the participants.
- FIG. 2 is a schematic diagram showing the structure of a multipoint audio and video communication system having a voting function according to the related art. As shown in FIG. 2, a third party voting system is added to the system shown in FIG.
- the embodiments of the present invention provide a method and a device for remote interaction in multi-point audio and video communication, so as to at least solve the defects caused by the introduction of a third-party interactive system in a multi-point audio and video communication system.
- a remote interaction method in an audio video conference including: the MCU sends an audio and/or video interface corresponding to the current interaction to direct user input to each participating terminal; Each participating terminal supports the user input channel; the conference service management system collects the interactive information fed back by the participating terminals according to the audio and/or video interface; the MCU sends the statistical result obtained by the conference service management system to each participating terminal.
- the MCU Before the MCU sends the audio and/or video interface to each participant terminal, the MCU obtains the pre-stored audio file and the image file corresponding to the interaction, and synthesizes the audio file and the image file separately to form a user input. Audio and video interface.
- the MCU sends the audio and/or video interface input by the user to each participating terminal, and specifically includes: the MCU acquires the type of information supported by each participating terminal, and sends an audio and/or video interface to each participating terminal based on the type of the information.
- the MCU will send the audio and/or video interface input by the user to each participating terminal, and specifically includes: the MCU acquires the type of information supported by each participating terminal, and detects whether each participating terminal sets the type of the received information; If yes, it detects whether the information type set by the corresponding conference terminal belongs to the information type supported by the conference terminal, and if yes, sends an audio or video interface to the corresponding conference terminal based on the set information type; otherwise, according to the information type supported by the conference terminal The participating terminal sends an audio or video interface; if not set, the audio and/or video interface is sent to each participating terminal based on the type of information supported by each participating terminal.
- the MCU sends the statistics to each participant terminal, including: the MCU synthesizes the statistical result into text, audio, and video, and sends text, audio, and/or video to each participating terminal based on the type of information previously supported by each participating terminal. Statistical results.
- the manner in which the MCU obtains the information types supported by the participating terminals includes: the MCU acquires a logical channel established for each participating terminal, and determines the type of information supported by the participating terminal according to the type of the logical channel; the types of the logical channel include: audio, video, and data.
- the MCU will direct the audio and/or video interface input by the user to each participating terminal, and further includes: the MCU sets the audio and/or video interface set to direct the user input together with the data, audio and video streams in the current conference as The media stream format is sent to each participating terminal; the MCU sends the statistics to each participant terminal, and further includes: the MCU encodes the statistical result together with the data, audio, and video streams in the current conference into a media stream format, and then sends the result to each participant. terminal.
- the types of interaction include: voting, scoring, and voting.
- a multipoint control unit MCU comprising: an interaction initiation module, configured to set, in a video conference, an audio and/or video corresponding to the current interaction to guide user input.
- the interface is sent to each participating terminal; wherein each participating terminal supports a user input channel;
- the interactive processing module is configured to receive interactive information fed back by each participating terminal according to the audio and/or video interface, and send the interactive information to the meeting.
- the business management system, and the statistical result obtained by the conference service management system using the interactive information, and the statistical result is sent to each participating terminal.
- the interaction initiation module is further configured to obtain a pre-stored audio file and a picture file corresponding to the interaction, and separately synthesize the audio file and the picture file to form an audio and video interface for guiding user input.
- the interaction initiation module is specifically configured to obtain an information type supported by each conference terminal, and send an audio and/or video interface to each conference terminal based on the information type.
- the interaction initiation module is specifically configured to obtain the type of information supported by each conference terminal, and detect whether each conference terminal sets the type of the received information. If the setting is performed, it is detected whether the information type set by the corresponding conference terminal belongs to the The type of information supported, if yes, sending an audio or video interface to the corresponding conference terminal based on the set information type; otherwise, sending an audio or video interface to the corresponding conference terminal based on the type of information supported by the conference terminal; if not, An audio and/or video interface is sent to each participating terminal based on the type of information supported by each participating terminal.
- the interactive processing module is further configured to synthesize the statistical result into text, audio, and video, and send the statistical result of the text, audio, and/or video to each participating terminal based on the type of information supported by each of the participating terminals.
- the interaction initiation module is further configured to encode the audio and/or video interface set to direct the user input together with the data, audio and video code streams in the current conference as a media stream format, and then send the format to the conference terminal; the interaction processing module further It is set to encode the statistical result together with the data, audio and video streams in the current conference into the media stream format and then send it to each participating terminal.
- a remote interaction system in an audio video conference including: a conference service management system, a conference terminal, and the above-mentioned MCU; the conference terminal is set to receive the data transmitted by the MCU. After the interaction is set to guide the user to input the audio and/or video interface, the interactive operation is performed according to the instructions of the audio and/or video interface, and the interactive information obtained by the operation is sent to the MCU; the conference service management system is set to receive The interactive information sent by each participating terminal sent by the MCU according to the guidance of the audio and/or video interface, and the statistical information is collected, and the statistical result is sent to the MCU.
- a remote interaction method in multi-point audio video communication including: a multi-point audio video communication multi-point control processing unit sets the audio corresponding to the current interaction to be set to guide the user input. And/or the video interface is sent to each participating terminal; wherein each participating terminal supports the user input channel; the multi-point audio and video communication multi-point control processing unit receives the interactive information fed back by each participating terminal according to the instruction of the audio and/or video interface; The multi-point audio and video communication multi-point control processing unit sends the interactive information to the multi-point audio video communication service management system; the multi-point audio and video communication multi-point control processing unit receives the statistical data obtained by the multi-point audio and video communication service management system using the interactive information. As a result, the multipoint audio video communication multipoint control processing unit transmits the statistical result to each participating terminal.
- the multi-point audio video communication multi-point control processing unit sends the audio and/or video interface to each participating terminal, and further includes: a multi-point audio video communication multi-point control processing unit acquires a pre-stored audio file corresponding to the interaction and The image file, and the audio file and the image file are separately synthesized to form an audio and video interface for guiding user input.
- the multi-point audio and video communication multi-point control processing unit sends the audio and/or video interface that directs the user input to each participating terminal, including: the multi-point audio video communication multi-point control processing unit acquires the type of information supported by each participating terminal; The audio video multipoint control processing unit transmits an audio and/or video interface each guiding the user input to each participating terminal based on the type of information supported by each participating terminal.
- the multi-point audio and video communication multi-point control processing unit sends an audio and/or video interface guiding the user input to each participating terminal, including: a multi-point audio video communication multi-point control processing unit acquires information types supported by each participating terminal, and detects Whether each participating terminal sets the type of the received information; when the participating terminal sets the type of the received information, it detects whether the type of information set by the corresponding participating terminal belongs to the type of information it supports, and if so, based on the type of the set information Sending an audio and/or video interface to the corresponding conference terminal; if not, transmitting an audio and/or video interface to the corresponding conference terminal based on the type of information supported by the conference terminal; and/or when the conference terminal does not set the type of the reception information, based on The type of information supported by each participating terminal sends an audio and/or video interface to each participating terminal.
- the multi-point audio and video communication multi-point control processing unit sends the statistical result to each participating terminal, including: a multi-point audio video communication multi-point control processing unit synthesizes the statistical result into text, audio and video, and is based on the previously acquired each participating terminal The type of information supported, sending statistical results of text, audio and/or video to each participating terminal.
- the multi-point audio and video communication multi-point control processing unit acquires the information types supported by the participating terminals, including: the multi-point audio video communication multi-point control processing unit acquires a logical channel established for each participating terminal, and determines the participating terminal according to the type of the logical channel.
- Types of information supported; types of logical channels include: audio, video, and data.
- the multipoint audio video communication multipoint control processing unit transmits an audio and/or video interface directing user input to each of the participating terminals, including: a multipoint audio video communication multipoint control processing unit will be set to direct user input audio and/or
- the video interface is encoded into the media stream format together with the data, audio and video code streams in the current multi-point audio and video communication, and then sent to each participating terminal; the multi-point audio and video communication multi-point control processing unit sends the statistical result to each participating terminal.
- the multi-point audio and video communication multi-point control processing unit encodes the statistical result together with the data, audio and video code streams in the current conference into a media stream format, and then sends the result to each participating terminal.
- the types of interaction include: voting, scoring, and voting.
- the multipoint audio video communication multipoint control processing unit includes a multipoint control unit MCU.
- a multipoint audio video communication multipoint control processing unit including: an interaction initiation module, configured to set a guide corresponding to the interaction in the multipoint audio video communication as a guide
- the audio and/or video interface input by the user is sent to each participating terminal; wherein each participating terminal supports a user input channel;
- the interactive processing module is configured to receive interactive information fed back by each participating terminal according to the audio and/or video interface guidelines. And transmitting the interactive information to the multi-point audio video communication service management system, and receiving the statistical result obtained by the conference service management system using the interactive information, and transmitting the statistical result to each participating terminal.
- the interaction initiation module is further configured to obtain a pre-stored audio file and a picture file corresponding to the interaction, and separately synthesize the audio file and the picture file to form an audio and video interface for guiding user input.
- the interaction initiation module is configured to obtain an information type supported by each participant terminal, and send an audio and/or video interface to each participant terminal based on the information type.
- the interaction initiation module is configured to obtain the type of information supported by each conference terminal, and detect whether each conference terminal sets the type of the received information. If the setting is performed, it is detected whether the information type set by the corresponding conference terminal belongs to the support. The information type, if yes, sends an audio or video interface to the corresponding conference terminal based on the set information type; otherwise, the audio or video interface is sent to the corresponding conference terminal based on the type of information supported by the conference terminal; if not, the The type of information supported by each participating terminal sends an audio and/or video interface to each participating terminal.
- the interactive processing module is further configured to synthesize the statistical result into text, audio, and video, and send the statistical result of the text, audio, and/or video to each participating terminal based on the type of information supported by each of the participating terminals.
- the interaction initiation module is further configured to encode the audio and/or video interface set to direct the user input together with the data, audio and video code streams in the current conference as a media stream format, and then send the format to the conference terminal;
- the management module is further configured to encode the statistical result together with the data, audio and video code streams in the current conference into a media stream format, and then send the result to each conference terminal.
- the multipoint audio video communication includes a voice video conference; the multipoint audio video communication multipoint control processing unit includes a multipoint control unit MCU.
- remote interaction is implemented in a multi-point audio video communication system, thereby eliminating the need to introduce third-party devices, improving operability and reducing costs.
- FIG. 1 is a schematic structural diagram of a multipoint audio video communication system according to the related art
- FIG. 2 is a schematic structural diagram of a multipoint audio video communication system with a voting function according to the related art
- FIG. 3 is a flow chart of a method for remote interaction in multi-point audio video communication according to Embodiment 1 of the present invention
- FIG. 4 is a structural block diagram of a multipoint audio video communication multipoint control processing unit according to Embodiment 1 of the present invention.
- FIG. 5 is a flowchart of a method for remote interaction in another multi-point audio video communication according to Embodiment 1 of the present invention.
- FIG. 6 is a structural block diagram of a multipoint audio video communication service management system according to Embodiment 1 of the present invention.
- FIG. 7 is a flowchart of a method for remote interaction in multi-point audio video communication according to Embodiment 2 of the present invention.
- FIG. 8 is a structural block diagram of a multipoint audio video communication multipoint control processing unit according to Embodiment 2 of the present invention.
- FIG. 9 is a flowchart of a method for remote interaction in multi-point audio video communication according to an alternative embodiment 1 of the present invention.
- FIG. 10 is a flowchart of a method for remote interaction in multi-point audio video communication according to an alternative embodiment 2 of the present invention.
- FIG. 11 is a structural block diagram of a multipoint audio video communication multipoint control processing unit according to an alternative embodiment 3 of the embodiment of the present invention.
- FIG. 12 is a schematic structural diagram of a remote interaction system in distance education according to an alternative embodiment 4 of the embodiment of the present invention.
- FIG. 3 is a flowchart of a method for remote interaction in multi-point audio video communication according to the first embodiment of the present invention, as shown in FIG. The process includes the following steps:
- Step S302 the multipoint audio video communication multipoint control processing unit sends the audio and/or video data carrying the first interactive information to the multipoint audio video communication terminal;
- Step S304 the multi-point audio video communication multi-point control processing unit receives the audio and/or video data that carries the second interaction information that is sent by the multi-point audio video communication terminal according to the first interaction information;
- Step S306 the multipoint audio video communication multipoint control processing unit processes the second interactive information.
- remote interaction is implemented in a multi-point audio video communication system, thereby eliminating the need to introduce third-party devices, improving operability and reducing costs.
- the multi-point audio and video communication multi-point control processing unit processing the second interaction information in the step S306 may include: multi-point audio and video communication multi-point control processing unit to multi-point audio video
- the communication service management system sends the second interaction information, and receives a processing result sent by the multi-point audio video communication service management system according to the second interaction information.
- the processing load of the multipoint audio video communication multipoint control processing unit is reduced.
- the multi-point audio video communication multi-point control processing unit receives the processing result sent by the multi-point audio video communication service management system according to the second interactive information, and may also provide multi-point audio video.
- the communication terminal transmits audio and/or video data carrying the above processing result.
- the multi-point audio video communication multi-point control processing unit sends the audio and/or video data carrying the first interaction information to the multi-point audio video communication terminal, including: multi-point audio video.
- the communication multi-point control processing unit acquires the first interaction information, and the first interaction information and the voice in the conference and/or Or video stream encoding, obtaining audio and/or video data carrying the first interactive information, and transmitting the audio and/or video data carrying the first interactive information to the multi-point audio video communication terminal.
- the information and the interactive information will be encoded and sent to the multi-point audio video communication terminal.
- the multipoint audio video communication multipoint control processing unit sends the audio and/or video data of the bearer processing result to the multipoint audio video communication terminal, including: multipoint audio video communication.
- the multipoint control processing unit encodes the above processing result and the voice and/or video code stream in the conference to obtain audio and/or video data carrying the processing result, and transmits the audio of the bearer processing result to the multipoint audio video communication terminal and/or Or video data.
- a multi-point audio and video communication multi-point control processing unit is further provided, and the unit is configured to implement the above-mentioned embodiments and preferred embodiments, and the detailed description thereof has been omitted.
- the term "module” may implement a combination of software and/or hardware of a predetermined function.
- the multipoint audio video communication multipoint control processing unit includes: a first sending module 410, setting The receiving module 420 is connected to the first sending module 410 and configured to receive the multi-point audio and video communication terminal according to the first interactive information, to send the audio and/or video data that carries the first interactive information to the multi-point audio and video communication terminal.
- the audio and/or video data carrying the second interactive information is connected to the receiving module 420 and configured to process the second interactive information.
- the processing module 430 may include: a sending unit, configured to send second interaction information to the multi-point audio video communication service management system; and the receiving unit is connected to the sending unit, and is configured to Receiving a processing result sent by the multi-point audio video communication service management system according to the second interactive information.
- the multi-point audio video communication multi-point control processing unit further includes: a second sending module, configured to send the audio carrying the processing result to the multi-point audio video communication terminal and/or Video data.
- the first sending module 410 may include: an acquiring unit, configured to acquire first interaction information; and the first encoding unit is connected to the acquiring unit, and is configured to set the first interaction information and The voice and/or video code stream in the conference is encoded to obtain audio and/or video data carrying the first interaction information; the first sending unit is connected to the first coding unit and configured to send the bearer to the multi-point audio video communication terminal.
- the foregoing second sending module may include: a second encoding unit configured to encode the processing result and the voice and/or video code stream in the conference to obtain an audio carrying the processing result. And/or video data; the second transmitting unit is connected to the second encoding unit and configured to transmit the audio and/or video data carrying the processing result to the multi-point audio video communication terminal.
- FIG. 5 is a flowchart of a method for remote interaction in another multi-point audio video communication according to the first embodiment of the present invention. As shown in 5, the process includes the following steps:
- Step S502 the multipoint audio video communication service management system instructs the multipoint audio video communication multipoint control processing unit to send the first interaction information to the multipoint audio video communication terminal;
- Step S504 the multi-point audio video communication service management system receives the second interaction information that is sent by the multi-point audio and video communication terminal sent by the multi-point audio and video communication multi-point control processing unit according to the first interaction information;
- Step S506 the multi-point audio video communication service management system processes the second interaction information.
- the multi-point audio video communication service management system may further indicate multi-point audio and video communication.
- the point control processing unit transmits the processing result to the multipoint audio video communication terminal.
- the first interaction information includes voting information and/or scoring information
- the second interaction information includes: voting selection and/or scoring score
- the processing of the second interactive information by the communication service management system may include: the multi-point audio video communication service management system performs statistical analysis on the voting selection and/or the scoring score to obtain a voting result and/or a scoring result.
- a multi-point audio and video communication service management system is further provided.
- the unit is configured to implement the foregoing embodiments and preferred embodiments, and details are not described herein.
- the term "module” may implement a combination of software and/or hardware of a predetermined function.
- the apparatus described in the following embodiments is preferably implemented in software, hardware, or a combination of software and hardware, is also possible and contemplated.
- FIG. 6 is a structural block diagram of a multipoint audio video communication service management system according to Embodiment 1 of the present invention.
- the system includes: a first indication module 610, configured to indicate multipoint audio and video communication multipoint control processing.
- the unit sends the first interaction information to the multi-point audio and video communication terminal;
- the receiving module 620 is connected to the first indication module 610, and is configured to receive the multi-point audio and video communication multi-point control processing unit to send the multi-point audio and video communication terminal according to the first
- the processing module 630 is connected to the receiving module 620 and configured to process the second interactive information.
- the multi-point audio video communication service management system further includes: a second indication module, configured to instruct the multi-point audio and video communication multi-point control processing unit to the multi-point audio and video communication terminal Send the processing result.
- the first interaction information includes voting information and/or scoring information
- the second interaction information includes at least one of: voting selection and/or scoring score
- the processing module is configured to The multi-point audio video communication service management system performs statistical analysis on voting selection and/or scoring scores to obtain voting results and/or scoring results.
- FIG. 7 is a flowchart of a method for remote interaction in multi-point audio video communication according to Embodiment 2 of the present invention, as shown in FIG. The process includes the following steps:
- Step S702 the multi-point audio video communication multi-point control processing unit sends an audio and/or video interface corresponding to the current interaction, which is set to guide the user input, to each participating terminal; wherein each participating terminal supports the user input channel;
- Step S704 the multi-point audio video communication multi-point control processing unit receives the interaction information fed back by each participating terminal according to the instruction of the audio and/or video interface;
- Step S706 the multi-point audio video communication multi-point control processing unit sends the interaction information to the multi-point audio video communication service management system;
- Step S708 the multipoint audio and video communication multipoint control processing unit receives the statistical result obtained by using the interactive information by the multipoint audio and video communication service management system;
- Step S710 the multi-point audio video communication multi-point control processing unit sends the statistical result to each participating terminal.
- the multi-point audio video communication multi-point control processing unit sends the audio and/or video interface to each participating terminal, and further includes: multi-point audio video communication multi-point control processing unit acquiring The audio file and the image file corresponding to the current interaction are pre-stored, and the audio file and the image file are separately synthesized to form an audio and video interface for guiding user input.
- the multi-point audio video communication multi-point control processing unit sends an audio and/or video interface that directs user input to each participating terminal, including: multi-point audio and video communication.
- the control processing unit acquires the type of information supported by each participating terminal; the multi-point audio and video multi-point control processing unit transmits the audio and/or video interfaces each guiding the user input to each participating terminal based on the type of information supported by each participating terminal.
- the multi-point audio video communication multi-point control processing unit sends an audio and/or video interface that directs user input to each participating terminal, including: multi-point audio video communication multi-point control processing.
- the unit acquires the type of information supported by each participating terminal, and detects whether each participating terminal sets the type of the received information; when the participating terminal sets the type of the received information, it detects whether the type of information set by the corresponding participating terminal belongs to its support.
- the type of information if yes, sending an audio and/or video interface to the corresponding participant terminal based on the set information type; if not, transmitting an audio and/or video interface to the corresponding participant terminal based on the type of information supported by the participant terminal; and/or when When the participating terminal does not set the type of the received information, the audio and/or video interface is transmitted to each participating terminal based on the type of information supported by each participating terminal.
- the multi-point audio video communication multi-point control processing unit sends the statistical result to each participating terminal, including: a multi-point audio video communication multi-point control processing unit synthesizes the statistical result into text and audio. And video, and based on the type of information supported by the respective participating terminals, the statistical results of text, audio and/or video are sent to the participating terminals.
- the multi-point audio video communication multi-point control processing unit acquires the information type supported by each participating terminal, including: multi-point audio video communication multi-point control processing unit acquisition is established for each participating terminal
- the logical channel determines the type of information supported by the participant terminal according to the type of the logical channel; the types of the logical channel include: audio, video, and data.
- the multi-point audio video communication multi-point control processing unit sends an audio and/or video interface that directs user input to each participating terminal, including: multi-point audio video communication multi-point control processing.
- the unit is configured to direct the audio and/or video interface input by the user to be encoded into the media stream format together with the data, audio and video code streams in the current multi-point audio video communication, and then sent to each participating terminal; multi-point audio and video communication is multi-point
- the control processing unit sends the statistical result to each participating terminal, including: the multi-point audio video communication multi-point control processing unit encodes the statistical result together with the data, audio and video code streams in the current conference into a media stream format, and then sends the result to each participant. terminal.
- the types of interaction include: voting, scoring, and voting.
- the multipoint audio video communication includes an audio video conference;
- the multipoint audio video communication multipoint control processing unit includes a multipoint control unit MCU.
- a multi-point audio and video communication multi-point control processing unit is further provided, and the unit is configured to implement the above-mentioned embodiments and preferred embodiments, and the detailed description thereof has been omitted.
- the term "module” may implement a combination of software and/or hardware of a predetermined function.
- the method includes: an interaction initiation module 810, which is configured to be in a multipoint audio and video communication.
- the interaction corresponding to the secondary interaction is to send the audio and/or video interface input by the user to each participating terminal; wherein each participating terminal supports the user input channel;
- the interaction processing module 820 is configured to receive the respective terminal according to the audio and/or video.
- the interactive information fed back by the interface, and the interactive information is sent to the multi-point audio and video communication service management system, and the statistical result obtained by the conference service management system using the interactive information is collected, and the statistical result is sent to each participating terminal.
- the interaction initiation module 810 is further configured to acquire a pre-stored audio file and a picture file corresponding to the current interaction, and separately synthesize the audio file and the image file for formation. To guide the user input audio and video interface.
- the interaction initiation module 810 is configured to obtain an information type supported by each participant terminal, and send an audio and/or video interface to each participant terminal based on the information type.
- the interaction initiation module 810 is configured to acquire information types supported by the conference terminals, and detect whether each conference terminal sets the type of the received information, and if configured, Then, it is detected whether the information type set by the corresponding conference terminal belongs to the information type supported by the conference terminal, and if yes, the audio or video interface is sent to the corresponding conference terminal based on the set information type; otherwise, the information type supported by the conference terminal is used to correspond to the conference terminal.
- the interaction processing module 820 is further configured to synthesize the statistical result into text, audio, and video, and send the message to each participating terminal based on the type of information supported by each participating terminal. Statistical results for text, audio, and/or video.
- the interaction initiation module 810 is further configured to encode the audio and/or video interface set to direct user input with the data, audio, and video code streams in the current conference as the media.
- the stream format is sent to each participant terminal.
- the interaction processing module 820 is further configured to encode the statistical result together with the data, audio and video code streams in the current conference into a media stream format, and then send the result to each conference terminal.
- the multipoint audio video communication includes a voice video conference;
- the multipoint audio video communication multipoint control processing unit includes a multipoint control unit MCU.
- multi-point audio video communication may include various types of communication, such as video conferencing, voice conferencing, etc., but is not limited thereto.
- video conferencing voice conferencing
- voice conferencing voice conferencing
- multi-point audio video communication may include various types of communication, such as video conferencing, voice conferencing, etc., but is not limited thereto.
- the following describes an optional implementation manner of the embodiment of the present invention by taking an audio and video conference as an example.
- an interaction method in an audio video conference is provided.
- the audio and/or video interface input by the user is used as an example to describe an example.
- the method includes the following steps:
- Step S902 the multi-point audio and video communication multi-point control processing unit sends an audio and/or video interface corresponding to the current interaction, which is set to guide the user input, to each of the participating multi-point audio and video communication terminals;
- the types of interactions mentioned above include, but are not limited to, voting, scoring, and voting.
- each of the multi-point audio and video communication terminals supports the user input channel; specifically, the multi-point audio and video communication multi-point control processing unit opens the user input channel for each participating terminal when the participating terminals join the conference, so that Each participating terminal supports the user input channel, thereby providing technical support for the secondary input of the end user.
- the manner of obtaining the audio and/or video interface that is set to direct the user input includes: the multi-point audio video communication multi-point control processing unit acquires the pre-stored audio file and the image file corresponding to the current interaction, and The audio file and the image file are separately synthesized to form an audio and video interface for guiding user input.
- the multi-point audio and video communication multi-point control processing unit delivers audio, video or audio and video to each participating terminal, which is determined by the type of information supported by the participating terminal. For example, if the participating terminal only supports audio or video, Then send an audio or video interface to it, and if the participating terminal supports audio and video, send an audio and video interface to it.
- the manner in which the multi-point audio and video communication multi-point control processing unit acquires the information types supported by the participating terminals includes, but is not limited to, multi-point audio video communication multi-point control processing unit acquires a logical channel established for each participating terminal, according to logic
- the type of channel determines the type of information supported by the participating terminal; the types of the logical channel include: audio, video, and data.
- the multi-point audio and video communication multi-point control processing unit acquires the information types supported by the participating terminals, and detects whether each participating terminal sets the type of the received information;
- the setting is made, it is detected whether the information type set by the corresponding conference terminal belongs to the information type supported by the conference terminal, and if yes, the audio or video interface is sent to the corresponding conference terminal based on the set information type; otherwise, based on the conference terminal support
- the information type sends an audio or video interface to the corresponding conference terminal;
- an audio and/or video interface is transmitted to each participating terminal based on the type of information supported by each participating terminal.
- Step S904 the multi-point audio video communication service management system collects interactive information that each participating terminal feeds back according to the guidance of the audio and/or video interface;
- Step S906 the multipoint audio video communication multipoint control processing unit sends the statistical result obtained by the multipoint audio video communication service management system to each participating terminal.
- the multi-point audio video communication multi-point control processing unit synthesizes the statistical result into text, audio and video, and sends text, audio and/or video to each participating terminal based on the type of information supported by each participating terminal. Statistical results.
- the network resources are saved, and the information of the interaction between the multipoint audio and video communication multipoint control processing unit and each participating terminal and the data in the conference,
- the audio and video streams are encoded together into a media stream format for interaction.
- the specific performance is:
- the multi-point audio video communication multi-point control processing unit encodes the audio and/or video interface set to direct the user input together with the data, audio and video code streams in the current conference into a media stream format and transmits them to each participating terminal.
- the multi-point audio and video communication multi-point control processing unit encodes the statistical result together with the data, audio and video code streams in the current conference into a media stream format, and then transmits the result to each participating terminal.
- the conference end users can vote as if they are on the scene.
- the multipoint audio video communication may be a television video conference
- the multipoint audio video communication multipoint control processing unit may be an MCU.
- an interaction method in an audio video conference includes the following steps:
- step S1002 the conference administrator selects a terminal list in the multi-point audio video communication service management system to hold a video conference.
- the multi-point audio video communication service management system notification protocol and the channel control module hold a video conference, and call each participating terminal to join the conference.
- the user input channel (UserInput) is supported during the capability negotiation and logical channel opening process.
- the user input channel By supporting the user input channel, the user provides the necessary technical support according to the instruction input during the interaction.
- step S1004 the service management system judges whether the voting, scoring, or voting process needs to be enabled according to the needs of the meeting site. If not, the meeting maintains the normal mode to continue, and if necessary, enters the voting, scoring, or voting process, and proceeds to step S1006. .
- Step S1006 The multi-point audio video communication multi-point control processing unit separately synthesizes the already saved audio file and the picture file to form an audio and video interface (ie, a user interface UI of the video) that prompts the user to input.
- an audio and video interface ie, a user interface UI of the video
- Step S1008 The multipoint audio video communication multipoint control processing unit encodes the audio and video interface together with the data, audio and video code streams in the current conference into a media stream format.
- Step S1010 The multipoint audio video communication multipoint control processing unit sends the audio and video code streams to the conference terminal;
- the video channel will not be opened in step S1002, and therefore the video media stream will not be sent to the corresponding conference terminal.
- step S1012 the participating terminal inputs its own voting, scoring or voting data and submits it according to the guidance of the received audio and/or video interface, using a remote controller, a keyboard, or the like.
- the operation according to the guidance of the received audio and/or video interface is an interactive audio video response (IVR or IVVR) technology.
- IVR interactive audio video response
- step S1014 the multipoint audio and video communication multipoint control processing unit receives the voting, scoring or voting data submitted by all the participating terminals, and submits the data to the multipoint audio and video communication service management system for decision.
- step S1016 the multi-point audio video communication service management system aggregates all the voting, scoring or voting data, makes a decision according to the user's will, and sends the summary result to the multi-point audio and video communication multi-point control processing unit.
- step S1018 the multi-point audio video communication multi-point control processing unit sends the result of voting, scoring or voting to the participating terminal by synthesizing text, audio and video. Among them, for audio conferences and terminals that only support audio capabilities, only audio is sent to the corresponding conference terminal to inform the result.
- step S1020 after the voting, scoring or voting process ends, the conference continues in the normal mode. As long as the voting process needs to be performed in the conference, the process proceeds to the above process until the conference ends.
- the solution according to the embodiment of the present invention is based on the original networking of the audio video conferencing system, and helps the user to vote and score in the scene of multimedia remote field access such as conference, interview, and remote interaction. And express their own opinions and realize the interaction in audio and video conferences.
- the solution in the embodiment of the present invention is simple to implement, and does not need to change the current conference system networking of the user, and does not need to add a traditional third-party voting system, and does not need to manually count and vote for the members in the conference.
- the system uses interactive audio and video response (IVR, IVVR) to allow users to easily vote and score.
- the system displays the automatic statistics and calculation results and presents them to the user in multimedia mode.
- An embodiment of the present invention provides a multipoint audio and video communication multipoint control processing unit, as shown in FIG.
- the interaction initiation module 1110 is configured to, in the multi-point audio video communication, send an audio and/or video interface corresponding to the current interaction to guide the user input to each communication terminal; wherein each communication terminal supports the user input channel ;
- the interaction processing module 1120 is configured to receive interaction information that each communication terminal feeds back according to the instruction of the audio and/or video interface, and send the interaction information to the multi-point audio video communication service management system, and receive the multi-point audio video communication service.
- the management system uses the interaction information to perform statistics and obtains the statistical result, and sends the statistical result to each communication terminal.
- the interaction initiation module 1110 forms an audio and video interface for guiding user input by acquiring an audio file and a picture file corresponding to the current interaction and synthesizing the audio file and the picture file respectively.
- the interaction initiation module 1110 acquires an information type supported by each communication terminal, and sends an audio and/or video interface to each communication terminal based on the information type; or acquires an information type supported by each communication terminal, and detects whether each communication terminal receives the information. Setting the type of information, if it is set, detecting whether the type of information set by the corresponding communication terminal belongs to the type of information it supports, and if so, sending an audio or video interface to the corresponding communication terminal based on the set information type; Otherwise, an audio or video interface is sent to the corresponding communication terminal based on the type of information supported by the communication terminal; if not, the audio and/or video interface is transmitted to each communication terminal based on the type of information supported by each communication terminal.
- the interaction processing module 1120 synthesizes the statistical result into text, audio, and video, and transmits the statistical result of the text, audio, and/or video to each communication terminal based on the type of information supported by each of the previously acquired communication terminals.
- the network resource is saved, and the information of the multipoint audio and video communication multipoint control processing unit and each communication terminal and the data in the conference are exchanged.
- the audio and video streams are encoded together into a media stream format for interaction.
- the specific performance is:
- the interaction initiation module 1110, the audio and/or video interface set to direct the user input is encoded into the media stream format together with the data, audio and video code streams in the current communication, and then sent to each communication terminal;
- the interaction processing module 1120 encodes the statistical result together with the data, audio and video code streams in the current communication into a media stream format, and then sends the result to each communication terminal.
- the multi-point audio and video communication multi-point control processing unit in the embodiment does not change the user's conference system networking, and does not change the user's original usage habits of the conference system.
- the multi-point audio and video communication multi-point control processing unit described can enable the communication terminal user to have an interactive experience of voting, scoring or voting as if it were on-site.
- the embodiment of the present invention provides a remote interaction system in distance education, as shown in FIG. 12, including: a multi-point audio and video communication service management system 1210, a multi-point audio and video communication terminal 1220, and multiple points of the optional embodiment 3. Audio and video communication multipoint control processing unit 1230; wherein:
- the remote multi-point audio video communication terminal 1220 is configured to, after receiving the audio and/or video interface corresponding to the current interaction sent by the multi-point audio video communication multi-point control processing unit 1230 to guide the user input, according to the audio and / or the interface of the video interface to perform an interactive operation, and the interactive information obtained by the operation is sent to the multi-point audio video communication multi-point control processing unit 1230;
- the multi-point audio video communication service management system 1210 is configured to receive the interaction information that each remote terminal sends back according to the guidance of the audio and/or video interface sent by the multi-point audio and video communication multi-point control processing unit 630, and the interaction information is After the statistics are performed, the statistical results are sent to the multipoint audio video communication multipoint control processing unit 1230.
- the system in this embodiment does not change the conference system of the user, does not change the original usage habits of the user to the conference system, and uses the multipoint audio and video communication described in this embodiment.
- the control processing unit enables the participating end users to have an interactive experience of voting, scoring or voting as if they were on the spot.
- modules or steps of the present invention described above can be implemented by a general-purpose computing device that can be centralized on a single computing device or distributed across a network of multiple computing devices. Alternatively, they may be implemented by program code executable by the computing device such that they may be stored in the storage device by the computing device and, in some cases, may be different from the order herein.
- the steps shown or described are performed, or they are separately fabricated into individual integrated circuit modules, or a plurality of modules or steps thereof are fabricated as a single integrated circuit module.
- the invention is not limited to any specific combination of hardware and software.
- the method and device for remote interaction in multi-point audio and video communication provided by the embodiments of the present invention have the following beneficial effects: solving the related art, introducing a third-party interactive system to implement an interaction in a multi-point audio and video communication system The resulting defect enables remote interaction in a multi-point audio video communication system, eliminating the need to introduce third-party devices, increasing operability and reducing costs.
Landscapes
- Engineering & Computer Science (AREA)
- Multimedia (AREA)
- Signal Processing (AREA)
- Databases & Information Systems (AREA)
- Two-Way Televisions, Distribution Of Moving Picture Or The Like (AREA)
- Telephonic Communication Services (AREA)
Abstract
Disclosed are a remote interaction method and device in multipoint audio and video communication. The remote interaction method in multipoint audio and video communication comprises: a multipoint control processing unit in multipoint audio and video communication sends, to each participant terminal, an audio interface and/or a video interface corresponding to current interaction and set to guide user input, each participant terminal supporting a user input channel; a service management system in multipoint audio and video communication performs statistics on interaction information fed back by each participant terminal under the guidance of the audio interface and/or the video interface; and the multipoint control processing unit in multipoint audio and video communication sends a statistics result obtained by the service management system in multipoint audio and video communication to each participant terminal. By means of the present invention, remote interaction is implemented in a multipoint audio and video communication system, so that a third-party device does not need to be introduced, operability is improved, and costs are reduced.
Description
本发明涉及通信领域,具体而言,涉及一种多点音频视频通信中远程互动的方法及设备。The present invention relates to the field of communications, and in particular to a method and device for remote interaction in multi-point audio video communication.
随着IP网络和多媒体通信技术的快速发展,以及全球环保低碳的理念的影响,用户已经不需要亲临现场就可以通过音频视频多媒体技术实现的远程通信参与到会议、决策等,这使得音频视频远程通信在各行各业中得到了广泛的应用。With the rapid development of IP networks and multimedia communication technologies, and the impact of the global concept of environmentally friendly low carbon, users can participate in conferences, decision-making, etc. by remote communication via audio and video multimedia technology without having to visit the site. Remote communication has been widely used in various industries.
会议电视系统、远程教育、远程医疗就是比较典型的应用。图1是根据相关技术的多点音频视频通信系统的结构示意图,如图1所示,多点音频视频通信系统包括:多点音频视频通信业务管理系统1和多点音频视频通信多点控制处理单元2。在如图1所示的系统中,通过多点音频视频通信多点控制处理单元2连接分布各地的多个多点音频视频通信终端,实现与会人员的互动和沟通。Conference TV systems, distance education, and telemedicine are typical applications. 1 is a schematic structural diagram of a multipoint audio video communication system according to the related art. As shown in FIG. 1, a multipoint audio video communication system includes: a multipoint audio video communication service management system 1 and a multipoint audio and video communication multipoint control process. Unit 2. In the system shown in FIG. 1, a multipoint audio and video communication multipoint control processing unit 2 is connected to a plurality of multipoint audio and video communication terminals distributed around the country to realize interaction and communication between the participants.
在现实应用中,例如远程教育和远程培训,经常会用到调查问卷,小测验之类的使用场景。而对于一些需要用户进行投票、打分、表决的使用场景下,大多数情况都是需要用户亲临现场,对用户的差旅费和时间都是很大的浪费。In real-world applications, such as distance education and distance training, questionnaires, quizzes and other usage scenarios are often used. For some usage scenarios that require users to vote, score, and vote, most of the cases require the user to visit the site, which is a great waste of travel expenses and time.
为了实现远程的投票、打分、表决,有必要增加表决功能。图2是根据相关技术的具有表决功能的多点音频视频通信系统的结构示意图,如图2所示,在如图1所示的系统中增加第三方的表决系统。In order to achieve remote voting, scoring, voting, it is necessary to increase the voting function. 2 is a schematic diagram showing the structure of a multipoint audio and video communication system having a voting function according to the related art. As shown in FIG. 2, a third party voting system is added to the system shown in FIG.
上述通过引入第三方表决系统的方案存在一定的缺陷:一方面,需要改变多点音频视频通信系统的组网,在资源、配置等各个方面均需要匹配设置,专业性要求较高,对其使用效率产生影响;另一方面,购入第三方表决系统也会增大资金投入,不便于广泛推广。The above schemes for introducing a third-party voting system have certain defects: on the one hand, it is necessary to change the networking of the multi-point audio and video communication system, and matching settings are required in various aspects such as resources and configuration, and the professional requirements are high, and the use thereof is used. Efficiency has an impact; on the other hand, the purchase of a third-party voting system will also increase capital investment, which is not convenient for widespread promotion.
发明内容Summary of the invention
本发明实施例提供了一种多点音频视频通信中远程互动的方法及设备,以至少解决相关技术中在多点音频视频通信系统中引入第三方交互系统实现交互所导致的缺陷。
The embodiments of the present invention provide a method and a device for remote interaction in multi-point audio and video communication, so as to at least solve the defects caused by the introduction of a third-party interactive system in a multi-point audio and video communication system.
根据本发明的一个实施例,提供了一种音频视频会议中的远程互动方法,包括:MCU将与本次互动对应的设置为指引用户输入的音频和/或视频界面发送至各与会终端;其中,各与会终端均支持用户输入通道;会议业务管理系统统计各与会终端根据音频和/或视频界面的指引而反馈的互动信息;MCU将会议业务管理系统得到的统计结果发送至各与会终端。According to an embodiment of the present invention, a remote interaction method in an audio video conference is provided, including: the MCU sends an audio and/or video interface corresponding to the current interaction to direct user input to each participating terminal; Each participating terminal supports the user input channel; the conference service management system collects the interactive information fed back by the participating terminals according to the audio and/or video interface; the MCU sends the statistical result obtained by the conference service management system to each participating terminal.
MCU将音频和/或视频界面发送至各与会终端前包括:MCU获取预先存储的与本次互动对应的音频文件和图片文件,并对音频文件和图片文件分别进行合成,形成用以指引用户输入的音频和视频界面。Before the MCU sends the audio and/or video interface to each participant terminal, the MCU obtains the pre-stored audio file and the image file corresponding to the interaction, and synthesizes the audio file and the image file separately to form a user input. Audio and video interface.
MCU将指引用户输入的音频和/或视频界面发送至各与会终端,具体包括:MCU获取各与会终端支持的信息类型,并基于该信息类型向各与会终端发送音频和/或视频界面。The MCU sends the audio and/or video interface input by the user to each participating terminal, and specifically includes: the MCU acquires the type of information supported by each participating terminal, and sends an audio and/or video interface to each participating terminal based on the type of the information.
MCU将指引用户输入的音频和/或视频界面发送至各与会终端,具体包括:MCU获取各与会终端支持的信息类型,并检测各与会终端是否对接收信息的类型进行设定;若进行了设定,则检测对应与会终端设定的信息类型是否属于其支持的信息类型,若是,则基于设定的信息类型向对应与会终端发送音频或视频界面;否则,基于与会终端支持的信息类型向对应与会终端发送音频或视频界面;若未进行设定,则基于各与会终端支持的信息类型,向各与会终端发送音频和/或视频界面。The MCU will send the audio and/or video interface input by the user to each participating terminal, and specifically includes: the MCU acquires the type of information supported by each participating terminal, and detects whether each participating terminal sets the type of the received information; If yes, it detects whether the information type set by the corresponding conference terminal belongs to the information type supported by the conference terminal, and if yes, sends an audio or video interface to the corresponding conference terminal based on the set information type; otherwise, according to the information type supported by the conference terminal The participating terminal sends an audio or video interface; if not set, the audio and/or video interface is sent to each participating terminal based on the type of information supported by each participating terminal.
MCU将统计结果发送至各与会终端,具体包括:MCU将统计结果合成文字、音频和视频,并基于在先获取的各与会终端支持的信息类型,向各与会终端发送文字、音频和/或视频的统计结果。The MCU sends the statistics to each participant terminal, including: the MCU synthesizes the statistical result into text, audio, and video, and sends text, audio, and/or video to each participating terminal based on the type of information previously supported by each participating terminal. Statistical results.
MCU获取各与会终端支持的信息类型的方式包括:MCU获取为各与会终端建立的逻辑通道,根据逻辑通道的类型确定与会终端支持的信息类型;逻辑通道的类型包括:音频、视频和数据。The manner in which the MCU obtains the information types supported by the participating terminals includes: the MCU acquires a logical channel established for each participating terminal, and determines the type of information supported by the participating terminal according to the type of the logical channel; the types of the logical channel include: audio, video, and data.
MCU将指引用户输入的音频和/或视频界面发送至各与会终端,进一步包括:MCU将设置为指引用户输入的音频和/或视频界面与当前会议中的数据、音频及视频码流一同编码为媒体流格式后发送至各与会终端;MCU将统计结果发送至各与会终端,进一步包括:MCU将统计结果与当前会议中的数据、音频及视频码流一同编码为媒体流格式后发送至各与会终端。The MCU will direct the audio and/or video interface input by the user to each participating terminal, and further includes: the MCU sets the audio and/or video interface set to direct the user input together with the data, audio and video streams in the current conference as The media stream format is sent to each participating terminal; the MCU sends the statistics to each participant terminal, and further includes: the MCU encodes the statistical result together with the data, audio, and video streams in the current conference into a media stream format, and then sends the result to each participant. terminal.
互动的类型包括:投票、打分和表决。
The types of interaction include: voting, scoring, and voting.
根据本发明的另一个实施例,提供了一种多点控制单元MCU,包括:互动发起模块,设置为在视频会议中,将与本次互动对应的设置为指引用户输入的音频和/或视频界面发送至各与会终端;其中,各与会终端均支持用户输入通道;互动处理模块,设置为接收各与会终端根据音频和/或视频界面的指引而反馈的互动信息,并将互动信息发送至会议业务管理系统,以及接收会议业务管理系统利用互动信息进行统计得到的统计结果,将统计结果发送至各与会终端。According to another embodiment of the present invention, there is provided a multipoint control unit MCU, comprising: an interaction initiation module, configured to set, in a video conference, an audio and/or video corresponding to the current interaction to guide user input. The interface is sent to each participating terminal; wherein each participating terminal supports a user input channel; the interactive processing module is configured to receive interactive information fed back by each participating terminal according to the audio and/or video interface, and send the interactive information to the meeting. The business management system, and the statistical result obtained by the conference service management system using the interactive information, and the statistical result is sent to each participating terminal.
互动发起模块,还设置为获取预先存储的与本次互动对应的音频文件和图片文件,并对音频文件和图片文件分别进行合成,形成用以指引用户输入的音频和视频界面。The interaction initiation module is further configured to obtain a pre-stored audio file and a picture file corresponding to the interaction, and separately synthesize the audio file and the picture file to form an audio and video interface for guiding user input.
互动发起模块,具体设置为获取各与会终端支持的信息类型,并基于该信息类型向各与会终端发送音频和/或视频界面。The interaction initiation module is specifically configured to obtain an information type supported by each conference terminal, and send an audio and/or video interface to each conference terminal based on the information type.
互动发起模块,具体设置为获取各与会终端支持的信息类型,并检测各与会终端是否对接收信息的类型进行设定,若进行了设定,则检测对应与会终端设定的信息类型是否属于其支持的信息类型,若是,则基于设定的信息类型向对应与会终端发送音频或视频界面;否则,基于与会终端支持的信息类型向对应与会终端发送音频或视频界面;若未进行设定,则基于各与会终端支持的信息类型,向各与会终端发送音频和/或视频界面。The interaction initiation module is specifically configured to obtain the type of information supported by each conference terminal, and detect whether each conference terminal sets the type of the received information. If the setting is performed, it is detected whether the information type set by the corresponding conference terminal belongs to the The type of information supported, if yes, sending an audio or video interface to the corresponding conference terminal based on the set information type; otherwise, sending an audio or video interface to the corresponding conference terminal based on the type of information supported by the conference terminal; if not, An audio and/or video interface is sent to each participating terminal based on the type of information supported by each participating terminal.
互动处理模块,进一步设置为将统计结果合成文字、音频和视频,并基于在先获取的各与会终端支持的信息类型,向各与会终端发送文字、音频和/或视频的统计结果。The interactive processing module is further configured to synthesize the statistical result into text, audio, and video, and send the statistical result of the text, audio, and/or video to each participating terminal based on the type of information supported by each of the participating terminals.
互动发起模块,进一步设置为将设置为指引用户输入的音频和/或视频界面与当前会议中的数据、音频及视频码流一同编码为媒体流格式后发送至各与会终端;互动处理模块,进一步设置为将统计结果与当前会议中的数据、音频及视频码流一同编码为媒体流格式后发送至各与会终端。The interaction initiation module is further configured to encode the audio and/or video interface set to direct the user input together with the data, audio and video code streams in the current conference as a media stream format, and then send the format to the conference terminal; the interaction processing module further It is set to encode the statistical result together with the data, audio and video streams in the current conference into the media stream format and then send it to each participating terminal.
根据本发明的再一个实施例,提供了一种音频视频会议中的远程互动系统,包括:会议业务管理系统、与会终端、以及上述的MCU;与会终端,设置为在接收到MCU发送的与本次互动对应的设置为指引用户输入的音频和/或视频界面后,根据音频和/或视频界面的指引进行互动操作,并将操作得到的互动信息发送至MCU;会议业务管理系统,设置为接收MCU发送的各与会终端根据音频和/或视频界面的指引而反馈的互动信息,并对互动信息进行统计后,将统计结果发送至MCU。
According to still another embodiment of the present invention, a remote interaction system in an audio video conference is provided, including: a conference service management system, a conference terminal, and the above-mentioned MCU; the conference terminal is set to receive the data transmitted by the MCU. After the interaction is set to guide the user to input the audio and/or video interface, the interactive operation is performed according to the instructions of the audio and/or video interface, and the interactive information obtained by the operation is sent to the MCU; the conference service management system is set to receive The interactive information sent by each participating terminal sent by the MCU according to the guidance of the audio and/or video interface, and the statistical information is collected, and the statistical result is sent to the MCU.
根据本发明的再一个实施例,提供了一种多点音频视频通信中远程互动方法,包括:多点音频视频通信多点控制处理单元将与本次互动对应的设置为指引用户输入的音频和/或视频界面发送至各与会终端;其中,各与会终端均支持用户输入通道;多点音频视频通信多点控制处理单元接收各与会终端根据音频和/或视频界面的指引而反馈的互动信息;多点音频视频通信多点控制处理单元将互动信息发送至多点音频视频通信业务管理系统;多点音频视频通信多点控制处理单元接收多点音频视频通信业务管理系统利用互动信息进行统计得到的统计结果;多点音频视频通信多点控制处理单元将统计结果发送至各与会终端。According to still another embodiment of the present invention, a remote interaction method in multi-point audio video communication is provided, including: a multi-point audio video communication multi-point control processing unit sets the audio corresponding to the current interaction to be set to guide the user input. And/or the video interface is sent to each participating terminal; wherein each participating terminal supports the user input channel; the multi-point audio and video communication multi-point control processing unit receives the interactive information fed back by each participating terminal according to the instruction of the audio and/or video interface; The multi-point audio and video communication multi-point control processing unit sends the interactive information to the multi-point audio video communication service management system; the multi-point audio and video communication multi-point control processing unit receives the statistical data obtained by the multi-point audio and video communication service management system using the interactive information. As a result, the multipoint audio video communication multipoint control processing unit transmits the statistical result to each participating terminal.
多点音频视频通信多点控制处理单元将音频和/或视频界面发送至各与会终端之前,还包括:多点音频视频通信多点控制处理单元获取预先存储的与本次互动对应的音频文件和图片文件,并对音频文件和图片文件分别进行合成,形成用以指引用户输入的音频和视频界面。The multi-point audio video communication multi-point control processing unit sends the audio and/or video interface to each participating terminal, and further includes: a multi-point audio video communication multi-point control processing unit acquires a pre-stored audio file corresponding to the interaction and The image file, and the audio file and the image file are separately synthesized to form an audio and video interface for guiding user input.
多点音频视频通信多点控制处理单元将指引用户输入的音频和/或视频界面发送至各与会终端,包括:多点音频视频通信多点控制处理单元获取各与会终端支持的信息类型;多点音频视频多点控制处理单元基于各与会终端支持的信息类型向各将指引用户输入的音频和/或视频界面发送至各与会终端。The multi-point audio and video communication multi-point control processing unit sends the audio and/or video interface that directs the user input to each participating terminal, including: the multi-point audio video communication multi-point control processing unit acquires the type of information supported by each participating terminal; The audio video multipoint control processing unit transmits an audio and/or video interface each guiding the user input to each participating terminal based on the type of information supported by each participating terminal.
多点音频视频通信多点控制处理单元将指引用户输入的音频和/或视频界面发送至各与会终端,包括:多点音频视频通信多点控制处理单元获取各与会终端支持的信息类型,并检测各与会终端是否对接收信息的类型进行设定;当与会终端设定了接收信息的类型时,检测对应与会终端设定的信息类型是否属于其支持的信息类型,若是,基于设定的信息类型向对应与会终端发送音频和/或视频界面;若否,基于与会终端支持的信息类型向对应与会终端发送音频和/或视频界面;和/或当与会终端未设定接收信息的类型时,基于各与会终端支持的信息类型,向各与会终端发送音频和/或视频界面。The multi-point audio and video communication multi-point control processing unit sends an audio and/or video interface guiding the user input to each participating terminal, including: a multi-point audio video communication multi-point control processing unit acquires information types supported by each participating terminal, and detects Whether each participating terminal sets the type of the received information; when the participating terminal sets the type of the received information, it detects whether the type of information set by the corresponding participating terminal belongs to the type of information it supports, and if so, based on the type of the set information Sending an audio and/or video interface to the corresponding conference terminal; if not, transmitting an audio and/or video interface to the corresponding conference terminal based on the type of information supported by the conference terminal; and/or when the conference terminal does not set the type of the reception information, based on The type of information supported by each participating terminal sends an audio and/or video interface to each participating terminal.
多点音频视频通信多点控制处理单元将统计结果发送至各与会终端,包括:多点音频视频通信多点控制处理单元将统计结果合成文字、音频和视频,并基于在先获取的各与会终端支持的信息类型,向各与会终端发送文字、音频和/或视频的统计结果。The multi-point audio and video communication multi-point control processing unit sends the statistical result to each participating terminal, including: a multi-point audio video communication multi-point control processing unit synthesizes the statistical result into text, audio and video, and is based on the previously acquired each participating terminal The type of information supported, sending statistical results of text, audio and/or video to each participating terminal.
多点音频视频通信多点控制处理单元获取各与会终端支持的信息类型的方式包括:多点音频视频通信多点控制处理单元获取为各与会终端建立的逻辑通道,根据逻辑通道的类型确定与会终端支持的信息类型;逻辑通道的类型包括:音频、视频和数据。
The multi-point audio and video communication multi-point control processing unit acquires the information types supported by the participating terminals, including: the multi-point audio video communication multi-point control processing unit acquires a logical channel established for each participating terminal, and determines the participating terminal according to the type of the logical channel. Types of information supported; types of logical channels include: audio, video, and data.
多点音频视频通信多点控制处理单元将指引用户输入的音频和/或视频界面发送至各与会终端,包括:多点音频视频通信多点控制处理单元将设置为指引用户输入的音频和/或视频界面与当前多点音频视频通信中的数据、音频及视频码流一同编码为媒体流格式后发送至各与会终端;多点音频视频通信多点控制处理单元将统计结果发送至各与会终端,包括:多点音频视频通信多点控制处理单元将统计结果与当前会议中的数据、音频及视频码流一同编码为媒体流格式后发送至各与会终端。The multipoint audio video communication multipoint control processing unit transmits an audio and/or video interface directing user input to each of the participating terminals, including: a multipoint audio video communication multipoint control processing unit will be set to direct user input audio and/or The video interface is encoded into the media stream format together with the data, audio and video code streams in the current multi-point audio and video communication, and then sent to each participating terminal; the multi-point audio and video communication multi-point control processing unit sends the statistical result to each participating terminal. The multi-point audio and video communication multi-point control processing unit encodes the statistical result together with the data, audio and video code streams in the current conference into a media stream format, and then sends the result to each participating terminal.
互动的类型包括:投票、打分和表决。The types of interaction include: voting, scoring, and voting.
多点音频视频通信多点控制处理单元包括多点控制单元MCU。The multipoint audio video communication multipoint control processing unit includes a multipoint control unit MCU.
根据本发明的再一个实施例,提供了一种多点音频视频通信多点控制处理单元,包括:互动发起模块,设置为在多点音频视频通信中,将与本次互动对应的设置为指引用户输入的音频和/或视频界面发送至各与会终端;其中,各与会终端均支持用户输入通道;互动处理模块,设置为接收各与会终端根据音频和/或视频界面的指引而反馈的互动信息,并将互动信息发送至多点音频视频通信业务管理系统,以及接收会议业务管理系统利用互动信息进行统计得到的统计结果,将统计结果发送至各与会终端。According to still another embodiment of the present invention, a multipoint audio video communication multipoint control processing unit is provided, including: an interaction initiation module, configured to set a guide corresponding to the interaction in the multipoint audio video communication as a guide The audio and/or video interface input by the user is sent to each participating terminal; wherein each participating terminal supports a user input channel; the interactive processing module is configured to receive interactive information fed back by each participating terminal according to the audio and/or video interface guidelines. And transmitting the interactive information to the multi-point audio video communication service management system, and receiving the statistical result obtained by the conference service management system using the interactive information, and transmitting the statistical result to each participating terminal.
互动发起模块,还设置为获取预先存储的与本次互动对应的音频文件和图片文件,并对音频文件和图片文件分别进行合成,形成用以指引用户输入的音频和视频界面。The interaction initiation module is further configured to obtain a pre-stored audio file and a picture file corresponding to the interaction, and separately synthesize the audio file and the picture file to form an audio and video interface for guiding user input.
互动发起模块,设置为获取各与会终端支持的信息类型,并基于该信息类型向各与会终端发送音频和/或视频界面。The interaction initiation module is configured to obtain an information type supported by each participant terminal, and send an audio and/or video interface to each participant terminal based on the information type.
互动发起模块,设置为获取各与会终端支持的信息类型,并检测各与会终端是否对接收信息的类型进行设定,若进行了设定,则检测对应与会终端设定的信息类型是否属于其支持的信息类型,若是,则基于设定的信息类型向对应与会终端发送音频或视频界面;否则,基于与会终端支持的信息类型向对应与会终端发送音频或视频界面;若未进行设定,则基于各与会终端支持的信息类型,向各与会终端发送音频和/或视频界面。The interaction initiation module is configured to obtain the type of information supported by each conference terminal, and detect whether each conference terminal sets the type of the received information. If the setting is performed, it is detected whether the information type set by the corresponding conference terminal belongs to the support. The information type, if yes, sends an audio or video interface to the corresponding conference terminal based on the set information type; otherwise, the audio or video interface is sent to the corresponding conference terminal based on the type of information supported by the conference terminal; if not, the The type of information supported by each participating terminal sends an audio and/or video interface to each participating terminal.
互动处理模块,进一步设置为将统计结果合成文字、音频和视频,并基于在先获取的各与会终端支持的信息类型,向各与会终端发送文字、音频和/或视频的统计结果。The interactive processing module is further configured to synthesize the statistical result into text, audio, and video, and send the statistical result of the text, audio, and/or video to each participating terminal based on the type of information supported by each of the participating terminals.
互动发起模块,进一步设置为将设置为指引用户输入的音频和/或视频界面与当前会议中的数据、音频及视频码流一同编码为媒体流格式后发送至各与会终端;互动处
理模块,进一步设置为将统计结果与当前会议中的数据、音频及视频码流一同编码为媒体流格式后发送至各与会终端。The interaction initiation module is further configured to encode the audio and/or video interface set to direct the user input together with the data, audio and video code streams in the current conference as a media stream format, and then send the format to the conference terminal;
The management module is further configured to encode the statistical result together with the data, audio and video code streams in the current conference into a media stream format, and then send the result to each conference terminal.
多点音频视频通信包括语音视频会议;多点音频视频通信多点控制处理单元包括多点控制单元MCU。The multipoint audio video communication includes a voice video conference; the multipoint audio video communication multipoint control processing unit includes a multipoint control unit MCU.
通过本发明实施例,在多点音频视频通信系统中实现了远程交互,从而无需引入第三方设备,提高了可操作性并降低了成本。Through the embodiments of the present invention, remote interaction is implemented in a multi-point audio video communication system, thereby eliminating the need to introduce third-party devices, improving operability and reducing costs.
此处所说明的附图用来提供对本发明的进一步理解,构成本申请的一部分,本发明的示意性实施例及其说明用于解释本发明,并不构成对本发明的不当限定。在附图中:The drawings described herein are intended to provide a further understanding of the invention, and are intended to be a part of the invention. In the drawing:
图1是根据相关技术的多点音频视频通信系统的结构示意图;1 is a schematic structural diagram of a multipoint audio video communication system according to the related art;
图2是根据相关技术的具有表决功能的多点音频视频通信系统的结构示意图;2 is a schematic structural diagram of a multipoint audio video communication system with a voting function according to the related art;
图3是根据本发明实施例一的多点音频视频通信中远程互动的方法的流程图;3 is a flow chart of a method for remote interaction in multi-point audio video communication according to Embodiment 1 of the present invention;
图4是根据本发明实施例一的多点音频视频通信多点控制处理单元的结构框图;4 is a structural block diagram of a multipoint audio video communication multipoint control processing unit according to Embodiment 1 of the present invention;
图5是根据本发明实施例一的另一多点音频视频通信中远程互动的方法的流程图;5 is a flowchart of a method for remote interaction in another multi-point audio video communication according to Embodiment 1 of the present invention;
图6是根据本发明实施例一的多点音频视频通信业务管理系统的结构框图;6 is a structural block diagram of a multipoint audio video communication service management system according to Embodiment 1 of the present invention;
图7是根据本发明实施例二的多点音频视频通信中远程互动的方法的流程图;7 is a flowchart of a method for remote interaction in multi-point audio video communication according to Embodiment 2 of the present invention;
图8是根据本发明实施例二的多点音频视频通信多点控制处理单元的结构框图;8 is a structural block diagram of a multipoint audio video communication multipoint control processing unit according to Embodiment 2 of the present invention;
图9是根据本发明实施例可选实施方式一的多点音频视频通信中远程互动的方法的流程图;9 is a flowchart of a method for remote interaction in multi-point audio video communication according to an alternative embodiment 1 of the present invention;
图10是根据本发明实施例可选实施方式二的多点音频视频通信中远程互动的方法的流程图;10 is a flowchart of a method for remote interaction in multi-point audio video communication according to an alternative embodiment 2 of the present invention;
图11是根据本发明实施例可选实施方式三的多点音频视频通信多点控制处理单元的结构框图;以及
11 is a structural block diagram of a multipoint audio video communication multipoint control processing unit according to an alternative embodiment 3 of the embodiment of the present invention;
图12是根据本发明实施例可选实施方式四的远程教育中的远程互动系统的结构示意图。FIG. 12 is a schematic structural diagram of a remote interaction system in distance education according to an alternative embodiment 4 of the embodiment of the present invention.
下文中将参考附图并结合实施例来详细说明本发明。需要说明的是,在不冲突的情况下,本申请中的实施例及实施例中的特征可以相互组合。The invention will be described in detail below with reference to the drawings in conjunction with the embodiments. It should be noted that the embodiments in the present application and the features in the embodiments may be combined with each other without conflict.
实施例一Embodiment 1
在本实施例中提供了一种多点音频视频通信中远程互动的方法,图3是根据本发明实施例一的多点音频视频通信中远程互动的方法的流程图,如图3所示,该流程包括如下步骤:In the embodiment, a method for remote interaction in multi-point audio video communication is provided. FIG. 3 is a flowchart of a method for remote interaction in multi-point audio video communication according to the first embodiment of the present invention, as shown in FIG. The process includes the following steps:
步骤S302,多点音频视频通信多点控制处理单元向多点音频视频通信终端发送承载第一互动信息的音频和/或视频数据;Step S302, the multipoint audio video communication multipoint control processing unit sends the audio and/or video data carrying the first interactive information to the multipoint audio video communication terminal;
步骤S304,多点音频视频通信多点控制处理单元接收多点音频视频通信终端根据第一互动信息发送的承载第二互动信息的音频和/或视频数据;Step S304, the multi-point audio video communication multi-point control processing unit receives the audio and/or video data that carries the second interaction information that is sent by the multi-point audio video communication terminal according to the first interaction information;
步骤S306,多点音频视频通信多点控制处理单元处理第二互动信息。Step S306, the multipoint audio video communication multipoint control processing unit processes the second interactive information.
通过本发明实施例,在多点音频视频通信系统中实现了远程交互,从而无需引入第三方设备,提高了可操作性并降低了成本。Through the embodiments of the present invention, remote interaction is implemented in a multi-point audio video communication system, thereby eliminating the need to introduce third-party devices, improving operability and reducing costs.
在本发明实施例的一个可选实施方式中,上述步骤S306中多点音频视频通信多点控制处理单元处理第二互动信息可以包括:多点音频视频通信多点控制处理单元向多点音频视频通信业务管理系统发送上述第二互动信息,接收多点音频视频通信业务管理系统根据上述第二互动信息发送的处理结果。通过该可选实施方式,降低了多点音频视频通信多点控制处理单元的处理负荷。In an optional implementation manner of the embodiment of the present invention, the multi-point audio and video communication multi-point control processing unit processing the second interaction information in the step S306 may include: multi-point audio and video communication multi-point control processing unit to multi-point audio video The communication service management system sends the second interaction information, and receives a processing result sent by the multi-point audio video communication service management system according to the second interaction information. With this alternative embodiment, the processing load of the multipoint audio video communication multipoint control processing unit is reduced.
在本发明实施例的一个可选实施方式中,多点音频视频通信多点控制处理单元接收多点音频视频通信业务管理系统根据第二互动信息发送的处理结果之后,还可以向多点音频视频通信终端发送承载上述处理结果的音频和/或视频数据。In an optional implementation manner of the embodiment of the present invention, the multi-point audio video communication multi-point control processing unit receives the processing result sent by the multi-point audio video communication service management system according to the second interactive information, and may also provide multi-point audio video. The communication terminal transmits audio and/or video data carrying the above processing result.
在本发明实施例的一个可选实施方式中,多点音频视频通信多点控制处理单元向多点音频视频通信终端发送承载第一互动信息的音频和/或视频数据,包括:多点音频视频通信多点控制处理单元获取第一互动信息,将该第一互动信息和会议中的语音和/
或视频码流编码,得到承载上述第一互动信息的音频和/或视频数据,向多点音频视频通信终端发送上述承载第一互动信息的音频和/或视频数据。通过该可选实施方式,将会以中的信息和交互信息编码后发送给多点音频视频通信终端。In an optional implementation manner of the embodiment of the present invention, the multi-point audio video communication multi-point control processing unit sends the audio and/or video data carrying the first interaction information to the multi-point audio video communication terminal, including: multi-point audio video. The communication multi-point control processing unit acquires the first interaction information, and the first interaction information and the voice in the conference and/or
Or video stream encoding, obtaining audio and/or video data carrying the first interactive information, and transmitting the audio and/or video data carrying the first interactive information to the multi-point audio video communication terminal. With this alternative embodiment, the information and the interactive information will be encoded and sent to the multi-point audio video communication terminal.
在本发明实施例的一个可选实施方式中,多点音频视频通信多点控制处理单元向多点音频视频通信终端发送上述承载处理结果的音频和/或视频数据,包括:多点音频视频通信多点控制处理单元将上述处理结果和会议中的语音和/或视频码流编码,得到承载处理结果的音频和/或视频数据,向多点音频视频通信终端发送该承载处理结果的音频和/或视频数据。In an optional implementation manner of the embodiment of the present invention, the multipoint audio video communication multipoint control processing unit sends the audio and/or video data of the bearer processing result to the multipoint audio video communication terminal, including: multipoint audio video communication. The multipoint control processing unit encodes the above processing result and the voice and/or video code stream in the conference to obtain audio and/or video data carrying the processing result, and transmits the audio of the bearer processing result to the multipoint audio video communication terminal and/or Or video data.
在本实施例中还提供了一种多点音频视频通信多点控制处理单元,该单元设置为实现上述实施例及优选实施方式,已经进行过说明的不再赘述。如以下所使用的,术语“模块”可以实现预定功能的软件和/或硬件的组合。尽管以下实施例所描述的装置较佳地以软件来实现,但是硬件,或者软件和硬件的组合的实现也是可能并被构想的。In the embodiment, a multi-point audio and video communication multi-point control processing unit is further provided, and the unit is configured to implement the above-mentioned embodiments and preferred embodiments, and the detailed description thereof has been omitted. As used below, the term "module" may implement a combination of software and/or hardware of a predetermined function. Although the apparatus described in the following embodiments is preferably implemented in software, hardware, or a combination of software and hardware, is also possible and contemplated.
图4是根据本发明实施例一的多点音频视频通信多点控制处理单元的结构框图,如图4所示,该多点音频视频通信多点控制处理单元包括:第一发送模块410,设置为向多点音频视频通信终端发送承载第一互动信息的音频和/或视频数据;接收模块420,与第一发送模块410相连,设置为接收多点音频视频通信终端根据上述第一互动信息发送的承载第二互动信息的音频和/或视频数据;处理模块430,与接收模块420相连,设置为处理上述第二互动信息。4 is a structural block diagram of a multipoint audio video communication multipoint control processing unit according to Embodiment 1 of the present invention. As shown in FIG. 4, the multipoint audio video communication multipoint control processing unit includes: a first sending module 410, setting The receiving module 420 is connected to the first sending module 410 and configured to receive the multi-point audio and video communication terminal according to the first interactive information, to send the audio and/or video data that carries the first interactive information to the multi-point audio and video communication terminal. The audio and/or video data carrying the second interactive information; the processing module 430 is connected to the receiving module 420 and configured to process the second interactive information.
在本发明实施例的一个可选实施方式中,上述处理模块430可以包括:发送单元,设置为向多点音频视频通信业务管理系统发送第二互动信息;接收单元,与发送单元相连,设置为接收多点音频视频通信业务管理系统根据第二互动信息发送的处理结果。In an optional implementation manner of the embodiment of the present invention, the processing module 430 may include: a sending unit, configured to send second interaction information to the multi-point audio video communication service management system; and the receiving unit is connected to the sending unit, and is configured to Receiving a processing result sent by the multi-point audio video communication service management system according to the second interactive information.
在本发明实施例的一个可选实施方式中,上述多点音频视频通信多点控制处理单元还包括:第二发送模块,设置为向多点音频视频通信终端发送承载处理结果的音频和/或视频数据。In an optional implementation manner of the embodiment of the present invention, the multi-point audio video communication multi-point control processing unit further includes: a second sending module, configured to send the audio carrying the processing result to the multi-point audio video communication terminal and/or Video data.
在本发明实施例的一个实施方式中,上述第一发送模块410可以包括:获取单元,设置为获取第一互动信息;第一编码单元,与上述获取单元相连,设置为将第一互动信息和会议中的语音和/或视频码流编码,得到承载第一互动信息的音频和/或视频数据;第一发送单元,与第一编码单元相连,设置为向多点音频视频通信终端发送承载第一互动信息的音频和/或视频数据。
In an embodiment of the present invention, the first sending module 410 may include: an acquiring unit, configured to acquire first interaction information; and the first encoding unit is connected to the acquiring unit, and is configured to set the first interaction information and The voice and/or video code stream in the conference is encoded to obtain audio and/or video data carrying the first interaction information; the first sending unit is connected to the first coding unit and configured to send the bearer to the multi-point audio video communication terminal. An interactive audio and/or video data.
在本发明实施例的一个可选实施方式中,上述第二发送模块可以包括:第二编码单元,设置为将处理结果和会议中的语音和/或视频码流编码,得到承载处理结果的音频和/或视频数据;第二发送单元,与第二编码单元相连,设置为向多点音频视频通信终端发送承载处理结果的音频和/或视频数据。In an optional implementation manner of the embodiment of the present invention, the foregoing second sending module may include: a second encoding unit configured to encode the processing result and the voice and/or video code stream in the conference to obtain an audio carrying the processing result. And/or video data; the second transmitting unit is connected to the second encoding unit and configured to transmit the audio and/or video data carrying the processing result to the multi-point audio video communication terminal.
在本实施例中还提供了另一种多点音频视频通信中远程互动的方法,图5是根据本发明实施例一的另一多点音频视频通信中远程互动的方法的流程图,如图5所示,该流程包括如下步骤:In this embodiment, another method for remote interaction in multi-point audio video communication is also provided. FIG. 5 is a flowchart of a method for remote interaction in another multi-point audio video communication according to the first embodiment of the present invention. As shown in 5, the process includes the following steps:
步骤S502,多点音频视频通信业务管理系统指示多点音频视频通信多点控制处理单元向多点音频视频通信终端发送第一互动信息;Step S502, the multipoint audio video communication service management system instructs the multipoint audio video communication multipoint control processing unit to send the first interaction information to the multipoint audio video communication terminal;
步骤S504,该多点音频视频通信业务管理系统接收多点音频视频通信多点控制处理单元发送的多点音频视频通信终端根据上述第一互动信息反馈的第二互动信息;Step S504, the multi-point audio video communication service management system receives the second interaction information that is sent by the multi-point audio and video communication terminal sent by the multi-point audio and video communication multi-point control processing unit according to the first interaction information;
步骤S506,多点音频视频通信业务管理系统处理该第二互动信息。Step S506, the multi-point audio video communication service management system processes the second interaction information.
在本发明实施例的一个可选实施方式中,上述步骤S506,多点音频视频通信业务管理系统处理上述第二互动信息之后,多点音频视频通信业务管理系统还可以指示多点音频视频通信多点控制处理单元向多点音频视频通信终端发送处理结果。In an optional implementation manner of the embodiment of the present invention, after the multi-point audio video communication service management system processes the second interaction information, the multi-point audio video communication service management system may further indicate multi-point audio and video communication. The point control processing unit transmits the processing result to the multipoint audio video communication terminal.
在本发明实施例的一个可选实施方式中,上述第一互动信息包括投票信息和/或打分信息;上述第二互动信息包括:投票选择和/或打分分数;上述步骤S506中多点音频视频通信业务管理系统处理上述第二互动信息可以包括:多点音频视频通信业务管理系统对投票选择和/或打分分数进行统计分析,得到投票结果和/或打分结果。In an optional implementation manner of the embodiment of the present invention, the first interaction information includes voting information and/or scoring information; the second interaction information includes: voting selection and/or scoring score; and multi-point audio video in step S506 The processing of the second interactive information by the communication service management system may include: the multi-point audio video communication service management system performs statistical analysis on the voting selection and/or the scoring score to obtain a voting result and/or a scoring result.
在本实施例中还提供了一种多点音频视频通信业务管理系统,该单元设置为实现上述实施例及优选实施方式,已经进行过说明的不再赘述。如以下所使用的,术语“模块”可以实现预定功能的软件和/或硬件的组合。尽管以下实施例所描述的装置较佳地以软件来实现,但是硬件,或者软件和硬件的组合的实现也是可能并被构想的。In this embodiment, a multi-point audio and video communication service management system is further provided. The unit is configured to implement the foregoing embodiments and preferred embodiments, and details are not described herein. As used below, the term "module" may implement a combination of software and/or hardware of a predetermined function. Although the apparatus described in the following embodiments is preferably implemented in software, hardware, or a combination of software and hardware, is also possible and contemplated.
图6是根据本发明实施例一的多点音频视频通信业务管理系统的结构框图,如图6所示,该系统包括:第一指示模块610,设置为指示多点音频视频通信多点控制处理单元向多点音频视频通信终端发送第一互动信息;接收模块620,与第一指示模块610相连,设置为接收多点音频视频通信多点控制处理单元发送的多点音频视频通信终端根据第一互动信息反馈的第二互动信息;处理模块630,与接收模块620相连,设置为处理第二互动信息。
FIG. 6 is a structural block diagram of a multipoint audio video communication service management system according to Embodiment 1 of the present invention. As shown in FIG. 6, the system includes: a first indication module 610, configured to indicate multipoint audio and video communication multipoint control processing. The unit sends the first interaction information to the multi-point audio and video communication terminal; the receiving module 620 is connected to the first indication module 610, and is configured to receive the multi-point audio and video communication multi-point control processing unit to send the multi-point audio and video communication terminal according to the first The second interactive information of the interactive information feedback; the processing module 630 is connected to the receiving module 620 and configured to process the second interactive information.
在本发明实施例的一个可选实施方式中,上述多点音频视频通信业务管理系统还包括:第二指示模块,设置为指示多点音频视频通信多点控制处理单元向多点音频视频通信终端发送处理结果。In an optional implementation manner of the embodiment of the present invention, the multi-point audio video communication service management system further includes: a second indication module, configured to instruct the multi-point audio and video communication multi-point control processing unit to the multi-point audio and video communication terminal Send the processing result.
在本发明实施例的一个可选实施方式中,第一互动信息包括投票信息和/或打分信息;第二互动信息包括以下至少之一:投票选择和/或打分分数;上述处理模块,设置为多点音频视频通信业务管理系统对投票选择和/或打分分数进行统计分析得到投票结果和/或打分结果。In an optional implementation manner of the embodiment of the present invention, the first interaction information includes voting information and/or scoring information; the second interaction information includes at least one of: voting selection and/or scoring score; the processing module is configured to The multi-point audio video communication service management system performs statistical analysis on voting selection and/or scoring scores to obtain voting results and/or scoring results.
实施例二Embodiment 2
在本实施例中提供了一种多点音频视频通信中远程互动的方法,图7是根据本发明实施例二的多点音频视频通信中远程互动的方法的流程图,如图7所示,该流程包括如下步骤:In this embodiment, a method for remote interaction in multi-point audio video communication is provided. FIG. 7 is a flowchart of a method for remote interaction in multi-point audio video communication according to Embodiment 2 of the present invention, as shown in FIG. The process includes the following steps:
步骤S702,多点音频视频通信多点控制处理单元将与本次互动对应的设置为指引用户输入的音频和/或视频界面发送至各与会终端;其中,各与会终端均支持用户输入通道;Step S702, the multi-point audio video communication multi-point control processing unit sends an audio and/or video interface corresponding to the current interaction, which is set to guide the user input, to each participating terminal; wherein each participating terminal supports the user input channel;
步骤S704,多点音频视频通信多点控制处理单元接收各与会终端根据音频和/或视频界面的指引而反馈的互动信息;Step S704, the multi-point audio video communication multi-point control processing unit receives the interaction information fed back by each participating terminal according to the instruction of the audio and/or video interface;
步骤S706,多点音频视频通信多点控制处理单元将互动信息发送至多点音频视频通信业务管理系统;Step S706, the multi-point audio video communication multi-point control processing unit sends the interaction information to the multi-point audio video communication service management system;
步骤S708,多点音频视频通信多点控制处理单元接收多点音频视频通信业务管理系统利用互动信息进行统计得到的统计结果;Step S708, the multipoint audio and video communication multipoint control processing unit receives the statistical result obtained by using the interactive information by the multipoint audio and video communication service management system;
步骤S710,多点音频视频通信多点控制处理单元将统计结果发送至各与会终端。Step S710, the multi-point audio video communication multi-point control processing unit sends the statistical result to each participating terminal.
在本发明实施例的一个可以实施方式中,多点音频视频通信多点控制处理单元将音频和/或视频界面发送至各与会终端之前,还包括:多点音频视频通信多点控制处理单元获取预先存储的与本次互动对应的音频文件和图片文件,并对音频文件和图片文件分别进行合成,形成用以指引用户输入的音频和视频界面。In an implementation manner of the embodiment of the present invention, the multi-point audio video communication multi-point control processing unit sends the audio and/or video interface to each participating terminal, and further includes: multi-point audio video communication multi-point control processing unit acquiring The audio file and the image file corresponding to the current interaction are pre-stored, and the audio file and the image file are separately synthesized to form an audio and video interface for guiding user input.
在本发明实施例的一个可以实施方式中,多点音频视频通信多点控制处理单元将指引用户输入的音频和/或视频界面发送至各与会终端,包括:多点音频视频通信多点
控制处理单元获取各与会终端支持的信息类型;多点音频视频多点控制处理单元基于各与会终端支持的信息类型向各将指引用户输入的音频和/或视频界面发送至各与会终端。In an implementation manner of the embodiment of the present invention, the multi-point audio video communication multi-point control processing unit sends an audio and/or video interface that directs user input to each participating terminal, including: multi-point audio and video communication.
The control processing unit acquires the type of information supported by each participating terminal; the multi-point audio and video multi-point control processing unit transmits the audio and/or video interfaces each guiding the user input to each participating terminal based on the type of information supported by each participating terminal.
在本发明实施例的一个可以实施方式中,多点音频视频通信多点控制处理单元将指引用户输入的音频和/或视频界面发送至各与会终端,包括:多点音频视频通信多点控制处理单元获取各与会终端支持的信息类型,并检测各与会终端是否对接收信息的类型进行设定;当与会终端设定了接收信息的类型时,检测对应与会终端设定的信息类型是否属于其支持的信息类型,若是,基于设定的信息类型向对应与会终端发送音频和/或视频界面;若否,基于与会终端支持的信息类型向对应与会终端发送音频和/或视频界面;和/或当与会终端未设定接收信息的类型时,基于各与会终端支持的信息类型,向各与会终端发送音频和/或视频界面。In an implementation manner of the embodiment of the present invention, the multi-point audio video communication multi-point control processing unit sends an audio and/or video interface that directs user input to each participating terminal, including: multi-point audio video communication multi-point control processing. The unit acquires the type of information supported by each participating terminal, and detects whether each participating terminal sets the type of the received information; when the participating terminal sets the type of the received information, it detects whether the type of information set by the corresponding participating terminal belongs to its support. The type of information, if yes, sending an audio and/or video interface to the corresponding participant terminal based on the set information type; if not, transmitting an audio and/or video interface to the corresponding participant terminal based on the type of information supported by the participant terminal; and/or when When the participating terminal does not set the type of the received information, the audio and/or video interface is transmitted to each participating terminal based on the type of information supported by each participating terminal.
在本发明实施例的一个可以实施方式中,多点音频视频通信多点控制处理单元将统计结果发送至各与会终端,包括:多点音频视频通信多点控制处理单元将统计结果合成文字、音频和视频,并基于在先获取的各与会终端支持的信息类型,向各与会终端发送文字、音频和/或视频的统计结果。In an implementation manner of the embodiment of the present invention, the multi-point audio video communication multi-point control processing unit sends the statistical result to each participating terminal, including: a multi-point audio video communication multi-point control processing unit synthesizes the statistical result into text and audio. And video, and based on the type of information supported by the respective participating terminals, the statistical results of text, audio and/or video are sent to the participating terminals.
在本发明实施例的一个可以实施方式中,多点音频视频通信多点控制处理单元获取各与会终端支持的信息类型的方式包括:多点音频视频通信多点控制处理单元获取为各与会终端建立的逻辑通道,根据逻辑通道的类型确定与会终端支持的信息类型;逻辑通道的类型包括:音频、视频和数据。In an implementation manner of the embodiment of the present invention, the multi-point audio video communication multi-point control processing unit acquires the information type supported by each participating terminal, including: multi-point audio video communication multi-point control processing unit acquisition is established for each participating terminal The logical channel determines the type of information supported by the participant terminal according to the type of the logical channel; the types of the logical channel include: audio, video, and data.
在本发明实施例的一个可以实施方式中,多点音频视频通信多点控制处理单元将指引用户输入的音频和/或视频界面发送至各与会终端,包括:多点音频视频通信多点控制处理单元将设置为指引用户输入的音频和/或视频界面与当前多点音频视频通信中的数据、音频及视频码流一同编码为媒体流格式后发送至各与会终端;多点音频视频通信多点控制处理单元将统计结果发送至各与会终端,包括:多点音频视频通信多点控制处理单元将统计结果与当前会议中的数据、音频及视频码流一同编码为媒体流格式后发送至各与会终端。In an implementation manner of the embodiment of the present invention, the multi-point audio video communication multi-point control processing unit sends an audio and/or video interface that directs user input to each participating terminal, including: multi-point audio video communication multi-point control processing. The unit is configured to direct the audio and/or video interface input by the user to be encoded into the media stream format together with the data, audio and video code streams in the current multi-point audio video communication, and then sent to each participating terminal; multi-point audio and video communication is multi-point The control processing unit sends the statistical result to each participating terminal, including: the multi-point audio video communication multi-point control processing unit encodes the statistical result together with the data, audio and video code streams in the current conference into a media stream format, and then sends the result to each participant. terminal.
在本发明实施例的一个可以实施方式中,互动的类型包括:投票、打分和表决。In an implementation manner of an embodiment of the present invention, the types of interaction include: voting, scoring, and voting.
在本发明实施例的一个可以实施方式中,多点音频视频通信包括音频视频会议;多点音频视频通信多点控制处理单元包括多点控制单元MCU。
In an implementation manner of the embodiment of the present invention, the multipoint audio video communication includes an audio video conference; the multipoint audio video communication multipoint control processing unit includes a multipoint control unit MCU.
在本实施例中还提供了一种多点音频视频通信多点控制处理单元,该单元设置为实现上述实施例及优选实施方式,已经进行过说明的不再赘述。如以下所使用的,术语“模块”可以实现预定功能的软件和/或硬件的组合。尽管以下实施例所描述的装置较佳地以软件来实现,但是硬件,或者软件和硬件的组合的实现也是可能并被构想的。In the embodiment, a multi-point audio and video communication multi-point control processing unit is further provided, and the unit is configured to implement the above-mentioned embodiments and preferred embodiments, and the detailed description thereof has been omitted. As used below, the term "module" may implement a combination of software and/or hardware of a predetermined function. Although the apparatus described in the following embodiments is preferably implemented in software, hardware, or a combination of software and hardware, is also possible and contemplated.
图8是根据本发明实施例二的多点音频视频通信多点控制处理单元的结构框图,如图8所示,包括:互动发起模块810,设置为在多点音频视频通信中,将与本次互动对应的设置为指引用户输入的音频和/或视频界面发送至各与会终端;其中,各与会终端均支持用户输入通道;互动处理模块820,设置为接收各与会终端根据音频和/或视频界面的指引而反馈的互动信息,并将互动信息发送至多点音频视频通信业务管理系统,以及接收会议业务管理系统利用互动信息进行统计得到的统计结果,将统计结果发送至各与会终端。8 is a structural block diagram of a multipoint audio video communication multipoint control processing unit according to Embodiment 2 of the present invention. As shown in FIG. 8, the method includes: an interaction initiation module 810, which is configured to be in a multipoint audio and video communication. The interaction corresponding to the secondary interaction is to send the audio and/or video interface input by the user to each participating terminal; wherein each participating terminal supports the user input channel; the interaction processing module 820 is configured to receive the respective terminal according to the audio and/or video. The interactive information fed back by the interface, and the interactive information is sent to the multi-point audio and video communication service management system, and the statistical result obtained by the conference service management system using the interactive information is collected, and the statistical result is sent to each participating terminal.
在本发明实施例的一个可选实施方式中,互动发起模块810,还设置为获取预先存储的与本次互动对应的音频文件和图片文件,并对音频文件和图片文件分别进行合成,形成用以指引用户输入的音频和视频界面。In an optional implementation manner of the embodiment of the present invention, the interaction initiation module 810 is further configured to acquire a pre-stored audio file and a picture file corresponding to the current interaction, and separately synthesize the audio file and the image file for formation. To guide the user input audio and video interface.
在本发明实施例的一个可选实施方式中,互动发起模块810,设置为获取各与会终端支持的信息类型,并基于该信息类型向各与会终端发送音频和/或视频界面。In an optional implementation manner of the embodiment of the present invention, the interaction initiation module 810 is configured to obtain an information type supported by each participant terminal, and send an audio and/or video interface to each participant terminal based on the information type.
在本发明实施例的一个可选实施方式中,互动发起模块810,设置为获取各与会终端支持的信息类型,并检测各与会终端是否对接收信息的类型进行设定,若进行了设定,则检测对应与会终端设定的信息类型是否属于其支持的信息类型,若是,则基于设定的信息类型向对应与会终端发送音频或视频界面;否则,基于与会终端支持的信息类型向对应与会终端发送音频或视频界面;若未进行设定,则基于各与会终端支持的信息类型,向各与会终端发送音频和/或视频界面。In an optional implementation manner of the embodiment of the present invention, the interaction initiation module 810 is configured to acquire information types supported by the conference terminals, and detect whether each conference terminal sets the type of the received information, and if configured, Then, it is detected whether the information type set by the corresponding conference terminal belongs to the information type supported by the conference terminal, and if yes, the audio or video interface is sent to the corresponding conference terminal based on the set information type; otherwise, the information type supported by the conference terminal is used to correspond to the conference terminal. Send an audio or video interface; if not set, send an audio and/or video interface to each participating terminal based on the type of information supported by each participating terminal.
在本发明实施例的一个可选实施方式中,互动处理模块820,进一步设置为将统计结果合成文字、音频和视频,并基于在先获取的各与会终端支持的信息类型,向各与会终端发送文字、音频和/或视频的统计结果。In an optional implementation manner of the embodiment of the present invention, the interaction processing module 820 is further configured to synthesize the statistical result into text, audio, and video, and send the message to each participating terminal based on the type of information supported by each participating terminal. Statistical results for text, audio, and/or video.
在本发明实施例的一个可选实施方式中,互动发起模块810,进一步设置为将设置为指引用户输入的音频和/或视频界面与当前会议中的数据、音频及视频码流一同编码为媒体流格式后发送至各与会终端;互动处理模块820,进一步设置为将统计结果与当前会议中的数据、音频及视频码流一同编码为媒体流格式后发送至各与会终端。
In an optional implementation manner of the embodiment of the present invention, the interaction initiation module 810 is further configured to encode the audio and/or video interface set to direct user input with the data, audio, and video code streams in the current conference as the media. The stream format is sent to each participant terminal. The interaction processing module 820 is further configured to encode the statistical result together with the data, audio and video code streams in the current conference into a media stream format, and then send the result to each conference terminal.
在本发明实施例的一个可选实施方式中,多点音频视频通信包括语音视频会议;多点音频视频通信多点控制处理单元包括多点控制单元MCU。In an optional implementation manner of the embodiment of the present invention, the multipoint audio video communication includes a voice video conference; the multipoint audio video communication multipoint control processing unit includes a multipoint control unit MCU.
在发明实施例中,多点音频视频通信可以包括各种类型的通信,例如视频会议、语音会议等,但是不限于此。下面以语音频视频会议为例,对本发明实施例的可选实施方式进行描述。In an embodiment of the invention, multi-point audio video communication may include various types of communication, such as video conferencing, voice conferencing, etc., but is not limited thereto. The following describes an optional implementation manner of the embodiment of the present invention by taking an audio and video conference as an example.
可选实施方式一Alternative embodiment 1
在该可选实施方式中提供一种音频视频会议中的互动方法,在本可选实施方式中,以上述的第一互动信息为指引用户输入的音频和/或视频界面为例进行说明,如图9所示,该方法包括如下步骤:In this alternative embodiment, an interaction method in an audio video conference is provided. In the optional implementation manner, the audio and/or video interface input by the user is used as an example to describe an example. As shown in FIG. 9, the method includes the following steps:
步骤S902,多点音频视频通信多点控制处理单元将与本次互动对应的设置为指引用户输入的音频和/或视频界面发送至各与会多点音频视频通信终端;Step S902, the multi-point audio and video communication multi-point control processing unit sends an audio and/or video interface corresponding to the current interaction, which is set to guide the user input, to each of the participating multi-point audio and video communication terminals;
该步骤中,上述互动的类型包括但不限于为:投票、打分和表决。In this step, the types of interactions mentioned above include, but are not limited to, voting, scoring, and voting.
该步骤中,各与会多点音频视频通信终端均支持用户输入通道;确切地讲,多点音频视频通信多点控制处理单元在各与会终端入会时,为各与会终端开启用户输入通道,以使各与会终端支持用户输入通道,进而为终端用户的二次输入提供技术支持。In this step, each of the multi-point audio and video communication terminals supports the user input channel; specifically, the multi-point audio and video communication multi-point control processing unit opens the user input channel for each participating terminal when the participating terminals join the conference, so that Each participating terminal supports the user input channel, thereby providing technical support for the secondary input of the end user.
该步骤中,上述设置为指引用户输入的音频和/或视频界面的获取方式包括:多点音频视频通信多点控制处理单元获取预先存储的与本次互动对应的音频文件和图片文件,并对音频文件和图片文件分别进行合成,形成用以指引用户输入的音频和视频界面。In this step, the manner of obtaining the audio and/or video interface that is set to direct the user input includes: the multi-point audio video communication multi-point control processing unit acquires the pre-stored audio file and the image file corresponding to the current interaction, and The audio file and the image file are separately synthesized to form an audio and video interface for guiding user input.
该步骤中,多点音频视频通信多点控制处理单元向各与会终端下发音频、视频还是音频和视频,由与会终端支持的信息类型来决定,举例说明:若与会终端仅支持音频或视频,则向其发送音频或者视频界面,若与会终端支持音频和视频,则向其发送音频和视频界面。其中,多点音频视频通信多点控制处理单元获取各与会终端支持的信息类型的方式包括但不限于为:多点音频视频通信多点控制处理单元获取为各与会终端建立的逻辑通道,根据逻辑通道的类型确定与会终端支持的信息类型;该逻辑通道的类型包括:音频、视频和数据。
In this step, the multi-point audio and video communication multi-point control processing unit delivers audio, video or audio and video to each participating terminal, which is determined by the type of information supported by the participating terminal. For example, if the participating terminal only supports audio or video, Then send an audio or video interface to it, and if the participating terminal supports audio and video, send an audio and video interface to it. The manner in which the multi-point audio and video communication multi-point control processing unit acquires the information types supported by the participating terminals includes, but is not limited to, multi-point audio video communication multi-point control processing unit acquires a logical channel established for each participating terminal, according to logic The type of channel determines the type of information supported by the participating terminal; the types of the logical channel include: audio, video, and data.
或者,将用户的意愿考虑进来,实现方式如下:多点音频视频通信多点控制处理单元获取各与会终端支持的信息类型,并检测各与会终端是否对接收信息的类型进行设定;Or, the user's will is taken into consideration, and the implementation manner is as follows: the multi-point audio and video communication multi-point control processing unit acquires the information types supported by the participating terminals, and detects whether each participating terminal sets the type of the received information;
若进行了设定,则检测对应与会终端设定的信息类型是否属于其支持的信息类型,若是,则基于设定的信息类型向对应与会终端发送音频或视频界面;否则,基于与会终端支持的信息类型向对应与会终端发送音频或视频界面;If the setting is made, it is detected whether the information type set by the corresponding conference terminal belongs to the information type supported by the conference terminal, and if yes, the audio or video interface is sent to the corresponding conference terminal based on the set information type; otherwise, based on the conference terminal support The information type sends an audio or video interface to the corresponding conference terminal;
若未进行设定,则基于各与会终端支持的信息类型,向各与会终端发送音频和/或视频界面。If no setting is made, an audio and/or video interface is transmitted to each participating terminal based on the type of information supported by each participating terminal.
步骤S904,多点音频视频通信业务管理系统统计各与会终端根据音频和/或视频界面的指引而反馈的互动信息;Step S904, the multi-point audio video communication service management system collects interactive information that each participating terminal feeds back according to the guidance of the audio and/or video interface;
步骤S906,多点音频视频通信多点控制处理单元将多点音频视频通信业务管理系统得到的统计结果发送至各与会终端。Step S906, the multipoint audio video communication multipoint control processing unit sends the statistical result obtained by the multipoint audio video communication service management system to each participating terminal.
该步骤中,多点音频视频通信多点控制处理单元将统计结果合成文字、音频和视频,并基于在先获取的各与会终端支持的信息类型,向各与会终端发送文字、音频和/或视频的统计结果。In this step, the multi-point audio video communication multi-point control processing unit synthesizes the statistical result into text, audio and video, and sends text, audio and/or video to each participating terminal based on the type of information supported by each participating terminal. Statistical results.
在该可选实施方式中为了将互动过程与现有的会议过程进行有效结合,节省网络资源,将多点音频视频通信多点控制处理单元与各与会终端间交互的信息与会议中的数据、音频和视频码流一起编码为媒体流格式后进行交互。具体表现为:In this optional implementation manner, in order to effectively combine the interaction process with the existing conference process, the network resources are saved, and the information of the interaction between the multipoint audio and video communication multipoint control processing unit and each participating terminal and the data in the conference, The audio and video streams are encoded together into a media stream format for interaction. The specific performance is:
多点音频视频通信多点控制处理单元将设置为指引用户输入的音频和/或视频界面与当前会议中的数据、音频及视频码流一同编码为媒体流格式后发送至各与会终端。The multi-point audio video communication multi-point control processing unit encodes the audio and/or video interface set to direct the user input together with the data, audio and video code streams in the current conference into a media stream format and transmits them to each participating terminal.
多点音频视频通信多点控制处理单元将统计结果与当前会议中的数据、音频及视频码流一同编码为媒体流格式后发送至各与会终端。The multi-point audio and video communication multi-point control processing unit encodes the statistical result together with the data, audio and video code streams in the current conference into a media stream format, and then transmits the result to each participating terminal.
综上所述,可知在本可选实施方式中,在不改变用户的会议系统组网、不改变用户对会议系统的原始使用习惯的基础上,能使得与会终端用户能够有如临现场一样进行投票、打分或表决的互动体验。In summary, it can be seen that, in the optional implementation manner, on the basis of not changing the conference system of the user and changing the original usage habits of the user to the conference system, the conference end users can vote as if they are on the scene. An interactive experience of scoring, voting, or voting.
在该可选实施方式中,多点音频视频通信可以为电视视频会议,多点音频视频通信多点控制处理单元可以为MCU。In this optional implementation, the multipoint audio video communication may be a television video conference, and the multipoint audio video communication multipoint control processing unit may be an MCU.
可选实施方式二
Optional implementation method 2
在该可选实施方式中提供一种音频视频会议中的互动方法,如图10所示,该方法包括如下步骤:In this optional implementation, an interaction method in an audio video conference is provided. As shown in FIG. 10, the method includes the following steps:
步骤S1002,会议管理员在多点音频视频通信业务管理系统选择终端列表,召开视频会议。多点音频视频通信业务管理系统通知协议与通道控制模块召开视频会议,呼叫各与会终端入会。In step S1002, the conference administrator selects a terminal list in the multi-point audio video communication service management system to hold a video conference. The multi-point audio video communication service management system notification protocol and the channel control module hold a video conference, and call each participating terminal to join the conference.
其中,能力协商和逻辑通道打开的过程中就支持用户输入的通道(UserInput)。通过支持用户输入通道为以后互动过程中用户按照指示输入提供必要的技术支持。Among them, the user input channel (UserInput) is supported during the capability negotiation and logical channel opening process. By supporting the user input channel, the user provides the necessary technical support according to the instruction input during the interaction.
步骤S1004,业务管理系统根据会议现场的需要判断,是否需要启用投票、打分或者表决的流程,如果不需要则会议保持普通的模式继续,如果需要则进入投票、打分或者表决的流程,转步骤S1006。In step S1004, the service management system judges whether the voting, scoring, or voting process needs to be enabled according to the needs of the meeting site. If not, the meeting maintains the normal mode to continue, and if necessary, enters the voting, scoring, or voting process, and proceeds to step S1006. .
步骤S1006,多点音频视频通信多点控制处理单元对已经保存好的音频文件、图片文件分别进行合成,形成提醒用户输入的音频和视频界面(即视频的用户界面UI)。Step S1006: The multi-point audio video communication multi-point control processing unit separately synthesizes the already saved audio file and the picture file to form an audio and video interface (ie, a user interface UI of the video) that prompts the user to input.
步骤S1008:多点音频视频通信多点控制处理单元将音频和视频界面与当前会议中的数据、音频和视频码流一起进行编码成媒体流格式。Step S1008: The multipoint audio video communication multipoint control processing unit encodes the audio and video interface together with the data, audio and video code streams in the current conference into a media stream format.
步骤S1010:多点音频视频通信多点控制处理单元将音频和视频码流发送给与会终端;Step S1010: The multipoint audio video communication multipoint control processing unit sends the audio and video code streams to the conference terminal;
对于音频会议或只支持音频能力的终端,在S1002步骤中就不会打开视频通道,因此也不会将视频媒体流发送给对应的与会终端。For audio conferences or terminals that only support audio capabilities, the video channel will not be opened in step S1002, and therefore the video media stream will not be sent to the corresponding conference terminal.
步骤S1012,与会终端根据收到的音频和/或视频界面的引导,使用遥控器、键盘等输入自己的投票、打分或表决数据并进行提交。In step S1012, the participating terminal inputs its own voting, scoring or voting data and submits it according to the guidance of the received audio and/or video interface, using a remote controller, a keyboard, or the like.
其中,根据收到的音频和/视频界面的引导进行操作是一种交互式音频视频应答(IVR或者IVVR)技术。Among them, the operation according to the guidance of the received audio and/or video interface is an interactive audio video response (IVR or IVVR) technology.
步骤S1014,多点音频视频通信多点控制处理单元收到所有与会终端提交的投票、打分或表决数据,提交给多点音频视频通信业务管理系统进行决策。In step S1014, the multipoint audio and video communication multipoint control processing unit receives the voting, scoring or voting data submitted by all the participating terminals, and submits the data to the multipoint audio and video communication service management system for decision.
步骤S1016,多点音频视频通信业务管理系统将所有的投票、打分或表决数据进行汇总,根据用户意志进行决策,并将汇总结果发给多点音频视频通信多点控制处理单元。
In step S1016, the multi-point audio video communication service management system aggregates all the voting, scoring or voting data, makes a decision according to the user's will, and sends the summary result to the multi-point audio and video communication multi-point control processing unit.
步骤S1018,多点音频视频通信多点控制处理单元将投票、打分或表决的结果合成文字、音频、视频发送给与会终端。其中,对于音频会议和只支持音频能力的终端,则只会发送音频给对应的与会终端告知其结果。In step S1018, the multi-point audio video communication multi-point control processing unit sends the result of voting, scoring or voting to the participating terminal by synthesizing text, audio and video. Among them, for audio conferences and terminals that only support audio capabilities, only audio is sent to the corresponding conference terminal to inform the result.
步骤S1020,投票、打分或表决流程结束之后会议保持普通模式继续,只要会议中需要进行投票打分的过程就进入到上述流程进行,直到会议结束。In step S1020, after the voting, scoring or voting process ends, the conference continues in the normal mode. As long as the voting process needs to be performed in the conference, the process proceeds to the above process until the conference ends.
综上所述,可知本发明实施例所述方案,基于音频视频会议系统的原有组网,在会议、面试、远程互动等有关的多媒体远程现场接入的场景中,帮助用户进行投票、打分、发表自己的观点,实现了音频视频会议中的互动。In summary, it can be seen that the solution according to the embodiment of the present invention is based on the original networking of the audio video conferencing system, and helps the user to vote and score in the scene of multimedia remote field access such as conference, interview, and remote interaction. And express their own opinions and realize the interaction in audio and video conferences.
本发明实施例所述方案,实现简单,完全不需要改变用户当前的会议系统组网,也不需增加如传统的第三方表决系统,更不需要人工的在会议中统计和核算与会成员的投票打分结果,系统采用互动式音频视频应答方式(IVR、IVVR)让用户方便的进行投票打分,系统根据设置自动的统计和计算结果并以多媒体的方式展示给用户。The solution in the embodiment of the present invention is simple to implement, and does not need to change the current conference system networking of the user, and does not need to add a traditional third-party voting system, and does not need to manually count and vote for the members in the conference. As a result of the scoring, the system uses interactive audio and video response (IVR, IVVR) to allow users to easily vote and score. The system displays the automatic statistics and calculation results and presents them to the user in multimedia mode.
可选实施方式三Alternative embodiment three
本发明实施例提供一种多点音频视频通信多点控制处理单元,如图11所示,包括:An embodiment of the present invention provides a multipoint audio and video communication multipoint control processing unit, as shown in FIG.
互动发起模块1110,设置为在多点音频视频通信中,将与本次互动对应的设置为指引用户输入的音频和/或视频界面发送至各通信终端;其中,各通信终端均支持用户输入通道;The interaction initiation module 1110 is configured to, in the multi-point audio video communication, send an audio and/or video interface corresponding to the current interaction to guide the user input to each communication terminal; wherein each communication terminal supports the user input channel ;
互动处理模块1120,设置为接收各通信终端根据音频和/或视频界面的指引而反馈的互动信息,并将所述互动信息发送至多点音频视频通信业务管理系统,以及接收多点音频视频通信业务管理系统利用所述互动信息进行统计得到的统计结果,将所述统计结果发送至各通信终端。The interaction processing module 1120 is configured to receive interaction information that each communication terminal feeds back according to the instruction of the audio and/or video interface, and send the interaction information to the multi-point audio video communication service management system, and receive the multi-point audio video communication service. The management system uses the interaction information to perform statistics and obtains the statistical result, and sends the statistical result to each communication terminal.
基于上述结构框架及实施原理,下面给出在上述结构下的几个具体及优选实施方式,用以细化和优化本发明所述多点音频视频通信多点控制处理单元的功能,具体涉及如下内容:Based on the above structural framework and implementation principle, several specific and preferred embodiments under the above structure are given below to refine and optimize the functions of the multipoint audio and video communication multipoint control processing unit of the present invention, specifically the following content:
互动发起模块1110,通过获取预先存储的与本次互动对应的音频文件和图片文件,并对音频文件和图片文件分别进行合成,形成所述用以指引用户输入的音频和视频界面。
The interaction initiation module 1110 forms an audio and video interface for guiding user input by acquiring an audio file and a picture file corresponding to the current interaction and synthesizing the audio file and the picture file respectively.
互动发起模块1110,获取各通信终端支持的信息类型,并基于该信息类型向各通信终端发送音频和/或视频界面;或者,获取各通信终端支持的信息类型,并检测各通信终端是否对接收信息的类型进行设定,若进行了设定,则检测对应通信终端设定的信息类型是否属于其支持的信息类型,若是,则基于设定的信息类型向对应通信终端发送音频或视频界面;否则,基于通信终端支持的信息类型向对应通信终端发送音频或视频界面;若未进行设定,则基于各通信终端支持的信息类型,向各通信终端发送音频和/或视频界面。The interaction initiation module 1110 acquires an information type supported by each communication terminal, and sends an audio and/or video interface to each communication terminal based on the information type; or acquires an information type supported by each communication terminal, and detects whether each communication terminal receives the information. Setting the type of information, if it is set, detecting whether the type of information set by the corresponding communication terminal belongs to the type of information it supports, and if so, sending an audio or video interface to the corresponding communication terminal based on the set information type; Otherwise, an audio or video interface is sent to the corresponding communication terminal based on the type of information supported by the communication terminal; if not, the audio and/or video interface is transmitted to each communication terminal based on the type of information supported by each communication terminal.
互动处理模块1120,将统计结果合成文字、音频和视频,并基于在先获取的各通信终端支持的信息类型,向各通信终端发送文字、音频和/或视频的统计结果。The interaction processing module 1120 synthesizes the statistical result into text, audio, and video, and transmits the statistical result of the text, audio, and/or video to each communication terminal based on the type of information supported by each of the previously acquired communication terminals.
可选地,本实施例中为了将互动过程与现有的会议过程进行有效结合,节省网络资源,将多点音频视频通信多点控制处理单元与各通信终端间交互的信息与会议中的数据、音频和视频码流一起编码为媒体流格式后进行交互。具体表现为:Optionally, in this embodiment, in order to effectively combine the interaction process with the existing conference process, the network resource is saved, and the information of the multipoint audio and video communication multipoint control processing unit and each communication terminal and the data in the conference are exchanged. The audio and video streams are encoded together into a media stream format for interaction. The specific performance is:
互动发起模块1110,将设置为指引用户输入的音频和/或视频界面与当前通信中的数据、音频及视频码流一同编码为媒体流格式后发送至各通信终端;The interaction initiation module 1110, the audio and/or video interface set to direct the user input is encoded into the media stream format together with the data, audio and video code streams in the current communication, and then sent to each communication terminal;
互动处理模块1120,将统计结果与当前通信中的数据、音频及视频码流一同编码为媒体流格式后发送至各通信终端。The interaction processing module 1120 encodes the statistical result together with the data, audio and video code streams in the current communication into a media stream format, and then sends the result to each communication terminal.
综上所述,可知本实施例所述多点音频视频通信多点控制处理单元在不改变用户的会议系统组网、不改变用户对会议系统的原始使用习惯的基础上,通过本实施例所阐述的多点音频视频通信多点控制处理单元,能使得通信终端用户能够有如临现场一样进行投票、打分或表决的互动体验。In summary, it can be seen that the multi-point audio and video communication multi-point control processing unit in the embodiment does not change the user's conference system networking, and does not change the user's original usage habits of the conference system. The multi-point audio and video communication multi-point control processing unit described can enable the communication terminal user to have an interactive experience of voting, scoring or voting as if it were on-site.
可选实施方式四Optional implementation four
本发明实施例提供一种远程教育中的远程互动系统,如图12所示,包括:多点音频视频通信业务管理系统1210、多点音频视频通信终端1220、以及可选实施方式三的多点音频视频通信多点控制处理单元1230;其中:The embodiment of the present invention provides a remote interaction system in distance education, as shown in FIG. 12, including: a multi-point audio and video communication service management system 1210, a multi-point audio and video communication terminal 1220, and multiple points of the optional embodiment 3. Audio and video communication multipoint control processing unit 1230; wherein:
远程多点音频视频通信终端1220,设置为在接收到多点音频视频通信多点控制处理单元1230发送的与本次互动对应的设置为指引用户输入的音频和/或视频界面后,根据音频和/或视频界面的指引进行互动操作,并将操作得到的互动信息发送至多点音频视频通信多点控制处理单元1230;
The remote multi-point audio video communication terminal 1220 is configured to, after receiving the audio and/or video interface corresponding to the current interaction sent by the multi-point audio video communication multi-point control processing unit 1230 to guide the user input, according to the audio and / or the interface of the video interface to perform an interactive operation, and the interactive information obtained by the operation is sent to the multi-point audio video communication multi-point control processing unit 1230;
多点音频视频通信业务管理系统1210,设置为接收多点音频视频通信多点控制处理单元630发送的各远程终端根据音频和/或视频界面的指引而反馈的互动信息,并对所述互动信息进行统计后,将统计结果发送至多点音频视频通信多点控制处理单元1230。The multi-point audio video communication service management system 1210 is configured to receive the interaction information that each remote terminal sends back according to the guidance of the audio and/or video interface sent by the multi-point audio and video communication multi-point control processing unit 630, and the interaction information is After the statistics are performed, the statistical results are sent to the multipoint audio video communication multipoint control processing unit 1230.
进一步地,由于实施例二中已经对多点音频视频通信多点控制处理单元的具体组成以及实施原理进行了详细阐述,所以在本实施例中对其结构及功能不作赘述。Further, since the specific composition and implementation principle of the multi-point audio video communication multi-point control processing unit have been described in detail in the second embodiment, the structure and function of the multi-point audio and video communication multi-point control processing unit are not described in detail in this embodiment.
综上所述,可知本实施例所述系统在不改变用户的会议系统组网、不改变用户对会议系统的原始使用习惯的基础上,通过本实施例所阐述的多点音频视频通信多点控制处理单元,能使得与会终端用户能够有如临现场一样进行投票、打分或表决的互动体验。In summary, it can be seen that the system in this embodiment does not change the conference system of the user, does not change the original usage habits of the user to the conference system, and uses the multipoint audio and video communication described in this embodiment. The control processing unit enables the participating end users to have an interactive experience of voting, scoring or voting as if they were on the spot.
显然,本领域的技术人员应该明白,上述的本发明的各模块或各步骤可以用通用的计算装置来实现,它们可以集中在单个的计算装置上,或者分布在多个计算装置所组成的网络上,可选地,它们可以用计算装置可执行的程序代码来实现,从而,可以将它们存储在存储装置中由计算装置来执行,并且在某些情况下,可以以不同于此处的顺序执行所示出或描述的步骤,或者将它们分别制作成各个集成电路模块,或者将它们中的多个模块或步骤制作成单个集成电路模块来实现。这样,本发明不限制于任何特定的硬件和软件结合。It will be apparent to those skilled in the art that the various modules or steps of the present invention described above can be implemented by a general-purpose computing device that can be centralized on a single computing device or distributed across a network of multiple computing devices. Alternatively, they may be implemented by program code executable by the computing device such that they may be stored in the storage device by the computing device and, in some cases, may be different from the order herein. The steps shown or described are performed, or they are separately fabricated into individual integrated circuit modules, or a plurality of modules or steps thereof are fabricated as a single integrated circuit module. Thus, the invention is not limited to any specific combination of hardware and software.
以上所述仅为本发明的优选实施例而已,并不用于限制本发明,对于本领域的技术人员来说,本发明可以有各种更改和变化。凡在本发明的精神和原则之内,所作的任何修改、等同替换、改进等,均应包含在本发明的保护范围之内。The above description is only the preferred embodiment of the present invention, and is not intended to limit the present invention, and various modifications and changes can be made to the present invention. Any modifications, equivalent substitutions, improvements, etc. made within the spirit and scope of the present invention are intended to be included within the scope of the present invention.
如上所述,本发明实施例提供的一种多点音频视频通信中远程互动的方法及设备,具有以下有益效果:解决相关技术中在多点音频视频通信系统中引入第三方交互系统实现交互所导致的缺陷,在多点音频视频通信系统中实现了远程交互,从而无需引入第三方设备,提高了可操作性并降低了成本。
As described above, the method and device for remote interaction in multi-point audio and video communication provided by the embodiments of the present invention have the following beneficial effects: solving the related art, introducing a third-party interactive system to implement an interaction in a multi-point audio and video communication system The resulting defect enables remote interaction in a multi-point audio video communication system, eliminating the need to introduce third-party devices, increasing operability and reducing costs.
Claims (31)
- 一种音频视频会议中的远程互动方法,包括:A remote interaction method in an audio video conference, including:多点控制单元MCU将与本次互动对应的设置为指引用户输入的音频和/或视频界面发送至各与会终端;其中,各与会终端均支持用户输入通道;The multi-point control unit MCU sends an audio and/or video interface corresponding to the current interaction to guide the user input to each participating terminal; wherein each participating terminal supports the user input channel;会议业务管理系统统计各与会终端根据音频和/或视频界面的指引而反馈的互动信息;The conference business management system collects interactive information that each participating terminal feeds back according to the guidance of the audio and/or video interface;MCU将会议业务管理系统得到的统计结果发送至各与会终端。The MCU sends the statistical results obtained by the conference service management system to each participating terminal.
- 根据权利要求1所述的方法,其中,所述MCU将所述音频和/或视频界面发送至各与会终端前包括:The method of claim 1, wherein the MCU sends the audio and/or video interface to each of the participating terminals before:MCU获取预先存储的与本次互动对应的音频文件和图片文件,并对音频文件和图片文件分别进行合成,形成用以指引用户输入的音频和视频界面。The MCU obtains the pre-stored audio file and picture file corresponding to the interaction, and synthesizes the audio file and the picture file separately to form an audio and video interface for guiding user input.
- 根据权利要求1所述的方法,其中,所述MCU将指引用户输入的音频和/或视频界面发送至各与会终端,具体包括:The method of claim 1, wherein the MCU sends an audio and/or video interface that directs user input to each of the participating terminals, including:MCU获取各与会终端支持的信息类型,并基于该信息类型向各与会终端发送音频和/或视频界面。The MCU obtains the type of information supported by each participating terminal, and sends an audio and/or video interface to each participating terminal based on the type of the information.
- 根据权利要求1所述的方法,其中,所述MCU将指引用户输入的音频和/或视频界面发送至各与会终端,具体包括:The method of claim 1, wherein the MCU sends an audio and/or video interface that directs user input to each of the participating terminals, including:MCU获取各与会终端支持的信息类型,并检测各与会终端是否对接收信息的类型进行设定;The MCU obtains the type of information supported by each participating terminal, and detects whether each participating terminal sets the type of the received information;若进行了设定,则检测对应与会终端设定的信息类型是否属于其支持的信息类型,若是,则基于设定的信息类型向对应与会终端发送音频或视频界面;否则,基于与会终端支持的信息类型向对应与会终端发送音频或视频界面;If the setting is made, it is detected whether the information type set by the corresponding conference terminal belongs to the information type supported by the conference terminal, and if yes, the audio or video interface is sent to the corresponding conference terminal based on the set information type; otherwise, based on the conference terminal support The information type sends an audio or video interface to the corresponding conference terminal;若未进行设定,则基于各与会终端支持的信息类型,向各与会终端发送音频和/或视频界面。If no setting is made, an audio and/or video interface is transmitted to each participating terminal based on the type of information supported by each participating terminal.
- 根据权利要求1所述的方法,其中,所述MCU将统计结果发送至各与会终端,具体包括: The method of claim 1, wherein the MCU sends the statistical result to each participant terminal, specifically:MCU将统计结果合成文字、音频和视频,并基于在先获取的各与会终端支持的信息类型,向各与会终端发送文字、音频和/或视频的统计结果。The MCU synthesizes the statistical results into text, audio, and video, and sends statistical results of text, audio, and/or video to each participating terminal based on the type of information previously supported by each participating terminal.
- 根据权利要求3或4或5所述的方法,其中,所述MCU获取各与会终端支持的信息类型的方式包括:The method according to claim 3 or 4 or 5, wherein the manner in which the MCU obtains the type of information supported by each participating terminal comprises:MCU获取为各与会终端建立的逻辑通道,根据逻辑通道的类型确定与会终端支持的信息类型;所述逻辑通道的类型包括:音频、视频和数据。The MCU obtains a logical channel established for each participating terminal, and determines the type of information supported by the participating terminal according to the type of the logical channel; the types of the logical channel include: audio, video, and data.
- 根据权利要求1至5任一项所述的方法,其中,The method according to any one of claims 1 to 5, wherein所述MCU将指引用户输入的音频和/或视频界面发送至各与会终端,进一步包括:MCU将用于指引用户输入的音频和/或视频界面与当前会议中的数据、音频及视频码流一同编码为媒体流格式后发送至各与会终端;The MCU sends an audio and/or video interface directed to the user to each participating terminal, further comprising: the MCU will use the audio and/or video interface for directing user input along with the data, audio and video streams in the current conference. Encoded into a media stream format and sent to each participating terminal;所述MCU将统计结果发送至各与会终端,进一步包括:MCU将统计结果与当前会议中的数据、音频及视频码流一同编码为媒体流格式后发送至各与会终端。The MCU sends the statistical result to each participant terminal, and further includes: the MCU encodes the statistical result together with the data, audio, and video code streams in the current conference into a media stream format, and then sends the result to the conference terminal.
- 根据权利要求1至5任一项所述的方法,其中,所述互动的类型包括:投票、打分和表决。The method of any one of claims 1 to 5, wherein the type of interaction comprises: voting, scoring, and voting.
- 一种多点控制单元MCU,包括:A multipoint control unit MCU comprising:互动发起模块,设置为在视频会议中,将与本次互动对应的设置为指引用户输入的音频和/或视频界面发送至各与会终端;其中,各与会终端均支持用户输入通道;The interactive initiating module is configured to send, in the video conference, an audio and/or video interface corresponding to the current interaction, which is set to guide the user input, to each participating terminal; wherein each participating terminal supports the user input channel;互动处理模块,设置为接收各与会终端根据音频和/或视频界面的指引而反馈的互动信息,并将所述互动信息发送至会议业务管理系统,以及接收会议业务管理系统利用所述互动信息进行统计得到的统计结果,将所述统计结果发送至各与会终端。The interaction processing module is configured to receive interaction information fed back by each participating terminal according to the instruction of the audio and/or video interface, and send the interaction information to the conference service management system, and receive the conference service management system to use the interaction information to perform The statistical result obtained by the statistics is sent to each participating terminal.
- 根据权利要求9所述的MCU,其中,所述互动发起模块,还设置为获取预先存储的与本次互动对应的音频文件和图片文件,并对音频文件和图片文件分别进行合成,形成用以指引用户输入的音频和视频界面。The MCU of claim 9, wherein the interaction initiation module is further configured to acquire a pre-stored audio file and a picture file corresponding to the current interaction, and separately synthesize the audio file and the picture file to form a The audio and video interface that guides the user input.
- 根据权利要求9所述的MCU,其中,所述互动发起模块,具体设置为获取各与会终端支持的信息类型,并基于该信息类型向各与会终端发送音频和/或视频界面。 The MCU according to claim 9, wherein the interaction initiation module is specifically configured to acquire an information type supported by each participant terminal, and send an audio and/or video interface to each participant terminal based on the information type.
- 根据权利要求9所述的MCU,其中,所述互动发起模块,具体设置为获取各与会终端支持的信息类型,并检测各与会终端是否对接收信息的类型进行设定,若进行了设定,则检测对应与会终端设定的信息类型是否属于其支持的信息类型,若是,则基于设定的信息类型向对应与会终端发送音频或视频界面;否则,基于与会终端支持的信息类型向对应与会终端发送音频或视频界面;若未进行设定,则基于各与会终端支持的信息类型,向各与会终端发送音频和/或视频界面。The MCU according to claim 9, wherein the interaction initiation module is specifically configured to acquire an information type supported by each participant terminal, and detect whether each participant terminal sets a type of the received information, and if set, Then, it is detected whether the information type set by the corresponding conference terminal belongs to the information type supported by the conference terminal, and if yes, the audio or video interface is sent to the corresponding conference terminal based on the set information type; otherwise, the information type supported by the conference terminal is used to correspond to the conference terminal. Send an audio or video interface; if not set, send an audio and/or video interface to each participating terminal based on the type of information supported by each participating terminal.
- 根据权利要求9所述的MCU,其中,The MCU according to claim 9, wherein所述互动处理模块,进一步设置为将统计结果合成文字、音频和视频,并基于在先获取的各与会终端支持的信息类型,向各与会终端发送文字、音频和/或视频的统计结果。The interaction processing module is further configured to synthesize the statistical result into text, audio, and video, and send the statistical result of the text, audio, and/or video to each participating terminal based on the type of information supported by each of the participating terminals.
- 根据权利要求9至13任一项所述的MCU,其中,The MCU according to any one of claims 9 to 13, wherein所述互动发起模块,进一步设置为将设置为指引用户输入的音频和/或视频界面与当前会议中的数据、音频及视频码流一同编码为媒体流格式后发送至各与会终端;The interaction initiation module is further configured to encode the audio and/or video interface set to direct user input with the data, audio and video code streams in the current conference as a media stream format, and then send the format to the conference terminal;所述互动处理模块,进一步设置为将统计结果与当前会议中的数据、音频及视频码流一同编码为媒体流格式后发送至各与会终端。The interaction processing module is further configured to encode the statistical result together with the data, audio, and video code streams in the current conference into a media stream format, and then send the result to each conference terminal.
- 一种音频视频会议中的远程互动系统,包括:会议业务管理系统、与会终端、以及权利要求9至14任意一项所述的MCU;A remote interactive system in an audio video conference, comprising: a conference service management system, a conference terminal, and the MCU according to any one of claims 9 to 14;所述与会终端,设置为在接收到所述MCU发送的与本次互动对应的设置为指引用户输入的音频和/或视频界面后,根据音频和/或视频界面的指引进行互动操作,并将操作得到的互动信息发送至所述MCU;The participant terminal is configured to perform an interactive operation according to an instruction of the audio and/or video interface after receiving an audio and/or video interface corresponding to the current interaction and being set to guide the user input, and The interaction information obtained by the operation is sent to the MCU;所述会议业务管理系统,设置为接收所述MCU发送的各与会终端根据音频和/或视频界面的指引而反馈的互动信息,并对所述互动信息进行统计后,将统计结果发送至所述MCU。The conference service management system is configured to receive interaction information that is fed back by the MCUs according to the guidance of the audio and/or video interface, and collect statistics on the interaction information, and send the statistics to the MCU.
- 一种多点音频视频通信中远程互动方法,包括:A remote interaction method in multi-point audio video communication, comprising:多点音频视频通信多点控制处理单元将与本次互动对应的设置为指引用户输入的音频和/或视频界面发送至各与会终端;其中,各与会终端均支持用户输入通道; The multi-point audio and video communication multi-point control processing unit sends an audio and/or video interface corresponding to the current interaction to guide the user input to each participating terminal; wherein each participating terminal supports the user input channel;所述多点音频视频通信多点控制处理单元接收各与会终端根据音频和/或视频界面的指引而反馈的互动信息;The multi-point audio video communication multi-point control processing unit receives interaction information fed back by each participating terminal according to the guidance of the audio and/or video interface;所述多点音频视频通信多点控制处理单元将所述互动信息发送至多点音频视频通信业务管理系统;The multipoint audio video communication multipoint control processing unit transmits the interactive information to a multipoint audio video communication service management system;所述多点音频视频通信多点控制处理单元接收所述多点音频视频通信业务管理系统利用所述互动信息进行统计得到的统计结果;The multi-point audio video communication multi-point control processing unit receives the statistical result obtained by the multi-point audio video communication service management system using the interaction information to perform statistics;所述多点音频视频通信多点控制处理单元将所述统计结果发送至各与会终端。The multipoint audio video communication multipoint control processing unit transmits the statistical result to each participating terminal.
- 根据权利要求16所述的方法,其中,所述多点音频视频通信多点控制处理单元将所述音频和/或视频界面发送至各与会终端之前,还包括:The method according to claim 16, wherein the multi-point audio video communication multi-point control processing unit sends the audio and/or video interface to each of the participating terminals, and further includes:多点音频视频通信多点控制处理单元获取预先存储的与本次互动对应的音频文件和图片文件,并对音频文件和图片文件分别进行合成,形成用以指引用户输入的音频和视频界面。The multi-point audio video communication multi-point control processing unit acquires the pre-stored audio file and picture file corresponding to the current interaction, and separately synthesizes the audio file and the picture file to form an audio and video interface for guiding user input.
- 根据权利要求16所述的方法,其中,所述多点音频视频通信多点控制处理单元将指引用户输入的音频和/或视频界面发送至各与会终端,包括:The method of claim 16, wherein the multipoint audio video communication multipoint control processing unit transmits an audio and/or video interface directing user input to each of the participating terminals, including:多点音频视频通信多点控制处理单元获取各与会终端支持的信息类型;The multi-point audio and video communication multi-point control processing unit acquires the type of information supported by each participating terminal;所述多点音频视频多点控制处理单元基于各与会终端支持的信息类型向各将指引用户输入的音频和/或视频界面发送至各与会终端。The multi-point audio video multi-point control processing unit transmits an audio and/or video interface that will guide the user input to each participating terminal based on the type of information supported by each participating terminal.
- 根据权利要求16所述的方法,其中,所述多点音频视频通信多点控制处理单元将指引用户输入的音频和/或视频界面发送至各与会终端,包括:The method of claim 16, wherein the multipoint audio video communication multipoint control processing unit transmits an audio and/or video interface directing user input to each of the participating terminals, including:多点音频视频通信多点控制处理单元获取各与会终端支持的信息类型,并检测各与会终端是否对接收信息的类型进行设定;The multi-point audio and video communication multi-point control processing unit acquires the type of information supported by each participating terminal, and detects whether each participating terminal sets the type of the received information;当与会终端设定了接收信息的类型时,检测对应与会终端设定的信息类型是否属于其支持的信息类型,若是,基于设定的信息类型向对应与会终端发送音频和/或视频界面;若否,基于与会终端支持的信息类型向对应与会终端发送音频和/或视频界面;和/或When the participant terminal sets the type of the received information, it detects whether the type of information set by the corresponding conference terminal belongs to the type of information it supports, and if so, sends an audio and/or video interface to the corresponding conference terminal based on the set information type; No, sending an audio and/or video interface to the corresponding participant terminal based on the type of information supported by the participant terminal; and/or当与会终端未设定接收信息的类型时,基于各与会终端支持的信息类型,向各与会终端发送音频和/或视频界面。 When the participating terminal does not set the type of the received information, the audio and/or video interface is sent to each participating terminal based on the type of information supported by each participating terminal.
- 根据权利要求16所述的方法,其中,所述多点音频视频通信多点控制处理单元将统计结果发送至各与会终端,包括:The method according to claim 16, wherein the multi-point audio video communication multi-point control processing unit sends the statistical result to each participating terminal, including:多点音频视频通信多点控制处理单元将统计结果合成文字、音频和视频,并基于在先获取的各与会终端支持的信息类型,向各与会终端发送文字、音频和/或视频的统计结果。The multi-point audio video communication multi-point control processing unit synthesizes the statistical result into text, audio and video, and transmits the statistical result of text, audio and/or video to each participating terminal based on the type of information supported by each participating terminal.
- 根据权利要求18至20中任一项所述的方法,其中,所述多点音频视频通信多点控制处理单元获取各与会终端支持的信息类型的方式包括:The method according to any one of claims 18 to 20, wherein the manner in which the multipoint audio video communication multipoint control processing unit acquires the type of information supported by each participating terminal comprises:多点音频视频通信多点控制处理单元获取为各与会终端建立的逻辑通道,根据逻辑通道的类型确定与会终端支持的信息类型;所述逻辑通道的类型包括:音频、视频和数据。The multi-point audio and video communication multi-point control processing unit acquires a logical channel established for each participating terminal, and determines the type of information supported by the participating terminal according to the type of the logical channel; the types of the logical channel include: audio, video, and data.
- 根据权利要求16至20任一项所述的方法,其中,The method according to any one of claims 16 to 20, wherein所述多点音频视频通信多点控制处理单元将指引用户输入的音频和/或视频界面发送至各与会终端,包括:多点音频视频通信多点控制处理单元将用于指引用户输入的音频和/或视频界面与当前多点音频视频通信中的数据、音频及视频码流一同编码为媒体流格式后发送至各与会终端;The multi-point audio video communication multi-point control processing unit transmits an audio and/or video interface that directs user input to each participating terminal, including: a multi-point audio video communication multi-point control processing unit that will guide the user input audio and And/or the video interface is encoded into the media stream format together with the data, audio and video code streams in the current multi-point audio and video communication, and then sent to each participating terminal;所述多点音频视频通信多点控制处理单元将统计结果发送至各与会终端,包括:多点音频视频通信多点控制处理单元将统计结果与当前会议中的数据、音频及视频码流一同编码为媒体流格式后发送至各与会终端。The multi-point audio video communication multi-point control processing unit sends the statistical result to each participating terminal, including: the multi-point audio video communication multi-point control processing unit encodes the statistical result together with the data, audio and video code streams in the current conference. It is sent to each participant terminal after being formatted as a media stream.
- 根据权利要求16至20任一项所述的方法,其中,所述互动的类型包括:投票、打分和表决。The method of any one of claims 16 to 20, wherein the type of interaction comprises: voting, scoring, and voting.
- 根据权利要求16至20任一项所述的方法,其中,所述多点音频视频通信多点控制处理单元包括多点控制单元MCU。The method according to any one of claims 16 to 20, wherein the multipoint audio video communication multipoint control processing unit comprises a multipoint control unit MCU.
- 一种多点音频视频通信多点控制处理单元,包括:A multi-point audio and video communication multi-point control processing unit includes:互动发起模块,设置为在多点音频视频通信中,将与本次互动对应的设置为指引用户输入的音频和/或视频界面发送至各与会终端;其中,各与会终端均支持用户输入通道;The interactive initiating module is configured to send, in the multi-point audio video communication, an audio and/or video interface corresponding to the current interaction, which is set to guide the user input, to each participating terminal; wherein each participating terminal supports the user input channel;互动处理模块,设置为接收各与会终端根据音频和/或视频界面的指引而反馈的互动信息,并将所述互动信息发送至多点音频视频通信业务管理系统,以 及接收会议业务管理系统利用所述互动信息进行统计得到的统计结果,将所述统计结果发送至各与会终端。The interaction processing module is configured to receive the interaction information fed back by each participating terminal according to the instruction of the audio and/or video interface, and send the interaction information to the multi-point audio video communication service management system, And receiving the statistical result obtained by the conference service management system by using the interaction information, and sending the statistical result to each conference terminal.
- 根据权利要求25所述的多点音频视频通信多点控制处理单元,其中,所述互动发起模块,还设置为获取预先存储的与本次互动对应的音频文件和图片文件,并对音频文件和图片文件分别进行合成,形成用以指引用户输入的音频和视频界面。The multipoint audio video communication multipoint control processing unit of claim 25, wherein the interaction initiation module is further configured to acquire a pre-stored audio file and a picture file corresponding to the current interaction, and to the audio file and The image files are separately synthesized to form an audio and video interface for directing user input.
- 根据权利要求25所述的多点音频视频通信多点控制处理单元,其中,所述互动发起模块,设置为获取各与会终端支持的信息类型,并基于该信息类型向各与会终端发送音频和/或视频界面。The multipoint audio video communication multipoint control processing unit according to claim 25, wherein the interaction initiation module is configured to acquire an information type supported by each participant terminal, and send audio and/or to each participant terminal based on the information type. Or video interface.
- 根据权利要求25所述的多点音频视频通信多点控制处理单元,其中,所述互动发起模块,设置为获取各与会终端支持的信息类型,并检测各与会终端是否对接收信息的类型进行设定,若进行了设定,则检测对应与会终端设定的信息类型是否属于其支持的信息类型,若是,则基于设定的信息类型向对应与会终端发送音频或视频界面;否则,基于与会终端支持的信息类型向对应与会终端发送音频或视频界面;若未进行设定,则基于各与会终端支持的信息类型,向各与会终端发送音频和/或视频界面。The multipoint audio and video communication multipoint control processing unit according to claim 25, wherein the interaction initiation module is configured to acquire information types supported by each participant terminal, and detect whether each participant terminal sets the type of the received information. If the setting is made, it is detected whether the information type set by the corresponding conference terminal belongs to the information type supported by the conference terminal, and if so, the audio or video interface is sent to the corresponding conference terminal based on the set information type; otherwise, based on the conference terminal The supported information type sends an audio or video interface to the corresponding participant terminal; if not, the audio and/or video interface is sent to each participating terminal based on the type of information supported by each participating terminal.
- 根据权利要求25所述的多点音频视频通信多点控制处理单元,其中,A multipoint audio video communication multipoint control processing unit according to claim 25, wherein所述互动处理模块,进一步设置为将统计结果合成文字、音频和视频,并基于在先获取的各与会终端支持的信息类型,向各与会终端发送文字、音频和/或视频的统计结果。The interaction processing module is further configured to synthesize the statistical result into text, audio, and video, and send the statistical result of the text, audio, and/or video to each participating terminal based on the type of information supported by each of the participating terminals.
- 根据权利要求25至29任一项所述的多点音频视频通信多点控制处理单元,其中,The multipoint audio video communication multipoint control processing unit according to any one of claims 25 to 29, wherein所述互动发起模块,进一步设置为将设置为指引用户输入的音频和/或视频界面与当前会议中的数据、音频及视频码流一同编码为媒体流格式后发送至各与会终端;The interaction initiation module is further configured to encode the audio and/or video interface set to direct user input with the data, audio and video code streams in the current conference as a media stream format, and then send the format to the conference terminal;所述互动处理模块,进一步设置为将统计结果与当前会议中的数据、音频及视频码流一同编码为媒体流格式后发送至各与会终端。The interaction processing module is further configured to encode the statistical result together with the data, audio, and video code streams in the current conference into a media stream format, and then send the result to each conference terminal.
- 根据权利要求25至29任一项所述的多点音频视频通信多点控制处理单元,其中,所述多点音频视频通信包括语音视频会议;所述多点音频视频通信多点控制处理单元包括多点控制单元MCU。 The multipoint audio video communication multipoint control processing unit according to any one of claims 25 to 29, wherein said multipoint audio video communication comprises a voice video conference; said multipoint audio video communication multipoint control processing unit comprises Multipoint control unit MCU.
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201410816690.8A CN105791740A (en) | 2014-12-24 | 2014-12-24 | Remote interaction method and device in multipoint audio-video communication |
CN201410816690.8 | 2014-12-24 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2016101623A1 true WO2016101623A1 (en) | 2016-06-30 |
Family
ID=56149165
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/CN2015/086271 WO2016101623A1 (en) | 2014-12-24 | 2015-08-06 | Remote interaction method and device in multipoint audio and video communication |
Country Status (2)
Country | Link |
---|---|
CN (1) | CN105791740A (en) |
WO (1) | WO2016101623A1 (en) |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN112118440A (en) * | 2020-09-16 | 2020-12-22 | 苏州科达科技股份有限公司 | Conference polling method, electronic device and storage medium |
US20220360896A1 (en) * | 2021-05-06 | 2022-11-10 | Facebook Technologies, Llc | Modular conferencing system |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN1928859A (en) * | 2005-09-08 | 2007-03-14 | 年代数位媒体股份有限公司 | Interactive multimedia interface and display |
CN101335868A (en) * | 2008-05-28 | 2008-12-31 | 深圳华为通信技术有限公司 | Voting method, apparatus for meeting and meeting management system |
CN101631225A (en) * | 2009-08-03 | 2010-01-20 | 深圳华为通信技术有限公司 | Conference voting method, conference voting device and conference voting system |
US20140122588A1 (en) * | 2012-10-31 | 2014-05-01 | Alain Nimri | Automatic Notification of Audience Boredom during Meetings and Conferences |
Family Cites Families (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN201039361Y (en) * | 2007-03-23 | 2008-03-19 | 天津市巨展机房装备有限公司 | Intelligent electronic meeting room |
CN101621657B (en) * | 2009-08-06 | 2011-01-19 | 中兴通讯股份有限公司 | Wireless video conference system and voting method |
JP2011180948A (en) * | 2010-03-03 | 2011-09-15 | Brother Industries Ltd | Terminal device, conference server and processing program |
-
2014
- 2014-12-24 CN CN201410816690.8A patent/CN105791740A/en active Pending
-
2015
- 2015-08-06 WO PCT/CN2015/086271 patent/WO2016101623A1/en active Application Filing
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN1928859A (en) * | 2005-09-08 | 2007-03-14 | 年代数位媒体股份有限公司 | Interactive multimedia interface and display |
CN101335868A (en) * | 2008-05-28 | 2008-12-31 | 深圳华为通信技术有限公司 | Voting method, apparatus for meeting and meeting management system |
CN101631225A (en) * | 2009-08-03 | 2010-01-20 | 深圳华为通信技术有限公司 | Conference voting method, conference voting device and conference voting system |
US20140122588A1 (en) * | 2012-10-31 | 2014-05-01 | Alain Nimri | Automatic Notification of Audience Boredom during Meetings and Conferences |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN112118440A (en) * | 2020-09-16 | 2020-12-22 | 苏州科达科技股份有限公司 | Conference polling method, electronic device and storage medium |
US20220360896A1 (en) * | 2021-05-06 | 2022-11-10 | Facebook Technologies, Llc | Modular conferencing system |
Also Published As
Publication number | Publication date |
---|---|
CN105791740A (en) | 2016-07-20 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US9024997B2 (en) | Virtual presence via mobile | |
US8531502B2 (en) | Method, device and system for presenting virtual conference site of video conference | |
KR100880150B1 (en) | Multi-point video conference system and media processing method thereof | |
US7929011B2 (en) | Method and system for handling video signals of conference | |
US9172912B2 (en) | Telepresence method, terminal and system | |
US7996540B2 (en) | Method and system for replacing media stream in a communication process of a terminal | |
CN101478642A (en) | Multi-picture mixing method and apparatus for video meeting system | |
CN105763832A (en) | Video interaction and control method and device | |
WO2015154608A1 (en) | Method, system and apparatus for sharing video conference material | |
WO2016026336A1 (en) | Remote interaction method and system in audio/video conference and mcu | |
WO2016082577A1 (en) | Video conference processing method and device | |
US9369671B2 (en) | Method and system for handling content in videoconferencing | |
WO2015003532A1 (en) | Multimedia conferencing establishment method, device and system | |
CN111246150A (en) | Control method, system, server and readable storage medium for video conference | |
CN107181926A (en) | A kind of communication means, device and server | |
WO2016206471A1 (en) | Multimedia service processing method, system and device | |
JP2016192610A (en) | Remote conference program, controller and remote conference method | |
WO2016101623A1 (en) | Remote interaction method and device in multipoint audio and video communication | |
US20200329083A1 (en) | Video conference transmission method and apparatus, and mcu | |
EP2637404A1 (en) | Method and device for controlling multiple auxiliary streams, and network system | |
KR20070054769A (en) | Method for connecting video call in mobile communication terminal | |
KR100953509B1 (en) | Method for multipoint video communication | |
US11102451B2 (en) | Videoconferencing server for providing multi-screen videoconferencing by using a plurality of videoconferencing terminals and method therefor | |
CN105915837A (en) | Video switching method, video switching device and video switching system | |
KR101691124B1 (en) | System and method for providing multi-sharing of multimedia |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 15871697 Country of ref document: EP Kind code of ref document: A1 |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 15871697 Country of ref document: EP Kind code of ref document: A1 |