CN111212259B - Method, system and related device for realizing audio and video conference - Google Patents



Publication number
CN111212259B
Authority
CN
China
Prior art keywords
conference
room
terminal
call server
identity
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202010212284.6A
Other languages
Chinese (zh)
Other versions
CN111212259A (en
Inventor
黄铁鸣
赵晓强
黄强
林莉
陈静聪
王文渊
罗程
李斌
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Tencent Technology Shenzhen Co Ltd
Original Assignee
Tencent Technology Shenzhen Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date


Classifications

    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N7/00 Television systems
    • H04N7/14 Systems for two-way working
    • H04N7/15 Conference systems

Abstract

An embodiment of the application discloses an audio and video conference implementation method in which a conference can be reserved in advance. When a first user wishes to reserve a conference, the first user sends a conference reservation request to a conference management server through a first terminal. The conference reservation request comprises conference information, and the conference information comprises a conference start time, a first identity identifier of the first terminal and a second identity identifier of a second terminal. The conference management server generates a reservation record according to the conference reservation request. When the conference management server determines from the reservation record that the conference start time has been reached, it sends a conference start request to a conference call server. The conference start request comprises the first identity identifier and the second identity identifier determined from the reservation record, so that the corresponding terminals can be invited to enter the conference. By reserving the conference in advance, the method starts the conference automatically and quickly, is simple to operate, and ensures that the conference starts on time.

Description

Method, system and related device for realizing audio and video conference
Technical Field
The present application relates to the field of communications technologies, and in particular, to a method, a system, and a related device for implementing an audio/video conference.
Background
With the rapid development of network, communication and streaming media technologies, and as people's work and study become increasingly mobile, enterprises and individuals have a growing demand for video communication, and audio and video conference systems have emerged in response.
In current audio and video conference implementation methods, starting a conference requires a participant to initiate the conference and invite the other participants to enter it.
However, this approach is cumbersome, and more so when there are many participants. Starting the audio and video conference takes a long time, and it may even be difficult to start the conference on time.
Disclosure of Invention
To solve the above technical problem, the application provides a method, a system and a related device for implementing an audio and video conference. A conference management server can start the conference automatically and quickly without a participant having to initiate it, which simplifies operation, shortens the time needed to start the conference, and ensures that the conference starts on time.
The embodiments of the application disclose the following technical solutions:
In a first aspect, an embodiment of the present application provides an audio and video conference implementation method, where the method includes:
the conference management server acquires a conference reservation request sent by a first terminal; the conference reservation request comprises conference information, and the conference information comprises a conference start time, a first identity identifier of the first terminal and a second identity identifier of a second terminal;
the conference management server generates a reservation record according to the conference reservation request; the reservation record comprises the conference information;
the conference management server sends a conference start request to a conference call server when it determines, according to the reservation record, that the conference start time has been reached; the conference start request comprises the first identity identifier and the second identity identifier; the first identity identifier and the second identity identifier are determined from the reservation record.
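The three steps of the first aspect can be sketched as follows. This is a minimal illustration under stated assumptions, not the patented implementation: the class names, the `tick` polling method and the `send_start_request` callback (standing in for the link to the conference call server) are all hypothetical.

```python
from dataclasses import dataclass
from datetime import datetime


@dataclass(frozen=True)
class ConferenceInfo:
    start_time: datetime
    first_identity: str        # identifier of the first (reserving) terminal
    second_identities: tuple   # identifiers of the invited second terminals


@dataclass
class ReservationRecord:
    info: ConferenceInfo
    started: bool = False


class ConferenceManagementServer:
    def __init__(self, send_start_request):
        # Hypothetical callback that delivers a start request to the call server.
        self._send_start_request = send_start_request
        self._records = []

    def handle_reservation_request(self, info):
        """Generate and store a reservation record containing the conference information."""
        record = ReservationRecord(info=info)
        self._records.append(record)
        return record

    def tick(self, now):
        """Send one start request for every record whose start time has been reached."""
        for record in self._records:
            if not record.started and now >= record.info.start_time:
                record.started = True
                self._send_start_request({
                    "first_identity": record.info.first_identity,
                    "second_identities": list(record.info.second_identities),
                })
```

Polling with `tick` is only one way to detect that the start time has been reached; a timer per record would serve equally well.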
In a second aspect, an embodiment of the present application provides an apparatus for implementing an audio/video conference, where the apparatus includes an obtaining unit, a generating unit, and a sending unit:
the obtaining unit is configured to acquire a conference reservation request sent by a first terminal; the conference reservation request comprises conference information, and the conference information comprises a conference start time, a first identity identifier of the first terminal and a second identity identifier of a second terminal;
the generating unit is configured to generate a reservation record according to the conference reservation request; the reservation record comprises the conference information;
the sending unit is configured to send a conference start request to a conference call server when it is determined, according to the reservation record, that the conference start time has been reached; the conference start request comprises the first identity identifier and the second identity identifier; the first identity identifier and the second identity identifier are determined from the reservation record.
In a third aspect, an embodiment of the present application provides an audio and video conference implementation method, where the method includes:
the conference call server creates a conference room in response to a conference start request; the conference start request is triggered by the conference management server according to the conference start time in a reservation record; the conference start request comprises a first identity identifier of a first terminal and a second identity identifier of a second terminal; the first identity identifier and the second identity identifier are determined from the reservation record;
the conference call server initiates a conference start reminder to the first terminal corresponding to the first identity identifier;
if a conference room entry notification sent by the first terminal is received, the conference call server initiates a conference room entry invitation to the second terminal corresponding to the second identity identifier.
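The ordering in the third aspect (create the room, remind the first terminal, and invite the second terminals only once the first terminal has entered) can be sketched as below. The class, the message labels and the `notify(identity, message)` callback are assumptions made for illustration; the source does not specify a message format.

```python
class ConferenceCallServer:
    def __init__(self, notify):
        # notify(identity, message) is a hypothetical delivery hook to a terminal.
        self._notify = notify
        self._rooms = {}
        self._next_room_id = 1

    def handle_start_request(self, first_identity, second_identities):
        """Create a conference room and remind the first terminal that the conference is starting."""
        room_id = self._next_room_id
        self._next_room_id += 1
        self._rooms[room_id] = {
            "first": first_identity,
            "pending": list(second_identities),  # not yet invited
            "members": set(),
        }
        self._notify(first_identity, ("conference_start_reminder", room_id))
        return room_id

    def handle_room_entry(self, room_id, identity):
        """Record an entry; when the first terminal enters, invite the second terminals."""
        room = self._rooms[room_id]
        room["members"].add(identity)
        if identity == room["first"]:
            for second in room["pending"]:
                self._notify(second, ("room_entry_invitation", room_id))
            room["pending"] = []  # invitations are sent only once
```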
In a fourth aspect, an embodiment of the present application provides an apparatus for implementing an audio/video conference, where the apparatus includes a creating unit, a first sending unit, and a second sending unit:
the creating unit is configured to create a conference room in response to a conference start request; the conference start request is triggered by the conference management server according to the conference start time in a reservation record; the conference start request comprises a first identity identifier of a first terminal and a second identity identifier of a second terminal; the first identity identifier and the second identity identifier are determined from the reservation record;
the first sending unit is configured to initiate a conference start reminder to the first terminal corresponding to the first identity identifier;
the second sending unit is configured to initiate a conference room entry invitation to the second terminal corresponding to the second identity identifier if a conference room entry notification of the first terminal is received.
In a fifth aspect, an embodiment of the present application provides an audio and video conference implementation method, where the method includes:
the conference control server receives a control instruction; the control instruction comprises an identity of a target terminal, and the control instruction is used for indicating the target terminal to change a call state;
and the conference control server sends the control instruction to a target terminal corresponding to the identity identification.
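A control instruction, as described in the fifth aspect, carries the identity of a target terminal and indicates a call-state change. The sketch below routes such an instruction by identity. The field names, the `"audio_off"` label and the `terminals` mapping are hypothetical, and the sketch delivers directly to a terminal rather than modelling any intermediate server.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ControlInstruction:
    target_identity: str   # identity of the target terminal
    new_call_state: str    # hypothetical label, for example "audio_off"


class ConferenceControlServer:
    def __init__(self, terminals):
        # terminals maps an identity to a callable that delivers to that terminal (assumed).
        self._terminals = terminals

    def handle(self, instruction):
        """Forward the instruction to the terminal matching its identity, if known."""
        deliver = self._terminals.get(instruction.target_identity)
        if deliver is None:
            return False
        deliver(instruction)
        return True
```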
In a sixth aspect, an embodiment of the present application provides an apparatus for implementing an audio/video conference, where the apparatus includes a receiving unit and a sending unit:
the receiving unit is used for receiving a control instruction; the control instruction comprises an identity of a target terminal, and the control instruction is used for indicating the target terminal to change a call state;
and the sending unit is used for sending the control instruction to the target terminal corresponding to the identity through the conference call server.
In a seventh aspect, an embodiment of the present application provides an audio and video conference implementation system, where the system includes a conference management server, a conference call server, and a conference control server:
the conference management server is configured to acquire a conference reservation request sent by a first terminal, where the conference reservation request comprises conference information, and the conference information comprises a conference start time, a first identity identifier of the first terminal and a second identity identifier of a second terminal; to generate a reservation record according to the conference reservation request, where the reservation record comprises the conference information; and to send a conference start request to the conference call server when it is determined, according to the reservation record, that the conference start time has been reached, where the conference start request comprises the first identity identifier and the second identity identifier, both determined from the reservation record;
the conference call server is configured to create a conference room in response to the conference start request; to initiate a conference start reminder to the first terminal corresponding to the first identity identifier; and, if a conference room entry notification of the first terminal is received, to initiate a conference room entry invitation to the second terminal corresponding to the second identity identifier;
the conference control server is configured to receive a control instruction, where the control instruction comprises an identity of a target terminal and instructs the target terminal to change its call state; and to send the control instruction, through the conference call server, to the target terminal corresponding to the identity.
In an eighth aspect, an embodiment of the present application provides an apparatus, including a processor and a memory:
the memory is used for storing program codes and transmitting the program codes to the processor;
the processor is configured to perform the method of the first aspect or the third aspect or the fifth aspect according to instructions in the program code.
In a ninth aspect, embodiments of the present application provide a computer-readable storage medium for storing program code for executing the method of the first aspect, the third aspect, or the fifth aspect.
As can be seen from the above technical solutions, the audio and video conference implementation method allows a conference to be reserved in advance. When a first user wants to reserve a conference, the first user sends a conference reservation request to the conference management server through the first terminal. The conference reservation request comprises conference information, and the conference information comprises a conference start time, a first identity identifier of the first terminal and a second identity identifier of a second terminal. The conference management server generates a reservation record according to the conference reservation request, and the reservation record comprises the conference information. When the conference management server determines from the reservation record that the conference start time has been reached, it sends a conference start request to the conference call server. The conference start request comprises the first identity identifier and the second identity identifier determined from the reservation record, so that the corresponding terminals can be invited to enter the conference. Because the conference is reserved in advance, it can be started automatically and quickly from the reservation record when it is due, and participants such as the first user do not need to add the other participants to the conference one by one. The operation is simple, and the conference can start on time.
Drawings
To illustrate the embodiments of the present application or the technical solutions of the prior art more clearly, the drawings used in describing the embodiments or the prior art are briefly introduced below. The drawings described below show only some embodiments of the present application; a person of ordinary skill in the art can derive other drawings from them without creative effort.
Fig. 1 is a schematic diagram of a system architecture of an audio and video conference implementation method provided in an embodiment of the present application;
Fig. 2 is a signaling interaction diagram of an audio and video conference implementation method provided in an embodiment of the present application;
Fig. 3 is a structural diagram of an audio and video conference implementation system provided in an embodiment of the present application;
Fig. 4 is a diagram of a functional example of a conference management server according to an embodiment of the present application;
Fig. 5 is an exemplary diagram of an interface for reserving a conference according to an embodiment of the present application;
Fig. 6 is an exemplary diagram of an interface for a reservation record according to an embodiment of the present application;
Fig. 7 is an exemplary diagram of a system architecture of a conference voice service according to an embodiment of the present application;
Fig. 8 is a signaling interaction diagram of an overall voice call flow provided in an embodiment of the present application;
Fig. 9a is an exemplary diagram of an interface of a first terminal when a conference is started according to an embodiment of the present application;
Fig. 9b is an exemplary diagram of an interface of a second terminal when a conference is started according to an embodiment of the present application;
Fig. 10 is a signaling interaction diagram of a conference control flow provided in an embodiment of the present application;
Fig. 11 is an exemplary diagram of an interface of a first terminal in a conference control process according to an embodiment of the present application;
Fig. 12 is an exemplary diagram of an interface of a second terminal in a conference control process according to an embodiment of the present application;
Fig. 13 is a signaling interaction diagram of an audio and video conference implementation method provided in an embodiment of the present application;
Fig. 14 is a structural diagram of an audio and video conference implementation apparatus provided in an embodiment of the present application;
Fig. 15a is a structural diagram of an audio and video conference implementation apparatus provided in an embodiment of the present application;
Fig. 15b is a structural diagram of an audio and video conference implementation apparatus provided in an embodiment of the present application;
Fig. 16 is a structural diagram of an audio and video conference implementation apparatus according to an embodiment of the present application;
Fig. 17 is a block diagram of an apparatus provided in an embodiment of the present application;
Fig. 18 is a block diagram of a server according to an embodiment of the present application.
Detailed Description
Embodiments of the present application are described below with reference to the accompanying drawings.
In existing audio and video conference implementation methods, a conference initiator starts the conference and must add the other participants to it one by one from an address book. If 30 members besides the initiator are to attend, the initiator has to pull all 30 into the conference individually. The operation is cumbersome, so starting the audio and video conference takes a long time, and it may even be difficult to start the conference on time.
To solve the above technical problem, an embodiment of the present application provides an audio and video conference implementation method with a conference reservation function. Once a conference has been reserved in advance, the conference management server can automatically trigger the start of the conference when it determines that the conference start time has been reached, so that the conference call server can automatically invite the terminals of the participating users to enter the conference according to the reservation record. This simplifies the procedure for starting a conference and makes starting it more efficient.
It should be emphasized that the audio and video conference implementation method provided in the embodiment of the present application is implemented based on cloud technology. Cloud technology is a general term for the network, information, integration, management platform and application technologies applied under the cloud computing business model. The background services of technical network systems, such as video websites, picture websites and portal websites, require large amounts of computing and storage resources. As the internet industry develops, each item may come to have its own identifier that must be transmitted to a background system for logical processing; data at different levels is processed separately, and all kinds of industry data require strong backend support, which can only be achieved through cloud computing.
The method combines the powerful cloud computing capability of cloud technology with audio and video capability, and can implement the audio and video conference function on the basis of communication software that supports audio and video conferences. A cloud conference is an efficient, convenient and low-cost form of conference based on cloud computing technology. Through simple operations in an internet interface, a user can quickly and efficiently share voice, data files and video with teams and clients all over the world, while complex tasks such as transmitting and processing conference data are handled by the cloud conference service provider on the user's behalf.
Currently, cloud conferences mainly focus on service content delivered in the Software as a Service (SaaS) model, including telephone, network and video services; during a conference, voice calls are implemented on the basis of SaaS.
In the cloud conference era, data transmission, processing and storage are all handled by the computing resources of the video conference vendor. Users do not need to purchase expensive hardware or install cumbersome software; they can hold an efficient remote conference simply by opening a browser or communication software and logging in to the corresponding interface.
In order to facilitate understanding of the technical scheme of the present application, the following introduces an audio and video conference implementation method provided by the embodiment of the present application in combination with an actual application scenario.
Referring to fig. 1, fig. 1 is a schematic diagram of a system architecture of an audio and video conference implementation method provided in an embodiment of the present application. The system architecture includes a conference management server 101, a conference call server 102, a conference control server 103, and a plurality of terminals 104. The conference management server 101, the conference call server 102, and the conference control server 103 are collectively referred to as a server, and the server may be an independent physical server, a server cluster or a distributed system formed by a plurality of physical servers, or a cloud server providing basic cloud computing services such as a cloud service, a cloud database, cloud computing, a cloud function, cloud storage, a network service, cloud communication, a middleware service, a domain name service, a security service, a CDN, and a big data and artificial intelligence platform.
The plurality of terminals 104 may include a first terminal and a second terminal. The first terminal is the terminal of a first user, who may be referred to as the conference initiator because the first user reserves the conference through the first terminal. The second terminal is the terminal of a second user, who may be another participant invited by the first user. A terminal may be, but is not limited to, a smart phone, a tablet computer, a laptop computer, a desktop computer, a smart speaker, a smart watch, and the like. The terminal 104 and the server may be connected directly or indirectly through wired or wireless communication, which is not limited in this application.
An audio and video conference implemented on this system has a conference reservation function. When the first user wants to reserve a conference, the first terminal sends a conference reservation request to the conference management server 101. The conference reservation request comprises conference information, and the conference information comprises a conference start time, a first identity identifier of the first terminal and a second identity identifier of the second terminal. From the conference start time, the conference management server 101 knows when to start the conference; from the first identity identifier and the second identity identifier, it knows who the participants are.
The conference management server 101 generates a reservation record according to the conference reservation request, and the reservation record includes the conference information. When the conference management server 101 determines that the conference start time is reached according to the reservation record, it sends a conference start request to the conference call server 102, where the conference start request includes the first identity and the second identity determined according to the reservation record. In this way, after the conference call server 102 creates a conference room in response to the conference start request, it may initiate a conference start reminder to the first terminal corresponding to the first identity, and if a conference room entry notification of the first terminal is received, initiate a conference room entry invitation to the second terminal corresponding to the second identity.
After the first terminal and the second terminal enter the conference room, the participants can speak in the conference room or listen to others speak. During the conference, the conference control server 103 may control the conference by receiving control instructions, for example by changing the call state of certain terminals 104 (the target terminals).
Next, a method for implementing an audio/video conference provided by the embodiment of the present application will be described in detail with reference to the accompanying drawings. Referring to fig. 2, the method includes:
S201, the first terminal sends a conference reservation request to the conference management server.
It can be understood that the audio and video conference implementation method provided in the embodiment of the present application relies on an audio and video conference implementation system. As shown in fig. 3, the system includes a conference management server 101, a conference call server 102, and a conference control server 103, each with different functions. The conference management server 101 mainly provides supporting capabilities before, during, and after the conference. As shown in fig. 4, its pre-conference capabilities include conference reservation, conference reservation modification, conference reminder, conference receipt, and the like; its in-conference capabilities include the conference to-do list, conference re-entry, and the like; its post-conference capabilities include the conference details and conference records of to-do conferences, and the like. The conference call server 102 mainly provides conference room creation, room member management, member state synchronization, voice and data communication, and other capabilities. The conference control server 103 mainly provides the ability to change participants' call states during the conference, including turning audio on or off, turning video on or off, removing a member from the conference, transferring the host role, adding conference members, and the like.
Based on the different functions of the different servers, before the conference, if the first user wishes to reserve the conference, the first user sends a conference reservation request to the conference management server through the first terminal, where the conference reservation request includes conference information, and the conference information may include, for example, conference start time, a first identity of the first terminal, and a second identity of the second terminal.
The conference reservation request may be triggered as follows: the first user clicks a "reserve conference" function key (see 501 in fig. 5 for an interface example) to enter a conference reservation interface (see 502 in fig. 5 for an interface example). In the conference reservation interface, the first user may fill in the conference information, such as the conference name, the conference start time (i.e., the time entered in the "start" field), the participants (represented by identifiers such as the first identity identifier and the second identity identifier), and other relevant information.
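As a rough illustration of what the first terminal might submit once the form is filled in, the sketch below assembles the conference information into a JSON-serializable request. The field names, the ISO 8601 time format and the rule that at least one participant must be listed are assumptions; the source does not define a wire format.

```python
import json
from datetime import datetime


def build_reservation_request(name, start_time, first_identity, second_identities):
    """Assemble the conference information that a reservation form would submit."""
    if not second_identities:
        # Assumed validation rule, not stated in the source.
        raise ValueError("at least one second terminal must be invited")
    return {
        "conference_info": {
            "name": name,
            "start_time": start_time.isoformat(),
            "first_identity": first_identity,
            "second_identities": list(second_identities),
        }
    }


request = build_reservation_request(
    "Weekly sync",
    datetime(2020, 3, 24, 10, 0),
    "terminal-A",
    ["terminal-B", "terminal-C"],
)
payload = json.dumps(request, sort_keys=True)  # text sent to the management server
```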
S202, the conference management server generates a reservation record according to the conference reservation request.
After receiving the conference reservation request, the conference management server generates a reservation record according to it. The reservation record may be displayed on the first terminal of the first user, see the interface example shown at 503 in fig. 5. The reservation record includes the conference information described above. When the first user clicks on the reservation record, the conference details can be viewed, see the interface example shown at 504 in fig. 5.
After receiving the conference reservation request, the conference management server may send a notification message containing the conference information to the second terminal, to notify the second user that an audio and video conference will be held at the conference start time. After the second terminal receives the notification message, the second user may indicate, according to his or her own situation, whether he or she will attend. For example, the second terminal sends a receipt message to the conference management server, where the receipt message indicates whether the second user will attend the conference, so that the conference management server can record in the reservation record which users will attend. The reservation record displayed on the first terminal then identifies which second users have agreed to attend, for example with a "√" as shown at 504.
After the reservation record is generated, the second terminal of the second user may also display it, see the interface example shown at 601 in fig. 6. The second user may select one of the reservation records (the to-do list) to view, for example the record with the conference name "some scientific forum", and the conference information of that record is displayed, see the interface example shown at 602. When the second user clicks on the reservation record, the conference details can be viewed, see the interface example shown at 603. On the interface shown at 603, the second user may select "accept", "pending" or "decline" to indicate whether he or she will attend, and a receipt message is sent to the conference management server based on that selection.
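The receipt handling described above can be sketched as a small bookkeeping structure: each invited terminal answers "accept", "pending" or "decline", and the details view marks those who accepted. The class and method names are hypothetical, and representing "no receipt yet" as `None` is an assumption.

```python
from enum import Enum


class Receipt(Enum):
    ACCEPT = "accept"
    PENDING = "pending"
    DECLINE = "decline"


class ReservationReceipts:
    def __init__(self, second_identities):
        # None means the terminal has not sent a receipt message yet (assumed).
        self.receipts = {identity: None for identity in second_identities}

    def record_receipt(self, identity, receipt):
        """Store the receipt sent by an invited second terminal."""
        if identity not in self.receipts:
            raise KeyError(f"{identity} was not invited")
        self.receipts[identity] = receipt

    def display_rows(self):
        """Rows for the details view: a check mark flags users who accepted."""
        return [
            (identity, "√" if receipt is Receipt.ACCEPT else "")
            for identity, receipt in self.receipts.items()
        ]
```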
If the conference is in progress, the to-do list can show the current conference state, and participants can enter the conference through the to-do list or the conference details. This gives participants who have left the conference a convenient way to re-enter it.
It can be understood that, through the conference reservation modification function of the conference management server, the first user may modify the conference information that was filled in according to the actual situation, for example the conference start time.
Normally, after the first user has reserved a conference, the first user can be reminded to start the conference on time when the conference start time is reached. In addition, the conference management server can start the conference automatically at the conference start time, which spares the first user the cumbersome operations of starting it manually.
However, in some cases participants may have other commitments at or after the scheduled start time. To ensure as far as possible that they can still attend, a reserved conference may be started before its conference start time is reached. For example, the first user may click the "start meeting ahead" function key in the interface example shown at 504 to start the conference early.
S203, when the conference management server determines that the conference starting time is reached according to the reservation record, the conference management server sends a conference starting request to the conference call server.
When the conference start time is reached, the conference management server automatically triggers the start of the conference, that is, it sends a conference starting request to the conference call server. The conference starting request includes a first identity and a second identity, both of which are determined according to the reservation record. The conference call server can therefore invite the corresponding terminals to enter the conference according to the first identity and the second identity. In other words, in the present application the conference is started automatically by the conference management server, so that the conference call server can invite the first terminal and the second terminal to enter the conference, and no conference member needs to initiate the conference.
It should be noted that the second terminal may send a receipt message to the conference management server to indicate whether the second user corresponding to the second terminal will attend the conference. If the receipt message indicates that the second user will attend the conference, the conference call server invites the second user to enter the conference. If the receipt message indicates that the second user declines to attend, the conference call server does not invite the second user, which avoids meaningless invitations and wasted communication resources. Therefore, the second identity in the conference starting request is the identity of a second terminal that is determined, according to the receipt message, to participate in the conference, so that the conference call server initiates the conference invitation only to second terminals that are determined to participate.
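The triggering logic of S203, together with the receipt-based filtering described above, can be sketched as follows. This is a minimal illustration rather than the claimed implementation: the names `ReservationRecord` and `build_start_request`, the field names, and the receipt values are hypothetical, and the sketch assumes that only invitees whose receipt is "accept" are placed in the request.

```python
from dataclasses import dataclass, field
from datetime import datetime
from typing import Dict, List, Optional


@dataclass
class ReservationRecord:
    # Hypothetical field names; the source only states that the record holds
    # the conference start time, the first identity and the second identities.
    start_time: datetime
    first_id: str
    second_ids: List[str]
    # Receipt per invitee, e.g. "accept", "pending" or "decline".
    receipts: Dict[str, str] = field(default_factory=dict)


def build_start_request(record: ReservationRecord, now: datetime) -> Optional[dict]:
    """Return a conference start request once the start time is reached, else None."""
    if now < record.start_time:
        return None
    # Assumption: invitees who declined, are pending, or never replied are left out.
    accepted = [uid for uid in record.second_ids
                if record.receipts.get(uid) == "accept"]
    return {"first_id": record.first_id, "second_ids": accepted}
```

In such a sketch the conference management server would call `build_start_request` periodically, or from a timer set for the start time, and forward any non-empty result to the conference call server.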
S204, the conference call server creates a conference room in response to the conference starting request.
After receiving the conference starting request, the conference call server creates a conference room, which the participating members enter to hold the audio and video conference.
In this embodiment, audio and video communication may be established between terminals through Voice over Internet Protocol (VoIP). The conference voice service in the audio and video conference process can be implemented based on the architecture shown in fig. 7, which includes a plurality of terminals 701, such as terminal 1, terminal 2, and so on (the plurality of terminals includes the first terminal and the second terminal), a voice call server 702, a room management server 703, and a data management platform 704. The conference call server 102 may include the voice call server 702, the room management server 703, the data management platform 704, and the like, where the voice call server and the room management server belong to the signaling layer and the data management platform 704 belongs to the data layer. The plurality of terminals 701 are connected to the access layer 705 through the Transmission Control Protocol (TCP), so as to establish the signaling layer with the voice call server and the room management server, and the plurality of terminals 701 are connected to the data management platform 704 through the User Datagram Protocol (UDP), so as to establish the data layer.
It should be noted that, based on the architecture shown in fig. 7, a conference room generally includes a signaling room and a data room. The signaling interaction of the overall voice call flow during an audio and video conference call may therefore be as shown in fig. 8. The first terminal transmits a room creation request to the voice call server (S801); the conference call server sends a signaling room creation request to the room management server through the voice call server (S802); the conference call server receives, through the voice call server, a first response message sent by the room management server (S803), where the first response message represents that the signaling room is successfully created and may include a room identifier and a key used for transmitting voice data; the conference call server sends a data room creation request to the data management platform through the voice call server (S804); and the conference call server receives, through the voice call server, a second response message sent by the data management platform (S805), where the second response message represents that the data room is successfully created, so that the creation of the conference room is completed. After the creation of the conference room is completed, the conference call server transmits a third response message to the first terminal through the voice call server (S806), indicating that the conference room is successfully created.
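The two-stage room creation of S802 to S806 can be illustrated with in-memory stand-ins for the room management server and the data management platform. All class and function names below are hypothetical, and passing the room identifier and key on to the data management platform is an assumption; the source only states that the first response message may carry them.

```python
import secrets


class RoomManagementServer:
    """Stand-in for the room management server (signaling layer)."""

    def __init__(self):
        self.signaling_rooms = {}

    def create_signaling_room(self):
        room_id = f"room-{len(self.signaling_rooms) + 1}"
        key = secrets.token_hex(16)  # key used for transmitting voice data
        self.signaling_rooms[room_id] = {"key": key, "members": set()}
        # Plays the role of the first response message (S803).
        return {"ok": True, "room_id": room_id, "key": key}


class DataManagementPlatform:
    """Stand-in for the data management platform (data layer)."""

    def __init__(self):
        self.data_rooms = {}

    def create_data_room(self, room_id, key):
        self.data_rooms[room_id] = {"key": key, "members": set()}
        # Plays the role of the second response message (S805).
        return {"ok": True}


def create_conference_room(rooms, data):
    """Create the signaling room first, then the data room."""
    first = rooms.create_signaling_room()                           # S802 / S803
    if not first["ok"]:
        return {"ok": False}
    second = data.create_data_room(first["room_id"], first["key"])  # S804 / S805
    if not second["ok"]:
        return {"ok": False}
    # Plays the role of the third response message (S806).
    return {"ok": True, "room_id": first["room_id"]}
```

The conference room counts as created only after both stages succeed, which mirrors the order of the response messages in fig. 8.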
S205, the conference call server initiates a conference start prompt to the first terminal corresponding to the first identity.
S206, the conference call server receives the conference room entry notification sent by the first terminal.
After the conference room is created, the conference call server initiates a conference start prompt to the first terminal, and the first user can enter the conference directly with a single tap to start the conference.
When the first terminal receives the conference start prompt, the interface displayed on the first terminal may be as shown in fig. 9a; for example, the interface notifies the first user that the reserved conference has reached its start time and displays the conference information. The first user can select "start meeting immediately" or "pause" through the function keys provided by the interface, and when the first user clicks the function key "start meeting immediately", the first terminal sends a conference room entry notification to the conference call server.
S207, the conference call server initiates a conference room entry invitation to the second terminal corresponding to the second identity.
After the first terminal enters the conference, the conference call server directly initiates the conference room entrance invitation to the second terminal according to the second identity, and the first user does not need to add participating members one by one, thereby simplifying the operation process of the first user.
When the second terminal receives the conference room entry invitation, the interface displayed on it may be, for example, as shown in fig. 9b, which prompts the second user to enter the conference and displays the conference information. The second user can select "slide into the conference" or "temporarily not enter" through the function keys provided by the interface, and when the second user drags the circular icon in the direction of the arrow shown in fig. 9b, the second terminal enters the conference room.
S208, the second terminal enters the conference room.
Based on the architecture shown in fig. 7, the flow of the second terminal entering the conference can be seen in fig. 8. The second terminal transmits a conference room entry request to the voice call server (S807); the voice call server sends a signaling room entry request to the room management server (S808); the voice call server receives a fourth response message (S809) sent by the room management server, wherein the fourth response message represents that the signaling room is successfully entered; the voice call server sends a data room entering request to the data management platform (S810); the voice call server receives a fifth response message sent by the data management platform (S811), where the fifth response message represents that the data room successfully enters, thereby completing the entrance of the conference room. After completing the entrance of the conference room, the voice call server transmits a sixth response message to the second terminal (S812), indicating that the entrance of the conference room is successful. The first user and the second user may then enter the call using the data layer (as shown by the dashed box in fig. 8).
After the first terminal enters the data room, the conference state management server may be notified that the first terminal has started talk-state timing and that the second terminal has started calling-state timing; after the second terminal enters the data room, the conference state management server may be notified that the second terminal has started talk-state timing and that the calling-state timing of the second terminal is cleared.
According to the technical scheme, the audio and video conference implementation method can reserve the conference in advance, when a first user wants to reserve the conference, the first user sends a conference reservation request to the conference management server through the first terminal, the conference reservation request comprises conference information, and the conference information comprises conference starting time, a first identity mark of the first terminal and a second identity mark of the second terminal. And the conference management server generates a reservation record according to the conference reservation request, wherein the reservation record comprises conference information. Therefore, when the conference management server determines that the conference starting time is reached according to the reservation record, the conference starting request is triggered to be sent to the conference call server, and the conference starting request comprises the first identity identification and the second identity identification determined according to the reservation record, so that the corresponding terminal can be invited to enter the conference according to the first identity identification and the second identity identification. Therefore, the method can automatically and quickly start the conference according to the reservation record when the audio and video conference needs to be started by reserving the conference in advance, a conference initiator such as a first user does not need to add conference members to the conference one by one, the operation is simple, and the conference is guaranteed to be carried out on time.
It can be understood that conference control is an important component of an audio and video conference, and a complete and reliable control mechanism is key to organizing the audio and video conference in an orderly and efficient manner. In the embodiment of the present application, the conference control function mainly controls participating members to change their call state. For example, the conference control function includes turning voice on or off in the conference, turning video on or off, removing a member from the conference, transferring the host role, adding a conference member, and the like, where turning off voice or video may apply to a certain member or to all members. The party that triggers conference control is typically the conference initiator, for example the first user, and the participating member being controlled is typically a second user.
During the conference, the signaling interaction diagram of the conference control flow can be seen in fig. 10:
S1001, the first terminal sends a control instruction to the conference control server.
The control instruction includes a third identity of the target terminal, the control instruction is used for indicating the target terminal to change a call state, and the target terminal may be any one or more of the second terminals.
S1002, the conference control server sends the control instruction to the target terminal corresponding to the third identity.
In some cases, the conference call server already has capabilities such as room member management, member state synchronization, voice communication, and data communication. To reuse these capabilities, the conference control server may send the control instruction to the target terminal through the conference call server.
It should be noted that not every participating member is able to control other participating members to change the call state. For example, the first user usually has this capability while the second user does not; in other cases the call state of a given participating member should not be changed, or the system resources cannot support the change of call state. Therefore, to avoid unreasonable control, the conference control server may arbitrate the control instruction before S1002 (S1004). If the arbitration passes, S1002 is performed; otherwise, an arbitration result message is returned to the first terminal (S1005), prompting that the control is restricted.
S1003, the target terminal executes the control instruction.
The target terminal executes the control instruction after receiving it. For example, if the control instruction indicates closing the video, the target terminal turns off the camera after receiving the control instruction and stops collecting video data.
The target terminal may also return an execution result to the conference control server (S1006), and the conference control server synchronizes the call state of the target terminal, according to the execution result, to the first terminal and to the second terminals, that is, the target terminal and the other second terminals (S1007).
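The control flow of S1001 to S1007, including the arbitration step, can be sketched as below. The class names, the action strings, and the arbitration rule (only the conference initiator may control, and the target must be in the conference) are assumptions made for illustration; the source leaves the arbitration criteria open. Execution on the target terminal is simulated by updating a state table.

```python
from dataclasses import dataclass

# Hypothetical action names mapped to the call-state field they change.
ACTIONS = {"close_video": ("video", False), "close_voice": ("voice", False)}


@dataclass
class ControlInstruction:
    sender_id: str
    target_id: str  # the "third identity" of the target terminal
    action: str


class ConferenceControlServer:
    def __init__(self, host_id, members):
        self.host_id = host_id
        self.call_state = {m: {"video": True, "voice": True} for m in members}

    def arbitrate(self, instr):
        # S1004 (assumed rule): sender is the initiator, target is a member,
        # and the requested action is known.
        return (instr.sender_id == self.host_id
                and instr.target_id in self.call_state
                and instr.action in ACTIONS)

    def handle(self, instr):
        if not self.arbitrate(instr):
            # S1005: arbitration result returned to the sender.
            return {"delivered": False, "reason": "control restricted"}
        # S1002 / S1003: forward to the target, which executes the instruction.
        key, value = ACTIONS[instr.action]
        state = self.call_state[instr.target_id]
        state[key] = value
        # S1006 / S1007: the new call state is synchronized to every terminal.
        return {"delivered": True,
                "sync_to": sorted(self.call_state),
                "state": dict(state)}
```

Keeping arbitration in a separate method makes it straightforward to add further checks, such as resource limits, without touching the forwarding path.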
Fig. 11 is an exemplary interface diagram of the first terminal. The interface shows a conference member list and function keys for changing a call state, such as removing a member from the conference, closing a video, or canceling a call state; other function keys may of course also be included. Taking closing a video as an example, the first user may select, in the conference member list, the participating member or members whose video is to be closed. Fig. 12 is an exemplary interface diagram of the second terminal. When the first user chooses to close the video of a participating member, the interface of the corresponding second terminal displays "the host has closed your video", as shown at 1201 in fig. 12. When the first user chooses to open the video of a participating member, the interface of the corresponding second terminal is as shown at 1202 in fig. 12 and displays "the host invites you to open the video, and the microphone will be automatically opened after the video is opened"; the second user may then select "temporarily not open" or "open immediately".
It should be noted that in the conventional conference process, the conference members usually record the conference contents manually, so as to understand or perform conference summary after the conference. In the embodiment, however, a way of automatically recording the conference content is provided, that is, during the conference, the conference management server may record the audio data during the conference, so as to generate the conference record according to the audio data. The audio data may be directly used as a conference record, or of course, the audio data may be automatically converted into text content to generate a text conference record. In addition, the conference record may include one or more of a start time and an end time of the conference, and participating members of the conference.
It should be noted that, because the call states of the participating members may be changed during the conference, for example, a certain participating member may be removed from the conference, or a new participating member may be added, and the like, if the conference management server receives the control instruction sent by the conference control server, the conference management server may change the conference record according to the third identity in the control instruction. For example, when a new participant is added, the new participant is also added to the generated conference record to ensure the complete and comprehensive conference record.
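A conference record that follows membership changes can be sketched as follows. `ConferenceRecord`, `apply_control`, and the action name "add_member" are hypothetical. Only the addition case is implemented, because the source gives it as the example and does not specify how a removed member is reflected in the record.

```python
from dataclasses import dataclass, field
from datetime import datetime
from typing import List, Optional, Set


@dataclass
class ConferenceRecord:
    start_time: datetime
    end_time: Optional[datetime] = None
    members: Set[str] = field(default_factory=set)
    transcript: List[str] = field(default_factory=list)  # text derived from audio


def apply_control(record: ConferenceRecord, action: str, third_id: str) -> ConferenceRecord:
    """Update the record according to the third identity in a control instruction."""
    if action == "add_member":
        # A newly added participating member is also added to the record.
        record.members.add(third_id)
    # Other actions (closing video, removal, ...) leave this sketch's record
    # unchanged; how they affect the record is not specified in the source.
    return record
```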
It should be noted that, during the conference or at the end of the conference, the participants can exit the conference, and the flow of the first user and the second user exiting the conference is similar, but the two users have different influences on the conference. When the second user exits the conference, the exit conference flow can be seen in fig. 8. The second terminal transmits a conference room exit request to the voice call server (S813); the voice call server transmitting a signaling room exit request to the room management server (S814); the voice call server receives a seventh response message sent by the room management server (S815), where the seventh response message indicates that the signaling room is successfully exited; the voice call server sends a data room exit request to the data management platform (S816); and the voice call server receives an eighth response message sent by the data management platform (S817), wherein the eighth response message represents that the exit of the data room is successful, so that the exit of the conference room is completed. After completion of the exit of the conference room, the voice call server transmits a ninth response message to the second terminal (S818), indicating that the exit of the conference room is successful.
When the first user exits the conference, the exit flow is likewise shown in fig. 8. The first terminal transmits a conference room exit request to the voice call server (S819); the voice call server transmits a signaling room exit request to the room management server (S820); the voice call server receives a tenth response message sent by the room management server (S821), where the tenth response message represents that the signaling room is successfully exited; the voice call server transmits a data room exit request to the data management platform (S822); the voice call server receives an eleventh response message sent by the data management platform (S823), where the eleventh response message represents that the data room is successfully exited, thereby completing the exit of the conference room.
Since the first user corresponding to the first terminal is the initiator of the conference, the first terminal exits from the conference room and indicates that the conference is ended, and at this time, the voice call server may send a signaling room end request to the room management server (S824); the voice call server receives a twelfth response message sent by the room management server (S825), where the twelfth response message represents that the signaling room is successfully ended; the voice call server sends a data room end request to the data management platform (S826); the voice call server receives a thirteenth response message sent by the data management platform (S827), where the thirteenth response message represents that the data room is successfully ended, and the voice call server sends a fourteenth response message to the first terminal (S828), which represents that the conference room of the first terminal is successfully exited.
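The difference between a second user exiting and the initiator exiting can be illustrated with a small room model. The `ConferenceRoom` class and its method names are hypothetical. The sketch assumes, as described above, that the initiator's exit ends both the signaling room and the data room for everyone.

```python
class ConferenceRoom:
    """A conference room made of a signaling room and a data room."""

    def __init__(self, initiator_id):
        self.initiator_id = initiator_id
        self.signaling_members = {initiator_id}
        self.data_members = {initiator_id}
        self.ended = False

    def enter(self, uid):
        if self.ended:
            raise RuntimeError("conference has ended")
        self.signaling_members.add(uid)  # enter the signaling room
        self.data_members.add(uid)       # enter the data room

    def exit(self, uid):
        # Exit the signaling room, then the data room (S814-S817 / S820-S823).
        self.signaling_members.discard(uid)
        self.data_members.discard(uid)
        if uid == self.initiator_id:
            # The initiator leaving ends both rooms (S824-S827).
            self.signaling_members.clear()
            self.data_members.clear()
            self.ended = True
        return {"ok": True, "conference_ended": self.ended}
```

For example, after `room = ConferenceRoom("u1")` and `room.enter("u2")`, calling `room.exit("u2")` leaves the conference running, whereas `room.exit("u1")` ends it.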
After the second terminal exits the data room, the conference state management server can be informed that the second terminal clears the call state timing; after the first terminal exits the data room, the conference state management server may be notified that the first terminal clears the talk state timer.
Next, an audio and video conference implementation method provided by the embodiment of the present application will be introduced in combination with an actual application scenario. The audio-video conference can be realized by specific communication software, and referring to fig. 13, the method comprises the following steps:
S1301, the first user fills in the conference information.
S1302, the first terminal sends a conference reservation request to the conference management server.
S1303, the conference management server sends a notification message to the second terminal.
S1304, the second terminal sends a receipt message to the conference management server.
S1305, the conference management server generates a reservation record according to the conference reservation request and the receipt message.
S1306, when the conference management server determines, according to the reservation record, that the conference start time is reached, the conference management server sends a conference start request to the conference call server.
S1307, the conference call server creates a conference room in response to the conference start request.
S1308, the conference call server initiates a conference start prompt to the first terminal corresponding to the first identity.
S1309, the first terminal sends a notification of entering a conference room to the conference call server.
S1310, the conference call server initiates a conference room entering invitation to a second terminal corresponding to the second identity.
S1311, the second terminal enters the conference room.
S1312, the first terminal sends a control instruction for removing the members to the conference control server.
S1313, the conference control server sends the control instruction to the conference call server.
S1314, the conference control server sends the control instruction to the conference management server.
S1315, the conference call server sends the control instruction to the second terminal to notify the second terminal to exit the conference.
S1316, the first terminal sends a control instruction for ending the conference to the conference control server.
S1317, the conference control server sends the control instruction to the conference call server.
S1318, the conference control server sends the control instruction to the conference management server.
S1319, the conference call server sends the control instruction to the second terminal to notify the second terminal to end the conference.
S1320, the conference management server generates a conference record.
S1321, the conference management server sends the conference record to the first terminal.
S1322, the conference management server sends the conference record to the second terminal.
Based on the audio and video conference implementation method provided by the foregoing embodiment, an embodiment of the present application further provides an audio and video conference implementation apparatus, referring to fig. 14, the apparatus includes an obtaining unit 1401, a generating unit 1402, and a sending unit 1403:
the obtaining unit 1401 is configured to obtain a conference reservation request sent by a first terminal; the conference reservation request comprises conference information, and the conference information comprises conference starting time, a first identity of the first terminal and a second identity of the second terminal;
the generating unit 1402, configured to generate a reservation record according to the conference reservation request; the reservation record comprises the conference information;
the sending unit 1403 is configured to send a conference starting request to the conference call server when it is determined, according to the reservation record, that the conference starting time is reached; the conference starting request comprises the first identity and the second identity; the first identity and the second identity are determined according to the reservation record.
In a possible implementation manner, the sending unit 1403 is configured to:
sending a notification message to the second terminal, wherein the notification message comprises the conference information;
the obtaining unit 1401 is further configured to receive a receipt message sent by the second terminal;
and the second identity in the conference starting request is the identity of the second terminal which is determined to participate in the conference according to the receipt message.
In a possible implementation manner, the generating unit 1402 is further configured to record audio data during a conference; and generating a conference record according to the audio data.
In one possible implementation, the conference record includes a combination of one or more of text content converted from the audio data, start and end times of the conference, and participating members of the conference.
In a possible implementation manner, the obtaining unit 1401 is configured to:
in the conference process, receiving a control instruction sent by a conference control server, wherein the control instruction comprises a third identity of a target terminal and is used for indicating the target terminal to change the call state;
the generating unit 1402 is further configured to change the conference record according to the third identity.
An embodiment of the present application further provides an apparatus for implementing an audio/video conference, referring to fig. 15a, the apparatus includes a creating unit 1501, a first sending unit 1502, and a second sending unit 1503:
the creating unit 1501 is configured to create a conference room in response to a conference initiation request; the conference starting request is triggered by the conference management server according to the conference starting time in the reservation record; the conference starting request comprises a first identity mark of the first terminal and a second identity mark of the second terminal; the first identity mark and the second identity mark are determined according to the reservation record;
the first sending unit 1502 is configured to initiate a conference start reminder to a first terminal corresponding to the first identity;
the second sending unit 1503 is configured to initiate a conference room entry invitation to a second terminal corresponding to the second identity if the conference room entry notification of the first terminal is received.
In one possible implementation, referring to fig. 15b, the apparatus further includes a receiving unit 1504:
the receiving unit 1504 is configured to receive a control instruction sent by a conference control server in a conference process, where the control instruction includes a third identity of a target terminal, and the control instruction is used to instruct the target terminal to change a call state;
the second sending unit 1503 is further configured to forward the control instruction to the target terminal according to the third identity.
In a possible implementation manner, the conference room includes a signaling room and a data room, and the creating unit 1501 is configured to:
sending a signaling room creating request to a room management server through a voice call server;
receiving a first response message sent by the room management server through the voice call server; the first response message represents that the signaling room is successfully created;
sending a data room creating request to a data management platform through the voice call server;
receiving a second response message sent by the data management platform through the voice call server; the second response message characterizes the data room creation success.
The embodiment of the present application further provides an apparatus for implementing an audio/video conference, referring to fig. 16, the apparatus includes a receiving unit 1601 and a sending unit 1602:
the receiving unit 1601 is configured to receive a control instruction; the control instruction comprises an identity of a target terminal, and the control instruction is used for indicating the target terminal to change a call state;
the sending unit 1602 is configured to send the control instruction to the target terminal corresponding to the identity through the conference call server.
The embodiment of the present application further provides an audio and video conference implementation system, where the system includes a conference management server, a conference call server, and a conference control server:
the conference management server is used for acquiring a conference reservation request sent by the first terminal; the conference reservation request comprises conference information, and the conference information comprises conference starting time, a first identity mark of the first terminal and a second identity mark of the second terminal; generating a reservation record according to the conference reservation request; the reservation record comprises the conference information; when the meeting starting time is determined to be reached according to the reservation record, a meeting starting request is sent to a meeting call server; the conference initiation request comprises the first identity and the second identity; the first identity mark and the second identity mark are determined according to the reservation record;
the conference call server is used for responding to a conference starting request to establish a conference room; initiating a conference start prompt to a first terminal corresponding to the first identity mark; if receiving the notification of meeting room entrance of the first terminal, initiating a meeting room entrance invitation to a second terminal corresponding to the second identity;
the conference control server is used for receiving a control instruction; the control instruction comprises an identity of a target terminal, and the control instruction is used for indicating the target terminal to change a call state; and sending the control instruction to a target terminal corresponding to the identity through a conference call server.
The embodiment of the application also provides a device for implementing the audio and video conference, which may be an audio and video processing device. The device is described below with reference to the accompanying drawings. Referring to fig. 17, an embodiment of the present application provides a device 1700, where the device 1700 may be a terminal, and the terminal may be any intelligent terminal including a mobile phone, a tablet computer, a Personal Digital Assistant (PDA), a Point of Sale (POS) terminal, a vehicle-mounted computer, and the like. The following takes a mobile phone as an example of the terminal:
Fig. 17 is a block diagram illustrating a partial structure of a mobile phone related to the terminal provided in an embodiment of the present application. Referring to fig. 17, the mobile phone includes: a Radio Frequency (RF) circuit 1710, a memory 1720, an input unit 1730, a display unit 1740, a sensor 1750, an audio circuit 1760, a wireless fidelity (WiFi) module 1770, a processor 1780, and a power supply 1790. Those skilled in the art will appreciate that the mobile phone structure shown in fig. 17 does not constitute a limitation on the mobile phone, which may include more or fewer components than those shown, combine some components, or use a different arrangement of components.
The following describes each component of the mobile phone in detail with reference to fig. 17:
The RF circuit 1710 can be used for receiving and transmitting signals during information transmission and reception or during a call. In particular, after receiving downlink information from a base station, it passes the information to the processor 1780 for processing; in addition, it transmits uplink data to the base station. In general, the RF circuit 1710 includes, but is not limited to, an antenna, at least one amplifier, a transceiver, a coupler, a Low Noise Amplifier (LNA), a duplexer, and the like. In addition, the RF circuit 1710 may also communicate with networks and other devices via wireless communication. The wireless communication may use any communication standard or protocol, including but not limited to Global System for Mobile communication (GSM), General Packet Radio Service (GPRS), Code Division Multiple Access (CDMA), Wideband Code Division Multiple Access (WCDMA), Long Term Evolution (LTE), email, Short Message Service (SMS), and the like.
The memory 1720 can be used to store software programs and modules, and the processor 1780 executes the various functional applications and data processing of the mobile phone by running the software programs and modules stored in the memory 1720. The memory 1720 may mainly include a program storage area and a data storage area, wherein the program storage area may store an operating system, an application program required for at least one function (such as a sound playing function or an image playing function), and the like; the data storage area may store data created according to the use of the mobile phone (such as audio data or a phonebook), and the like. Further, the memory 1720 may include high-speed random access memory, and may also include non-volatile memory, such as at least one magnetic disk storage device or flash memory device, or another volatile solid-state storage device.
The input unit 1730 may be used to receive input numeric or character information and to generate key signal inputs related to user settings and function control of the mobile phone. Specifically, the input unit 1730 may include a touch panel 1731 and other input devices 1732. The touch panel 1731, also referred to as a touch screen, may collect touch operations performed by a user on or near it (e.g., operations performed on or near the touch panel 1731 with any suitable object or accessory such as a finger or a stylus), and drive a corresponding connection device according to a preset program. Optionally, the touch panel 1731 may include two parts: a touch detection device and a touch controller. The touch detection device detects the position of the user's touch, detects the signal produced by the touch operation, and transmits the signal to the touch controller; the touch controller receives touch information from the touch detection device, converts it into touch point coordinates, sends the coordinates to the processor 1780, and can receive and execute commands sent from the processor 1780. In addition, the touch panel 1731 may be implemented in various types, such as resistive, capacitive, infrared, and surface acoustic wave. The input unit 1730 may include other input devices 1732 in addition to the touch panel 1731. Specifically, the other input devices 1732 may include, but are not limited to, one or more of a physical keyboard, function keys (such as volume control keys and switch keys), a trackball, a mouse, a joystick, and the like.
The display unit 1740 may be used to display information input by the user or provided to the user, as well as the various menus of the mobile phone. The display unit 1740 may include a display panel 1741; optionally, the display panel 1741 may be configured in the form of a Liquid Crystal Display (LCD), an Organic Light-Emitting Diode (OLED) display, or the like. Further, the touch panel 1731 may cover the display panel 1741. When the touch panel 1731 detects a touch operation on or near it, the touch operation is transmitted to the processor 1780 to determine the type of the touch event, and the processor 1780 then provides a corresponding visual output on the display panel 1741 according to the type of the touch event. Although in fig. 17 the touch panel 1731 and the display panel 1741 are shown as two separate components that implement the input and output functions of the mobile phone, in some embodiments the touch panel 1731 and the display panel 1741 may be integrated to implement the input and output functions of the mobile phone.
The handset may also include at least one sensor 1750, such as light sensors, motion sensors, and other sensors. Specifically, the light sensor may include an ambient light sensor that adjusts the brightness of the display panel 1741 according to the brightness of ambient light, and a proximity sensor that turns off the display panel 1741 and/or the backlight when the mobile phone is moved to the ear. As one of the motion sensors, the accelerometer sensor can detect the magnitude of acceleration in each direction (generally, three axes), can detect the magnitude and direction of gravity when stationary, and can be used for applications of recognizing the posture of a mobile phone (such as horizontal and vertical screen switching, related games, magnetometer posture calibration), vibration recognition related functions (such as pedometer and tapping), and the like; as for other sensors such as a gyroscope, a barometer, a hygrometer, a thermometer, and an infrared sensor, which can be configured on the mobile phone, further description is omitted here.
The audio circuit 1760, the speaker 1761, and the microphone 1762 may provide an audio interface between the user and the handset. The audio circuit 1760 may convert received audio data into an electrical signal and transmit it to the speaker 1761, which converts the electrical signal into a sound signal for output; conversely, the microphone 1762 converts collected sound signals into electrical signals, which the audio circuit 1760 receives and converts into audio data. The audio data is then output to the processor 1780 for processing and sent via the RF circuit 1710 to, for example, another mobile phone, or output to the memory 1720 for further processing.
WiFi belongs to short-distance wireless transmission technology, and the mobile phone can help a user to send and receive e-mails, browse webpages, access streaming media and the like through the WiFi module 1770, and provides wireless broadband Internet access for the user. Although fig. 17 shows the WiFi module 1770, it is understood that it does not belong to the essential constitution of the handset, and can be omitted entirely as needed within the scope not changing the essence of the invention.
The processor 1780 is the control center of the handset, connects various parts of the entire handset using various interfaces and lines, and performs various functions of the handset and processes data by running or executing software programs and/or modules stored in the memory 1720 and calling data stored in the memory 1720, thereby monitoring the entire handset. Optionally, processor 1780 may include one or more processing units; preferably, the processor 1780 may integrate an application processor, which primarily handles operating systems, user interfaces, application programs, etc., and a modem processor, which primarily handles wireless communications. It will be appreciated that the modem processor described above may not be integrated into processor 1780.
The handset also includes a power supply 1790 (e.g., a battery) to power the various components, which may preferably be logically connected to the processor 1780 via a power management system, to manage charging, discharging, and power consumption via the power management system.
Although not shown, the mobile phone may further include a camera, a bluetooth module, etc., which are not described herein.
In the present embodiment, the steps performed by the terminal can be implemented based on the structure shown in fig. 17.
Referring to fig. 18, fig. 18 is a block diagram of a server 1800 provided in this embodiment. The server 1800 may vary considerably depending on its configuration or performance, and may include one or more Central Processing Units (CPUs) 1822 (e.g., one or more processors), a memory 1832, and one or more storage media 1830 (e.g., one or more mass storage devices) storing an application program 1842 or data 1844. The memory 1832 and the storage medium 1830 may be transient storage or persistent storage. The program stored on the storage medium 1830 may include one or more modules (not shown), each of which may include a series of instruction operations for the server. Further, the central processor 1822 may be configured to communicate with the storage medium 1830 and to execute, on the server 1800, the series of instruction operations in the storage medium 1830.
The server 1800 may also include one or more power supplies 1826, one or more wired or wireless network interfaces 1850, one or more input-output interfaces 1858, and/or one or more operating systems 1841, such as Windows Server, Mac OS X™, Unix™, Linux™, FreeBSD™, and so forth.
In this embodiment, the central processor 1822 in the server 1800 may perform the following steps:
acquiring a conference reservation request sent by a first terminal; the conference reservation request comprises conference information, and the conference information comprises conference starting time, a first identity mark of the first terminal and a second identity mark of the second terminal;
generating a reservation record according to the conference reservation request; the reservation record comprises the conference information;
when it is determined according to the reservation record that the conference starting time has been reached, sending a conference starting request to a conference call server; the conference starting request comprises the first identity mark and the second identity mark; the first identity mark and the second identity mark are determined according to the reservation record;
or,
creating a conference room in response to the conference initiation request; the conference starting request is triggered by the conference management server according to the conference starting time in the reservation record; the conference starting request comprises a first identity mark of the first terminal and a second identity mark of the second terminal; the first identity mark and the second identity mark are determined according to the reservation record;
initiating a conference start prompt to a first terminal corresponding to the first identity mark;
if receiving a conference room entering notification sent by the first terminal, initiating a conference room entering invitation to a second terminal corresponding to the second identity;
or,
receiving a control instruction; the control instruction comprises an identity of a target terminal, and the control instruction is used for indicating the target terminal to change a call state;
and sending the control instruction to a target terminal corresponding to the identity identification.
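The first two alternative step sets above can be illustrated with a minimal in-memory sketch. It is not the implementation of this application: the class names, method names, the polling method `tick`, and the string log are all hypothetical, and real servers would exchange network messages rather than call each other directly. The sketch only shows the ordering: a reservation record is stored, the start request fires once when the start time is reached, the first terminal is prompted, and the second terminals are invited only after the first terminal enters the room.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Set, Tuple


@dataclass
class ReservationRecord:
    start_time: float        # conference start time (hypothetical unit: seconds)
    first_id: str            # identity of the first (initiating) terminal
    second_ids: List[str]    # identities of the second (invited) terminals


@dataclass
class ConferenceCallServer:
    """Hypothetical stand-in for the conference call server."""
    log: List[str] = field(default_factory=list)
    rooms: Dict[int, Tuple[str, List[str]]] = field(default_factory=dict)

    def start_conference(self, first_id: str, second_ids: List[str]) -> int:
        # Create a conference room, then prompt the first terminal.
        room_id = len(self.rooms) + 1
        self.rooms[room_id] = (first_id, list(second_ids))
        self.log.append(f"prompt:{first_id}")
        return room_id

    def on_room_entered(self, room_id: int, terminal_id: str) -> None:
        # Invite the second terminals only once the first terminal has entered.
        first_id, second_ids = self.rooms[room_id]
        if terminal_id == first_id:
            for second_id in second_ids:
                self.log.append(f"invite:{second_id}")


class ConferenceManagementServer:
    """Hypothetical stand-in for the conference management server."""

    def __init__(self, call_server: ConferenceCallServer) -> None:
        self.call_server = call_server
        self.records: List[ReservationRecord] = []
        self.started: Set[int] = set()

    def reserve(self, start_time: float, first_id: str,
                second_ids: List[str]) -> ReservationRecord:
        # Generate and keep a reservation record from the reservation request.
        record = ReservationRecord(start_time, first_id, list(second_ids))
        self.records.append(record)
        return record

    def tick(self, now: float) -> List[int]:
        # Send a start request for every record whose start time is reached,
        # taking both identities from the stored reservation record.
        room_ids = []
        for index, record in enumerate(self.records):
            if index not in self.started and now >= record.start_time:
                self.started.add(index)
                room_ids.append(self.call_server.start_conference(
                    record.first_id, record.second_ids))
        return room_ids
```

In this sketch, calling `reserve(100.0, "alice", ["bob", "carol"])` and then `tick(100.0)` creates one room and logs a prompt for `alice`; the invitations for `bob` and `carol` are logged only after `on_room_entered` is called with `alice`'s identity.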
The embodiment of the present application further provides a computer-readable storage medium, where the computer-readable storage medium is used to store a program code, and the program code is used to execute the audio and video conference implementation method described in the foregoing embodiments.
Embodiments of the present application further provide a computer program product including instructions, which when run on a computer, causes the computer to execute the audio and video conference implementation method described in the foregoing embodiments.
The terms "first," "second," "third," "fourth," and the like in the description of the application and the above-described figures, if any, are used for distinguishing between similar elements and not necessarily for describing a particular sequential or chronological order. It is to be understood that the data so used is interchangeable under appropriate circumstances such that the embodiments of the application described herein are, for example, capable of operation in sequences other than those illustrated or otherwise described herein. Furthermore, the terms "comprises," "comprising," and "having," and any variations thereof, are intended to cover a non-exclusive inclusion, such that a process, method, system, article, or apparatus that comprises a list of steps or elements is not necessarily limited to those steps or elements expressly listed, but may include other steps or elements not expressly listed or inherent to such process, method, article, or apparatus.
It should be understood that in the present application, "at least one" means one or more, and "a plurality" means two or more. "And/or" describes an association relationship between associated objects and indicates that three relationships may exist; for example, "A and/or B" may indicate: only A is present, only B is present, or both A and B are present, where A and B may be singular or plural. The character "/" generally indicates that the associated objects before and after it are in an "or" relationship. "At least one of the following" or similar expressions refer to any combination of these items, including any combination of a single item or plural items. For example, at least one of a, b, or c may represent: a, b, c, "a and b", "a and c", "b and c", or "a and b and c", where a, b, and c may each be single or plural.
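As a small illustration of the "at least one of a, b, or c" reading above, the following sketch enumerates every non-empty combination of a list of items. The function name `at_least_one_of` is hypothetical and is used only for this example.

```python
from itertools import combinations


def at_least_one_of(items):
    """Return every non-empty combination of the given items."""
    return [
        combo
        for size in range(1, len(items) + 1)
        for combo in combinations(items, size)
    ]


# Seven combinations for three items: each single item, each pair, and all three.
print(at_least_one_of(["a", "b", "c"]))
```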
In the several embodiments provided in the present application, it should be understood that the disclosed system, apparatus and method may be implemented in other manners. For example, the above-described apparatus embodiments are merely illustrative, and for example, the division of the units is only one logical division, and other divisions may be realized in practice, for example, a plurality of units or components may be combined or integrated into another system, or some features may be omitted, or not executed. In addition, the shown or discussed mutual coupling or direct coupling or communication connection may be an indirect coupling or communication connection through some interfaces, devices or units, and may be in an electrical, mechanical or other form.
The units described as separate parts may or may not be physically separate, and parts displayed as units may or may not be physical units, may be located in one place, or may be distributed on a plurality of network units. Some or all of the units can be selected according to actual needs to achieve the purpose of the solution of the embodiment.
In addition, functional units in the embodiments of the present application may be integrated into one processing unit, or each unit may exist alone physically, or two or more units are integrated into one unit. The integrated unit can be realized in a form of hardware, and can also be realized in a form of a software functional unit.
The integrated unit, if implemented in the form of a software functional unit and sold or used as a stand-alone product, may be stored in a computer-readable storage medium. Based on such understanding, the technical solution of the present application, in essence, or the part of it that contributes to the prior art, or all or part of the technical solution, may be embodied in the form of a software product. The software product is stored in a storage medium and includes instructions for causing a computer device (which may be a personal computer, a server, or a network device) to execute all or part of the steps of the method according to the embodiments of the present application. The aforementioned storage medium includes various media capable of storing program code, such as a USB flash drive, a removable hard disk, a Read-Only Memory (ROM), a Random Access Memory (RAM), a magnetic disk, or an optical disk.
The above embodiments are only used for illustrating the technical solutions of the present application, and not for limiting the same; although the present application has been described in detail with reference to the foregoing embodiments, it will be understood by those of ordinary skill in the art that: the technical solutions described in the foregoing embodiments may still be modified, or some technical features may be equivalently replaced; and such modifications or substitutions do not depart from the spirit and scope of the corresponding technical solutions in the embodiments of the present application.

Claims (13)

1. An audio and video conference implementation method is characterized by comprising the following steps:
the conference management server acquires a conference reservation request sent by a first terminal; the conference reservation request comprises conference information, and the conference information comprises conference starting time, a first identity mark of the first terminal and a second identity mark of the second terminal;
the conference management server generates a reservation record according to the conference reservation request; the reservation record comprises the conference information;
the conference management server sends the reservation record to the second terminal; the reservation record displayed on the second terminal can show the current conference state, and if the conference is in progress, the reservation record displayed on the second terminal provides an entrance for a user to enter the conference;
the conference management server sends a conference starting request to a conference call server when determining that the conference starting time is reached according to the reservation record; enabling the conference call server to respond to the conference starting request and sending a signaling room creating request to a room management server through a voice call server; receiving a first response message sent by the room management server through the voice call server; the first response message characterizes that the signaling room is successfully created, and comprises a room identifier and a key used for transmitting voice data; sending a data room creating request to a data management platform through the voice call server; receiving a second response message sent by the data management platform through the voice call server; the second response message represents that the data room is successfully created; after the conference room is created, the conference call server sends a third response message to the first terminal through the voice call server, and the conference room creation is indicated to be successful; the conference initiation request comprises the first identity and the second identity; the first identity mark and the second identity mark are determined according to the reservation record;
the method further comprises the following steps:
the conference management server records audio data in a conference process;
and the conference management server converts the audio data into text content to generate a conference record.
2. The method of claim 1, wherein after the conference management server obtains the conference reservation request sent by the first terminal, the method further comprises:
the conference management server sends a notification message to the second terminal, wherein the notification message comprises the conference information;
the conference management server receives a receipt message sent by the second terminal;
and the second identity in the conference starting request is the identity of the second terminal which is determined to participate in the conference according to the receipt message.
3. The method of claim 1, wherein the conference recording comprises a combination of one or more of textual content characterized by the audio data, start and end times of the conference, and participating members of the conference.
4. The method of claim 1, further comprising:
in a conference process, the conference management server receives a control instruction sent by a conference control server, wherein the control instruction comprises a third identity of a target terminal, and the control instruction is used for indicating the target terminal to change a call state;
and the conference management server changes the conference record according to the third identity.
5. The device for realizing the audio and video conference is characterized by comprising an acquisition unit, a generation unit and a sending unit:
the acquisition unit is used for acquiring a conference reservation request sent by a first terminal; the conference reservation request comprises conference information, and the conference information comprises conference starting time, a first identity mark of the first terminal and a second identity mark of the second terminal;
the generating unit is used for generating a reservation record according to the conference reservation request; the reservation record comprises the conference information;
the sending unit is used for sending a conference starting request to a conference call server when the meeting starting time is determined to be reached according to the reservation record; enabling the conference call server to respond to the conference starting request and sending a signaling room creating request to a room management server through a voice call server; receiving a first response message sent by the room management server through the voice call server; the first response message characterizes that the signaling room is successfully created, and comprises a room identifier and a key used for transmitting voice data; sending a data room creating request to a data management platform through the voice call server; receiving a second response message sent by the data management platform through the voice call server; the second response message represents that the data room is successfully created; after the conference room is created, the conference call server sends a third response message to the first terminal through the voice call server, and the conference room creation is indicated to be successful; the conference initiation request comprises the first identity and the second identity; the first identity mark and the second identity mark are determined according to the reservation record;
the audio and video conference realization device is also used for sending the reservation record to the second terminal; the reservation record displayed on the second terminal can show the current conference state, and if the conference is in progress, the reservation record displayed on the second terminal provides an entrance for a user to enter the conference;
the audio and video conference realization device is also used for:
the conference management server records audio data in a conference process;
and the conference management server converts the audio data into text content to generate a conference record.
6. An audio and video conference implementation method is characterized by comprising the following steps:
the conference call server responds to the conference starting request to establish a conference room; the conference starting request is triggered by the conference management server according to the conference starting time in the reservation record; the conference management server is further used for recording audio data in a conference process, converting the audio data into text content and generating a conference record, wherein the conference starting request comprises a first identity identifier of the first terminal and a second identity identifier of the second terminal; the first identity mark and the second identity mark are determined according to the reservation record;
the conference call server initiates a conference start prompt to a first terminal corresponding to the first identity;
if receiving a conference room entering notification sent by the first terminal, the conference call server initiates a conference room entering invitation to a second terminal corresponding to the second identity;
the second terminal also displays a reservation record, the reservation record displayed on the second terminal can show the current conference state, and if the conference is in progress, the reservation record displayed on the second terminal provides an entrance for a user to enter the conference;
the conference room includes a signaling room and a data room, and the conference call server creates a conference room in response to a conference initiation request, including:
the conference call server sends a signaling room creating request to a room management server through a voice call server;
the conference call server receives a first response message sent by the room management server through the voice call server; the first response message characterizes that the signaling room is successfully created, and comprises a room identifier and a key used for transmitting voice data;
the conference call server sends a data room establishing request to a data management platform through the voice call server;
the conference call server receives a second response message sent by the data management platform through the voice call server; the second response message represents that the data room is successfully created; and after the creation of the conference room is completed, the conference call server sends a third response message to the first terminal through the voice call server, and the conference room creation is indicated to be successful.
7. The method of claim 6, further comprising:
in a conference process, the conference call server receives a control instruction sent by a conference control server, wherein the control instruction comprises a third identity of a target terminal, and the control instruction is used for indicating the target terminal to change a call state;
and the conference call server forwards the control instruction to the target terminal according to the third identity.
8. An audio and video conference realization device is characterized by comprising a creation unit, a first sending unit and a second sending unit:
the creating unit is used for responding to a conference starting request to create a conference room; the conference starting request is triggered by the conference management server according to the conference starting time in the reservation record; the conference management server is further used for recording audio data in a conference process, converting the audio data into text content and generating a conference record, wherein the conference starting request comprises a first identity identifier of the first terminal and a second identity identifier of the second terminal; the first identity mark and the second identity mark are determined according to the reservation record;
the first sending unit is used for initiating a conference start reminding to a first terminal corresponding to the first identity;
the second sending unit is configured to initiate a conference room entry invitation to a second terminal corresponding to the second identity if the conference room entry notification of the first terminal is received; the second terminal also displays a reservation record, the reservation record displayed on the second terminal can show the current conference state, and if the conference is in progress, the reservation record displayed on the second terminal provides an entrance for a user to enter the conference;
the conference room includes a signaling room and a data room, and the creating unit is specifically configured to:
sending a signaling room creating request to a room management server through a voice call server;
receiving a first response message sent by the room management server through the voice call server; the first response message characterizes that the signaling room is successfully created, and comprises a room identifier and a key used for transmitting voice data;
sending a data room creating request to a data management platform through the voice call server;
receiving a second response message sent by the data management platform through the voice call server; the second response message represents that the data room is successfully created; and after the conference room is created, sending a third response message to the first terminal through the voice call server to indicate that the conference room is successfully created.
9. An audio and video conference implementation method is characterized by comprising the following steps:
a conference control server receives a control instruction sent by a first terminal where a conference initiator is located; the control instruction comprises an identity of a target terminal, the control instruction is used for indicating the target terminal to change a call state, and the target terminal is any one or more of second terminals where controlled participating members are located;
the conference control server sends the control instruction to a target terminal corresponding to the identity;
displaying a reservation record on the second terminal, wherein the reservation record displayed on the second terminal can show the current conference state, and if the conference is in progress, the reservation record displayed on the second terminal provides an entrance for a user to enter the conference;
the first terminal and the second terminal establish audio and video conference connection through a conference management server and a conference call server, and the conference call server is used for responding to a conference starting request and sending a signaling room establishing request to a room management server through a voice call server; receiving a first response message sent by the room management server through the voice call server; the first response message characterizes that the signaling room is successfully created, and comprises a room identifier and a key used for transmitting voice data; sending a data room creating request to a data management platform through the voice call server; receiving a second response message sent by the data management platform through the voice call server; the second response message represents that the data room is successfully created; after the conference room is created, the conference call server sends a third response message to the first terminal through the voice call server, and the conference room creation is indicated to be successful; the conference management server is also used for recording audio data in the conference process, converting the audio data into text content and generating a conference record.
10. An audio and video conference realization device is characterized by comprising a receiving unit and a sending unit:
the receiving unit is used for receiving a control instruction sent by a first terminal where a conference initiator is located; the control instruction comprises an identity of a target terminal, the control instruction is used for indicating the target terminal to change a call state, and the target terminal is any one or more of second terminals where controlled participating members are located;
the sending unit is used for sending the control instruction to the target terminal corresponding to the identity through the conference call server;
displaying a reservation record on the second terminal, wherein the reservation record displayed on the second terminal can show the current conference state, and if the conference is in progress, the reservation record displayed on the second terminal provides an entrance for a user to enter the conference;
the first terminal and the second terminal establish audio and video conference connection through a conference management server and a conference call server, and the conference call server is used for responding to a conference starting request and sending a signaling room establishing request to a room management server through a voice call server; receiving a first response message sent by the room management server through the voice call server; the first response message characterizes that the signaling room is successfully created, and comprises a room identifier and a key used for transmitting voice data; sending a data room creating request to a data management platform through the voice call server; receiving a second response message sent by the data management platform through the voice call server; the second response message represents that the data room is successfully created; after the conference room is created, the conference call server sends a third response message to the first terminal through the voice call server, and the conference room creation is indicated to be successful; the conference management server is also used for recording audio data in the conference process, converting the audio data into text content and generating a conference record.
11. A system for realizing an audio and video conference, characterized by comprising a conference management server, a conference call server and a conference control server:
the conference management server is used for: acquiring a conference reservation request sent by the first terminal, wherein the conference reservation request comprises conference information, and the conference information comprises a conference start time, a first identity of the first terminal and a second identity of the second terminal; generating a reservation record according to the conference reservation request, the reservation record comprising the conference information; and, when it is determined according to the reservation record that the conference start time has been reached, sending a conference start request to the conference call server, wherein the conference start request comprises the first identity and the second identity, which are determined according to the reservation record; the conference management server is also used for recording audio data during the conference, converting the audio data into text content and generating a conference record;
the conference management server sends the reservation record to the second terminal; the reservation record displayed on the second terminal can show the current conference state, and if the conference is in progress, the reservation record displayed on the second terminal provides an entrance for a user to enter the conference;
the conference call server is used for: creating a conference room in response to the conference start request; sending a conference start prompt to the first terminal corresponding to the first identity; and, upon receiving a notification that the first terminal has entered the conference room, sending a conference room entry invitation to the second terminal corresponding to the second identity; the conference room includes a signaling room and a data room, and the conference call server creating a conference room in response to the conference start request includes: the conference call server sends a signaling room creation request to a room management server through a voice call server; the conference call server receives a first response message sent by the room management server through the voice call server, wherein the first response message indicates that the signaling room is successfully created and comprises a room identifier and a key used for transmitting voice data; the conference call server sends a data room creation request to a data management platform through the voice call server; the conference call server receives a second response message sent by the data management platform through the voice call server, wherein the second response message indicates that the data room is successfully created; and, after the conference room is created, the conference call server sends a third response message to the first terminal through the voice call server to indicate that the conference room was created successfully;
the conference control server is used for receiving a control instruction, wherein the control instruction comprises an identity of a target terminal and instructs the target terminal to change its call state; and sending the control instruction, through the conference call server, to the target terminal corresponding to the identity.
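The room-creation exchange recited for the conference call server can be illustrated with a minimal Python sketch. It is not the patented implementation: every class and method name below is hypothetical, the servers are plain in-process objects rather than networked services, and keying the data room by the signaling room identifier is an assumption the claim does not state.

```python
import secrets
from dataclasses import dataclass
from typing import List, Set, Tuple


@dataclass
class FirstResponse:
    """First response message: the signaling room was created."""
    room_id: str
    voice_key: str  # key used for transmitting voice data


class RoomManagementServer:
    """Hypothetical stand-in that creates signaling rooms."""

    def create_signaling_room(self) -> FirstResponse:
        return FirstResponse(room_id=secrets.token_hex(4),
                             voice_key=secrets.token_hex(16))


class DataManagementPlatform:
    """Hypothetical stand-in that creates data rooms."""

    def __init__(self) -> None:
        self.data_rooms: Set[str] = set()

    def create_data_room(self, room_id: str) -> bool:
        # Assumption: the data room is keyed by the signaling room identifier.
        self.data_rooms.add(room_id)
        return True  # plays the role of the second response message


class VoiceCallServer:
    """Relay through which the conference call server reaches the back ends."""

    def __init__(self, rooms: RoomManagementServer,
                 data: DataManagementPlatform) -> None:
        self.rooms = rooms
        self.data = data
        self.outbox: List[Tuple[str, dict]] = []  # messages sent to terminals

    def request_signaling_room(self) -> FirstResponse:
        return self.rooms.create_signaling_room()

    def request_data_room(self, room_id: str) -> bool:
        return self.data.create_data_room(room_id)

    def notify_terminal(self, terminal_id: str, message: dict) -> None:
        self.outbox.append((terminal_id, message))


class ConferenceCallServer:
    def __init__(self, relay: VoiceCallServer) -> None:
        self.relay = relay

    def create_conference_room(self, first_identity: str) -> FirstResponse:
        # Step 1: signaling room request and first response.
        first = self.relay.request_signaling_room()
        # Step 2: data room request and second response.
        if not self.relay.request_data_room(first.room_id):
            raise RuntimeError("data room creation failed")
        # Step 3: third response message to the initiator's terminal.
        self.relay.notify_terminal(
            first_identity, {"type": "room_created", "room_id": first.room_id})
        return first
```

In this sketch the conference call server never talks to the room management server or the data management platform directly; every request passes through the relay object, mirroring the "through the voice call server" wording of the claim.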
12. An apparatus, comprising a processor and a memory:
the memory is used for storing program code and transmitting the program code to the processor;
the processor is configured to perform the method of any of claims 1-4 or 6-7 or 9 according to instructions in the program code.
13. A computer-readable storage medium, characterized in that the computer-readable storage medium is configured to store a program code for performing the method of any of claims 1-4 or 6-7 or 9.
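As an illustration of the reservation and control paths recited in the system claim, the following Python sketch shows a reservation record that triggers a conference start request once its start time is reached, and a control server that forwards a control instruction to each target terminal. All names are hypothetical, the "mute" action is an assumed example of a call-state change, and marking the record as in progress at the moment the start request is issued is a simplification.

```python
from dataclasses import dataclass
from datetime import datetime
from typing import Callable, Dict, List


@dataclass
class ReservationRecord:
    start_time: datetime
    first_identity: str
    second_identities: List[str]
    state: str = "scheduled"  # assumed states: "scheduled", "in_progress"

    def entry_available(self) -> bool:
        # The record offers an entrance only while the conference is in progress.
        return self.state == "in_progress"


class ConferenceManagementServer:
    def __init__(self) -> None:
        self.records: List[ReservationRecord] = []

    def reserve(self, start_time: datetime, first_identity: str,
                second_identities: List[str]) -> ReservationRecord:
        record = ReservationRecord(start_time, first_identity,
                                   list(second_identities))
        self.records.append(record)
        return record

    def due_start_requests(self, now: datetime) -> List[Dict]:
        """Build start requests for reservations whose start time is reached."""
        requests = []
        for record in self.records:
            if record.state == "scheduled" and now >= record.start_time:
                record.state = "in_progress"
                requests.append({
                    "first_identity": record.first_identity,
                    "second_identities": list(record.second_identities),
                })
        return requests


class ConferenceControlServer:
    def __init__(self, deliver: Callable[[str, str], None]) -> None:
        # `deliver` stands in for sending through the conference call server.
        self.deliver = deliver

    def handle(self, instruction: Dict) -> None:
        # Forward the requested call-state change to every target terminal.
        for target in instruction["targets"]:
            self.deliver(target, instruction["action"])
```

A caller would poll `due_start_requests` with the current time and pass each returned request to the conference call server; each reservation produces at most one start request because its state changes when the request is issued.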
CN202010212284.6A 2020-03-24 2020-03-24 Method, system and related device for realizing audio and video conference Active CN111212259B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202010212284.6A CN111212259B (en) 2020-03-24 2020-03-24 Method, system and related device for realizing audio and video conference

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202010212284.6A CN111212259B (en) 2020-03-24 2020-03-24 Method, system and related device for realizing audio and video conference

Publications (2)

Publication Number Publication Date
CN111212259A CN111212259A (en) 2020-05-29
CN111212259B CN111212259B (en) 2021-09-28

Family

ID=70787069

Family Applications (1)

Application Number Status Publication Number Priority Date Filing Date Title
CN202010212284.6A Active CN111212259B (en) 2020-03-24 2020-03-24 Method, system and related device for realizing audio and video conference

Country Status (1)

Country Link
CN (1) CN111212259B (en)

Families Citing this family (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111711784B (en) * 2020-06-15 2022-10-18 北京字节跳动网络技术有限公司 Conference control method and device, readable medium and electronic equipment
CN111711830B (en) * 2020-06-19 2022-08-05 广州市百果园信息技术有限公司 Live broadcast bit supplementing method and device, server and storage medium
CN111757042A (en) * 2020-06-28 2020-10-09 深圳市闪联信息技术有限公司 Remote collaborative conference method and system based on face authentication
CN114285681B (en) * 2020-07-06 2023-06-06 腾讯科技(深圳)有限公司 Conference initiating method, conference responding method, device and storage medium
CN111798013B (en) * 2020-08-07 2023-01-24 中国工商银行股份有限公司 Conference reservation processing method and device
CN114079651A (en) * 2020-08-19 2022-02-22 阿里巴巴集团控股有限公司 Conference processing method and device
CN114567747A (en) * 2020-11-27 2022-05-31 北京新媒传信科技有限公司 Conference data transmission method and conference system
CN113542659B (en) * 2021-05-24 2023-03-10 华为技术有限公司 Conference creation method, conference control method and electronic device
CN113487076A (en) * 2021-06-30 2021-10-08 武汉空心科技有限公司 Project task fund prediction system based on room management
CN114095689B (en) * 2021-11-11 2023-06-16 华能招标有限公司 Method and device for joining remote comment video conference
CN114760271B (en) * 2022-06-14 2022-09-20 深圳乐播科技有限公司 File processing method and device

Family Cites Families (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN100420194C (en) * 2006-10-08 2008-09-17 华为技术有限公司 Video conference system and its data transmission method and device
CN101257395B (en) * 2007-02-27 2010-12-08 中国移动通信集团公司 System and method for supporting multimedia conference booking
CN101150705A (en) * 2007-10-19 2008-03-26 中兴通讯股份有限公司 A method for realizing multi-point conference video control in initial session protocol
CN108494571B (en) * 2013-05-10 2021-01-05 华为技术有限公司 Method, device and system for initiating reservation conference
CN104284033A (en) * 2013-07-04 2015-01-14 深圳市潮流网络技术有限公司 Multi-party teleconference reservation method and communication devices
CN104683121B (en) * 2013-11-29 2018-06-05 华为技术有限公司 A kind of method and device of initiating network conference
CN105262750B (en) * 2015-10-21 2020-01-10 腾讯科技(深圳)有限公司 Method and equipment for automatically initiating session
CN107423947B (en) * 2016-05-31 2021-05-28 昆山龙腾光电股份有限公司 Conference management method and system
CN108111322B (en) * 2016-11-24 2021-07-16 北京中创视讯科技有限公司 Network conference control method and device
CN108632048B (en) * 2017-03-22 2020-12-22 展讯通信(上海)有限公司 Conference call control method and device and multi-pass terminal
JP2019102953A (en) * 2017-11-30 2019-06-24 キヤノンマーケティングジャパン株式会社 Web conference system, control method for the same, and program

Also Published As

Publication number Publication date
CN111212259A (en) 2020-05-29

Similar Documents

Publication Publication Date Title
CN111212259B (en) Method, system and related device for realizing audio and video conference
US11025686B2 (en) Network call method and apparatus, terminal, and server
US8701020B1 (en) Text chat overlay for video chat
CN105162693B (en) message display method and device
CN106533711B (en) Multimedia conference method and device
CN106803993B (en) Method and device for realizing video branch selection playing
CN107037949A (en) A kind of multi-screen display method and device
CN106973330B (en) Screen live broadcasting method, device and system
CN106059894B (en) Message processing method and device
CN105471704B (en) A kind of method, apparatus and system for realizing more people's calls
US10673790B2 (en) Method and terminal for displaying instant messaging message
CN105550860A (en) Payment method and device
US9363300B2 (en) Systems and methods for voice communication
JP2015507313A (en) Multi-user interface mirror interface navigation
KR20100091045A (en) Mobile terminal and multisession managing method thereof
CN108646961B (en) Management method and device for tasks to be handled and storage medium
WO2015085951A1 (en) Terminal, server, system and method for inviting friend to watch video
CN107483320B (en) Group creation method and server
US20140024362A1 (en) Method and apparatus for initiating a call in an electronic device
WO2016045277A1 (en) Method, device, and system for information acquisition
CN107888965A (en) Image present methods of exhibiting and device, terminal, system, storage medium
CN105515948A (en) Instant messaging method and device
CN114285681B (en) Conference initiating method, conference responding method, device and storage medium
CN104168178A (en) Method, apparatus and system of real-time communication during television broadcasting
CN110618806B (en) Application program control method and device, electronic equipment and storage medium

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant