CN111641800A - Method and device for realizing conference - Google Patents

Method and device for realizing conference Download PDF

Info

Publication number
CN111641800A
CN111641800A CN202010312487.2A CN202010312487A CN111641800A CN 111641800 A CN111641800 A CN 111641800A CN 202010312487 A CN202010312487 A CN 202010312487A CN 111641800 A CN111641800 A CN 111641800A
Authority
CN
China
Prior art keywords
conference
mobile terminal
video data
audio
speaking
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202010312487.2A
Other languages
Chinese (zh)
Inventor
杨乌拉
叶斌
沈世国
王艳辉
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Visionvera Information Technology Co Ltd
Original Assignee
Visionvera Information Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Visionvera Information Technology Co Ltd filed Critical Visionvera Information Technology Co Ltd
Priority to CN202010312487.2A priority Critical patent/CN111641800A/en
Publication of CN111641800A publication Critical patent/CN111641800A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N7/00Television systems
    • H04N7/14Systems for two-way working
    • H04N7/15Conference systems
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/60Network structure or processes for video distribution between server and client or between remote clients; Control signalling between clients, server and network components; Transmission of management data between server and client, e.g. sending from server to client commands for recording incoming content stream; Communication details between server and client 
    • H04N21/63Control signaling related to video distribution between client, server and network components; Network processes for video distribution between server and clients or between remote clients, e.g. transmitting basic layer and enhancement layers over different transmission paths, setting up a peer-to-peer communication via Internet between remote STB's; Communication protocols; Addressing
    • H04N21/643Communication protocols
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/60Network structure or processes for video distribution between server and client or between remote clients; Control signalling between clients, server and network components; Transmission of management data between server and client, e.g. sending from server to client commands for recording incoming content stream; Communication details between server and client 
    • H04N21/63Control signaling related to video distribution between client, server and network components; Network processes for video distribution between server and clients or between remote clients, e.g. transmitting basic layer and enhancement layers over different transmission paths, setting up a peer-to-peer communication via Internet between remote STB's; Communication protocols; Addressing
    • H04N21/643Communication protocols
    • H04N21/64322IP

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Telephonic Communication Services (AREA)

Abstract

The embodiment of the invention provides a method and a device for realizing a conference, wherein the method comprises the following steps: sending a conference room establishing request to a streaming media server; the meeting room establishing request carries a meeting room identifier; entering a meeting room corresponding to the meeting room identification; wherein the conference room is created by the streaming server in response to the conference room creation request; establishing connection with a WEB server; and sending a conference control instruction to at least one second mobile terminal entering the conference room through the WEB server so as to control the at least one second mobile terminal to speak or finish speaking. Therefore, the conference control can be carried out on the first mobile terminal through the WEB server, and the conference control does not need to be carried out through the conference management platform, so that the operation process of the conference control is facilitated.

Description

Method and device for realizing conference
Technical Field
The present invention relates to the field of communications technologies, and in particular, to a method and an apparatus for implementing a conference.
Background
The video networking is an important milestone for network development, is a higher-level form of the Internet, is a real-time network, can realize the real-time transmission of full-network high-definition videos which cannot be realized by the existing Internet, and pushes a plurality of Internet applications to high-definition video, and high definition faces each other. Finally, world no-distance is realized, and the distance between people in the world is only the distance of one screen.
At present, an audio and video service is integrated at a video network mobile terminal, and functions such as video call can be performed, and a client sometimes needs to perform a conference by multiple people.
Disclosure of Invention
In view of the above, embodiments of the present invention are proposed to provide a method of conference implementation, and a corresponding apparatus of conference implementation, which overcome or at least partially solve the above problems.
The embodiment of the invention discloses a method for realizing a conference, which is applied to a first mobile terminal and comprises the following steps:
sending a conference room establishing request to a streaming media server; the meeting room establishing request carries a meeting room identifier;
entering a meeting room corresponding to the meeting room identification; wherein the conference room is created by the streaming server in response to the conference room creation request;
establishing connection with a WEB server;
and sending a conference control instruction to at least one second mobile terminal entering the conference room through the WEB server so as to control the at least one second mobile terminal to speak or finish speaking.
Optionally, the conference control instruction includes a speak allowing instruction and a speak ending instruction;
the sending a conference control instruction to at least one second mobile terminal entering the conference room through the WEB server to control the at least one second mobile terminal to speak or finish speaking includes:
acquiring a speaking request sent by at least one second mobile terminal in the conference room through the WEB server; generating a speaking permission instruction according to the speaking request; sending the speaking permission instruction to the at least one second mobile terminal so as to control the at least one second mobile terminal to speak;
or the like, or, alternatively,
and sending the speaking ending instruction to the at least one second mobile terminal through the WEB server so as to control the at least one second mobile terminal to end speaking.
Optionally, the method further comprises:
sending a conference starting request to the streaming media server;
receiving a permission instruction returned by the streaming media server in response to the conference starting request;
pushing audio and video data to the streaming media server according to the permission instruction;
and sending the channel identification of the audio and video data to the conference room.
Optionally, the method further comprises:
acquiring a channel identifier of at least one audio/video data in the conference room;
transmitting a watching request to the streaming media server according to the channel identifier of the at least one piece of audio and video data;
receiving multi-channel audio and video data pushed by the streaming media server in response to the watching request;
and displaying a plurality of conference pictures according to the multi-path audio and video data.
Optionally, the displaying a plurality of conference pictures according to the multi-channel audio/video data includes:
carrying out shunting processing according to the channel identification of the multi-path audio/video data to obtain a plurality of target audio/video data;
determining a plurality of target display areas corresponding to the plurality of target audio and video data;
and displaying a plurality of conference pictures on the plurality of target display areas.
The embodiment of the invention also discloses a method for realizing the conference, which is applied to a second mobile terminal and comprises the following steps:
sending a conference entering request to a streaming media server; the conference joining request carries a conference room identifier;
entering a meeting room corresponding to the meeting room identification;
establishing connection with a WEB server;
and receiving a conference control instruction sent by a first mobile terminal entering the conference room through the WEB server, and speaking or ending speaking according to the conference control instruction.
Optionally, the conference control instruction includes a speak allowing instruction and a speak ending instruction;
the receiving, by the WEB server, a conference control instruction sent by a first mobile terminal entering the conference room, and speaking or ending the speech according to the conference control instruction, includes:
sending a speaking request to a first mobile terminal entering the conference room through the WEB server; receiving an speaking permission instruction returned by the first mobile terminal in response to the speaking request; speaking according to the speaking permission instruction;
or the like, or, alternatively,
receiving a speech ending instruction sent by a first mobile terminal entering the conference room through the WEB server; and ending the speaking according to the speaking ending instruction.
Optionally, the speaking according to the speaking permission instruction includes:
audio and video data are pushed to the streaming media server according to the speaking permission instruction;
and sending the channel identification of the audio and video data to the conference room.
Optionally, the method further comprises:
acquiring a channel identifier of at least one audio/video data in the conference room;
transmitting a watching request to the streaming media server according to the channel identifier of the at least one piece of audio and video data;
receiving multi-channel audio and video data pushed by the streaming media server in response to the watching request;
and displaying a plurality of conference pictures according to the multi-path audio and video data.
Optionally, the displaying a plurality of conference pictures according to the multi-channel audio/video data includes:
carrying out shunting processing according to the channel identification of the multi-path audio/video data to obtain a plurality of target audio/video data;
determining a plurality of target display areas corresponding to the plurality of target audio and video data;
and displaying a plurality of conference pictures on the plurality of target display areas.
The embodiment of the invention also discloses a device for realizing the conference, which is applied to the first mobile terminal and comprises:
a conference room creation request sending module, configured to send a conference room creation request to the streaming media server; the meeting room establishing request carries a meeting room identifier;
the conference room entering module is used for entering a conference room corresponding to the conference room identifier; wherein the conference room is created by the streaming server in response to the conference room creation request;
the connection module is used for establishing connection with a WEB server;
and the conference control instruction sending module is used for sending a conference control instruction to at least one second mobile terminal entering the conference room through the WEB server so as to control the at least one second mobile terminal to speak or finish speaking.
The embodiment of the invention also discloses a device for realizing the conference, which is applied to a second mobile terminal and comprises:
the conference joining request sending module is used for sending a conference joining request to the streaming media server; the conference joining request carries a conference room identifier;
the conference room entering module is used for entering a conference room corresponding to the conference room identifier;
the connection module is used for establishing connection with a WEB server;
and the conference control instruction receiving module is used for receiving a conference control instruction sent by a first mobile terminal entering the conference room through the WEB server and making a speech or ending the speech according to the conference control instruction.
The embodiment of the invention also discloses an electronic device, which comprises:
one or more processors; and
one or more machine-readable media having instructions stored thereon, which when executed by the one or more processors, cause the electronic device to perform the steps of a method according to any one of the embodiments of the invention.
The embodiment of the invention also discloses a computer readable storage medium, which stores a computer program for enabling a processor to execute the steps of the method according to any one of the embodiments of the invention.
The embodiment of the invention has the following advantages:
in the embodiment of the invention, the first mobile terminal sends a conference room establishing request to the streaming media server, enters the conference room corresponding to the conference room identifier in the conference room establishing request, and sends a conference control instruction to at least one second mobile terminal entering the conference room through the WEB server so as to control the at least one second mobile terminal to speak or finish speaking, so that the mobile terminal can control the conference through the WEB server without controlling the conference through the conference management platform, and the operation process of conference control is facilitated.
In addition, in the process of the conference, the second mobile terminal can issue audio and video data according to the conference control instruction, and the functions of watching the conference by multiple persons and speaking by the participation of the multiple persons are realized. The method is not limited to the situation that only the video networking terminal can release the audio and video data, does not need to rely on terminal equipment, has low requirements on the network environment, and can carry out services by common WiFi and 4G.
Drawings
FIG. 1 is a flow chart of the steps of a method embodiment of a conference implementation of the present invention;
FIG. 2 is a schematic diagram of a system for conference implementation of the present invention;
FIG. 3 is a flow chart of the steps of a method embodiment of another conferencing implementation of the present invention;
FIG. 4 is a block diagram of an apparatus embodiment of a conference implementation of the present invention;
FIG. 5 is a block diagram of an apparatus embodiment of another conferencing implementation of the present invention;
FIG. 6 is a networking schematic of a video network of the present invention;
FIG. 7 is a diagram of a hardware architecture of a node server according to the present invention;
fig. 8 is a schematic diagram of a hardware architecture of an access switch of the present invention;
fig. 9 is a schematic diagram of a hardware structure of an ethernet protocol conversion gateway according to the present invention.
Detailed Description
In order to make the aforementioned objects, features and advantages of the present invention comprehensible, embodiments accompanied with figures are described in further detail below.
Referring to fig. 1, a flowchart of steps of an embodiment of a method for implementing a conference is shown, where the method is applied to a first mobile terminal, and specifically includes the following steps:
step 101, sending a conference room establishing request to a streaming media server; the meeting room establishing request carries a meeting room identifier;
the first mobile terminal may be a host side for initiating a conference, and in the conference service, one first mobile terminal may initiate the conference, and one or more second mobile terminals may join the conference for participating.
In order to implement a multi-party conference of a mobile terminal, in the embodiment of the present invention, a first mobile terminal may control a participant to speak, so that a second mobile terminal as the participant may also upload audio and video data.
Specifically, when a conference needs to be initiated, the first mobile terminal logs in the streaming media server first and establishes a connection with the streaming media server, for example, the first mobile terminal establishes a socket connection with the streaming media server. The first mobile terminal may detect, in real time, an operation of the user on the conference initiation control, generate a conference room creation request, and send the conference room creation request to the streaming media server, for example, send the conference room creation request to the streaming media server through a socket. The conference room creating request may carry a conference room identifier, and is used to request the streaming media server to create a conference room corresponding to the conference room identifier. The conference room identification may be a unique identification code for the conference room, for example, the conference room identification is 006823.
Step 102, entering a meeting room corresponding to the meeting room identification; wherein the conference room is created by the streaming server in response to the conference room creation request;
in the embodiment of the present invention, the first mobile terminal may enter a conference room corresponding to the conference room identifier, and the conference room is created by the streaming server in response to the conference room creation request.
After the first mobile terminal enters the conference room, the streaming media server may set the first mobile terminal as a host, and grant the first mobile terminal with a right to control the conference process, for example, control the participant to speak, end the speaking of the participant, and forbid speaking of the participant.
103, establishing connection with a WEB server;
the WEB server may be a server in the conference system for transmitting interactive data between the mobile terminals participating in the conference.
In the embodiment of the present invention, the first mobile terminal may establish a connection with the WEB server.
Specifically, the first mobile terminal may log in to the WEB server, and send a login request to the WEB server, and the WEB server establishes a connection with the first mobile terminal after receiving the login request sent by the first mobile terminal, for example, the WEB server may allocate a websocket interface to the first mobile terminal, and establish a connection between the websocket interface and the first mobile terminal, thereby implementing establishment of a websocket connection between the first mobile terminal and the WEB server. The websocket is an application layer protocol and is a full-duplex communication protocol newly added in HTML5, and after the client and the server are successfully connected based on TCP handshake, persistent connection can be established between the client and the server, so that bidirectional data transmission is realized.
And 104, sending a conference control instruction to at least one second mobile terminal entering the conference room through the WEB server to control the at least one second mobile terminal to speak or finish speaking.
The second mobile terminal may be a mobile terminal that participates in the conference as a participant, and the second mobile terminal as a participant has no authority to control the conference process. The second mobile terminal may request the first mobile terminal to speak in a manner of raising a hand.
In this embodiment of the present invention, the first mobile terminal may send a conference control instruction to at least one second mobile terminal entering the conference room through the WEB server, so as to control the at least one second mobile terminal to speak or end speaking. The conference control command may be an instruction or a command for controlling a mobile terminal entering a conference room to perform a corresponding conference process, and the conference control command may include a talk-banning command, a talk-allowing command, a talk-ending command, and the like. Specifically, the first mobile terminal may send a conference control instruction to at least one second mobile terminal entering the conference room through a websocket connection connected to the WEB server.
In a specific implementation, the first mobile terminal may pull up at least one second mobile terminal in the conference room for speaking through the conference control instruction. The second mobile terminal may also send a speech request to the first mobile terminal through the elevator, and the first mobile terminal determines whether to allow speech according to circumstances and generates a corresponding conference control instruction. After the speech instruction is allowed to be transmitted to a certain second mobile terminal, the second mobile terminal can push audio and video data to the streaming media server by using socket, after the audio and video data are successfully pushed, the second mobile terminal can also send speech messages to other second mobile terminals entering a conference room by using websocket, and after the speech messages are received by the other second mobile terminals, the other second mobile terminals can request to receive the audio and video data of the second mobile terminal to the streaming media server, so that a new conference picture is started.
Multiple conference pictures can be supported on the mobile terminal, for example, four conference pictures can be supported. When a second mobile terminal wants to speak, the first mobile terminal can hold one's hands, and the first mobile terminal can adjust according to the situation, end the speaking of a certain second mobile terminal, and then permit a new second mobile terminal to speak. The second mobile terminal can also automatically finish speaking so as to finish pushing the audio and video data.
In the process of the conference, the second mobile terminal can speak according to the conference control instruction, and the functions of watching the conference by multiple persons and speaking by participation of the multiple persons are achieved. The method is not limited to the situation that only the video networking terminal can release the audio and video data, does not need to rely on terminal equipment, has low requirements on the network environment, and can carry out services by common WiFi and 4G.
In the conference process, if the second mobile terminal in the conference room does not speak, the second mobile terminal can be used as a conference picture of the viewer watching the multi-channel stream, and the second mobile terminal can request one path of audio and video stream from the streaming media server every time the second mobile terminal receives the speaking message of other mobile terminals. The streaming media server can mix and push the multi-channel video and audio streams requested by the mobile terminal to the first mobile terminal and/or the second mobile terminal, the first mobile terminal and/or the second mobile terminal split the channel identifications of different audio and video data, the pictures are distributed to corresponding video windows for display, the audio is directly played, and the effect of watching multi-channel conference video is achieved. If the mobile terminal is a speaker, one of the pictures is a released local picture, the local picture can be pushed to the streaming media server, and the other three pictures can request audio and video data of other mobile terminals speaking according to the speaking information to be watched.
In a preferred embodiment of the invention, the conference control instruction comprises a speak allow instruction and a speak end instruction; the step 104 may comprise the sub-steps of:
acquiring a speaking request sent by at least one second mobile terminal in the conference room through the WEB server; generating a speaking permission instruction according to the speaking request; sending the speaking permission instruction to the at least one second mobile terminal so as to control the at least one second mobile terminal to speak; or, the speaking ending instruction is sent to the at least one second mobile terminal through the WEB server, so as to control the at least one second mobile terminal to end speaking.
The speaking permission instruction is used for controlling the second mobile terminal to speak and pushing audio and video data to the streaming media server; and the speech ending instruction is used for controlling the second mobile terminal to end speech and end the pushing of the audio and video data to the streaming media server. In addition, the conference control instruction may also include other instructions, such as a talk-bant instruction (the talk-bant second mobile terminal will not be able to speak), and the like, which is not limited by the embodiment of the present invention.
The talk request may be a request message for requesting to talk. Specifically, a graphical user interface provided by the second mobile terminal may include a hand-holding control, and the user may request to speak by clicking the hand-holding control, and when the second mobile terminal detects that the hand-holding control is triggered, the second mobile terminal may generate a speaking request and send the speaking request to the WEB server.
In this embodiment of the present invention, the first mobile terminal may obtain, through the WEB server, the utterance request sent by at least one second mobile terminal in the conference room. After acquiring the speaking request, the first mobile terminal may generate a speaking permission instruction and send the speaking permission instruction to the at least one second mobile terminal, so as to control the at least one second mobile terminal to speak.
As an example, a confirmation speech control is included in a graphical user interface provided by the first mobile terminal, the host may click the confirmation speech control to allow the speech when receiving the speech request, and the first mobile terminal may generate the speech allowing instruction after detecting a click operation for the confirmation speech control. In addition, a speech rejection control may be further included in the graphical user interface provided by the first mobile terminal, the host may click the speech rejection control to reject the speech when receiving the speech request, and the first mobile terminal does not perform a subsequent process of generating the speech instruction after detecting a click operation for the speech rejection control.
In a specific implementation, a speech ending control may be further included in the graphical user interface provided by the first mobile terminal, and when a host speaks, the host may click the speech ending control to end the user's speech. After detecting the click operation on the talk ending control, the first mobile terminal may generate a talk ending instruction, and send the talk ending instruction to the at least one second mobile terminal through the WEB server, so as to control the at least one second mobile terminal to end the talk.
In the embodiment of the present invention, the conference control command (e.g., the speak allowing command and the speak ending command) may be sent by the first mobile terminal to the second mobile terminal through a websocekt connection established with the WEB server. Therefore, the conference control instruction sent by the first mobile terminal is transmitted on the mobile terminal through the WEB server, so that the conference control is realized, the conference control does not need to be carried out through the conference management platform, and the operation process of the conference control is facilitated.
In a preferred embodiment of the present invention, the method may further comprise the steps of:
sending a conference starting request to the streaming media server; receiving a permission instruction returned by the streaming media server in response to the conference starting request; pushing audio and video data to the streaming media server according to the permission instruction; and sending the channel identification of the audio and video data to the conference room.
The open conference request is used to request initiation of a conference from the streaming server. In a specific implementation, a conference opening control may be included in a graphical user interface provided by the first mobile terminal, when a user needs to initiate a conference, the conference opening control on the graphical user interface may be clicked, and after detecting a click operation of the user on the conference opening control, the first mobile terminal may generate a conference opening request and send the conference opening request to the streaming media server.
After receiving the conference opening request sent by the first mobile terminal, the streaming media server may generate a permission instruction in response to the conference opening request, where the permission instruction may be an instruction or a command that allows the first mobile terminal to open a conference. The first mobile terminal can receive a permission instruction returned by the streaming media server and start to push audio and video data to the streaming media server according to the permission instruction. Therefore, the streaming media server can push the audio and video data to at least one second mobile terminal in the conference room.
In an embodiment of the present invention, the audio/video data may have a corresponding channel identifier, and the channel identifier is used to uniquely identify the data stream. After the first mobile terminal pushes the audio and video data to the streaming media server, the first mobile terminal can send the channel identifier of the audio and video data to the conference room, so that a participant in the conference room can request the streaming media server for the corresponding audio and video data according to the channel identifier.
In a preferred embodiment of the present invention, the method further comprises:
acquiring a channel identifier of at least one audio/video data in the conference room; transmitting a watching request to the streaming media server according to the channel identifier of the at least one piece of audio and video data; receiving multi-channel audio and video data pushed by the streaming media server in response to the watching request; and displaying a plurality of conference pictures according to the multi-path audio and video data.
The first mobile terminal can acquire a channel identifier of at least one piece of audio and video data in the conference room, wherein the channel identifier of the audio and video data in the conference room can be sent to the conference room by the second mobile terminal serving as a participant when speaking.
The first mobile terminal may send a viewing request to the streaming media server according to the obtained at least one channel identifier of the audio/video data, and specifically, may send the viewing request to the streaming media server by using a socket, where the viewing request carries the at least one channel identifier of the audio/video data.
After receiving the watching request, the streaming media server can extract a channel identifier of at least one piece of audio/video data in the watching request, then corresponding audio/video data are obtained according to the channel identifiers respectively, if the channel identifiers are multiple, the obtained audio/video data are multiple, the streaming media server mixes the obtained multiple pieces of audio/video data to generate multiple paths of audio/video data, and the multiple paths of audio/video data are pushed to the first mobile terminal; if only one path of audio and video data is available, the audio and video data does not need to be mixed, and the audio and video data can be directly pushed to the first mobile terminal. The first mobile terminal can receive the multi-channel audio and video data pushed by the streaming media server and display a plurality of conference pictures according to the multi-channel audio and video data.
In a preferred embodiment of the present invention, the presenting a plurality of conference pictures according to the multi-channel audio/video data may include the following sub-steps:
carrying out shunting processing according to the channel identification of the multi-path audio/video data to obtain a plurality of target audio/video data; determining a plurality of target display areas corresponding to the plurality of target audio and video data; and displaying a plurality of conference pictures on the plurality of target display areas.
In the embodiment of the invention, the multi-channel audio and video data can have the channel identifications corresponding to the audio and video data one by one. The first mobile terminal can perform shunting processing according to the channel identification of the multi-channel audio and video data to obtain a plurality of target audio and video data, further determine a plurality of target display areas corresponding to the plurality of target audio and video data according to the channel identification, and display a plurality of conference pictures on the plurality of target display areas.
In a preferred embodiment of the present invention, the method further comprises:
and sending or receiving chat information through a WEB server so as to carry out chat interaction with a second mobile terminal entering the conference room.
In the embodiment of the present invention, the first mobile terminal may further send or receive chat information through a WEB server, for example, by using websocket connection, so as to perform chat interaction with the second mobile terminal entering the conference room.
The chat information can comprise interactive text information, sign-in information, information for kicking out a meeting room, information for exiting the meeting room and the like.
The interactive text information may be chat text data. In the conference room, the user can chat in text while watching the video of the conference, during which the user can interact with other users. Meanwhile, in a conference room, a user can see a current online user list returned by the websocekt, and can select a corresponding user to perform all text chats or select one of the users to perform independent chats through the current online user list.
The check-in information may be statistical information for checking-in, such as the number of people online. The host can issue the check-in information, and the watching users can respond to the check-in information to participate in the check-in after receiving the check-in information, so that the host can watch the number of people in the conference at present.
The information of kicking out the conference room can be information of kicking out the conference room by audiences, and is used for controlling users in the conference room by a host, the users receive corresponding websocekt information when being kicked, and then the corresponding socekt instruction is called to finish the corresponding conference watching service. The exit conference room information may be information that the viewer exits the conference room.
Fig. 2 is a schematic diagram of a system for implementing a conference according to an embodiment of the present invention, where the system for implementing a conference includes a streaming media server, a WEB server, a first mobile peer and two second mobile peers, where the first mobile peer serves as a host in the conference and the two second mobile peers serve as participants. The first mobile terminal can send a conference control instruction to the second mobile terminal through the WEB server to control the second mobile terminal entering a conference room, so that conference control operation is facilitated.
In the embodiment of the invention, the first mobile terminal sends a conference room establishing request to the streaming media server, enters the conference room corresponding to the conference room identifier in the conference room establishing request, and sends a conference control instruction to at least one second mobile terminal entering the conference room through the WEB server so as to control the at least one second mobile terminal to speak or finish speaking, so that the mobile terminal can control the conference through the WEB server without controlling the conference through the conference management platform, and the operation process of conference control is facilitated.
In addition, in the process of the conference, the second mobile terminal can issue audio and video data according to the conference control instruction, and the functions of watching the conference by multiple persons and speaking by the participation of the multiple persons are realized. The method is not limited to the situation that only the video networking terminal can release the audio and video data, does not need to rely on terminal equipment, has low requirements on the network environment, and can carry out services by common WiFi and 4G.
Referring to fig. 3, a flowchart of steps of an embodiment of a method for implementing a conference is shown, where the method is applied to a second mobile terminal, and specifically includes the following steps:
step 301, sending a conference joining request to a streaming media server; the conference joining request carries a conference room identifier;
the second mobile terminal may be a participant for participating in the conference, and in the conference service, one or more second mobile terminals may join in the conference for participating in the conference, and the second mobile terminal may also have a first mobile terminal, and the first mobile terminal may be a host for initiating the conference.
When a user needs to participate in a conference, a conference joining request may be sent to the streaming media server through the second mobile terminal, where the conference joining request may carry a conference room identifier, and the conference room identifier may be a unique identifier of a conference room, for example, the conference room identifier is 006823.
In a specific implementation, the second mobile terminal may establish a connection with the streaming media server before sending the conference entry request. If the second mobile terminal can send a login request to the streaming media server, after receiving the login request, the streaming media server can allocate a socket interface to the second mobile terminal and establish the connection between the socket interface and the second mobile terminal, so as to realize the connection between the second mobile terminal and the streaming media server. The second mobile terminal may send a conference entry request to the streaming media server using the socket.
Step 302, entering a conference room corresponding to the conference room identifier;
in the embodiment of the present invention, the second mobile terminal may enter a conference room corresponding to the conference room identifier, and the conference room may be created by the streaming server in response to the conference room creation request sent by the first mobile terminal. After the second mobile terminal enters the conference room, the streaming media server can set the second mobile terminal as a participant, and the mobile terminal of the participant does not have the authority to control the conference process.
Step 303, establishing connection with a WEB server;
the WEB server may be a server in the conference system for transmitting interactive data between the mobile terminals participating in the conference. In the embodiment of the present invention, the second mobile terminal may establish a connection with the WEB server.
Specifically, the second mobile terminal may log in to the WEB server, and by sending a login request to the WEB server, the WEB server establishes a connection with the first mobile terminal after receiving the login request sent by the second mobile terminal, for example, the WEB server may allocate a websocket interface to the second mobile terminal, and establish a connection between the websocket interface and the second mobile terminal, thereby establishing a websocket connection between the second mobile terminal and the WEB server. The websocket is an application layer protocol and is a full-duplex communication protocol newly added in HTML5, and after the client and the server are successfully connected based on TCP handshake, persistent connection can be established between the client and the server, so that bidirectional data transmission is realized.
And 304, receiving a conference control instruction sent by a first mobile terminal entering the conference room through the WEB server, and speaking or ending speaking according to the conference control instruction.
In the embodiment of the present invention, the second mobile terminal may receive, through the WEB server, the conference control instruction sent by the first mobile terminal entering the conference room, and speak or end speaking according to the conference control instruction. The conference control instruction may be an instruction or a command for controlling a mobile terminal entering a conference room to perform a corresponding conference process, and the conference control instruction may include a speech prohibition instruction, a speech permission instruction, a speech termination instruction, and the like.
In a preferred embodiment of the invention, the conference control instruction comprises a speak allow instruction and a speak end instruction; the step 304 may include the following sub-steps:
sending a speaking request to a first mobile terminal entering the conference room through the WEB server; receiving an speaking permission instruction returned by the first mobile terminal in response to the speaking request; speaking according to the speaking permission instruction; or, receiving a speech ending instruction sent by a first mobile terminal entering the conference room through the WEB server; and ending the speaking according to the speaking ending instruction.
Specifically, on a graphical user interface provided by the second mobile terminal, a hand-holding control may be included, the user may request to speak by clicking the hand-holding control, when the second mobile terminal detects that the hand-holding control is triggered, a speaking request may be generated and sent to the WEB server, and then the WEB server may send the speaking request to the first mobile terminal entering the conference room. The first mobile station may generate an utterance permission instruction in response to the utterance request after receiving the utterance request, and return the generated utterance permission instruction to the second mobile station. The second mobile station can make a comment according to the permitted comment instruction after receiving the permitted comment instruction.
The second mobile terminal can also receive a speech ending instruction sent by the first mobile terminal entering the conference room through the WEB server, and finish the speech according to the speech ending instruction. Ending the utterance may be ending the pushing of the audio-video data to the streaming media server.
In a preferred embodiment of the present invention, the speaking according to the speaking permission instruction includes:
audio and video data are pushed to the streaming media server according to the speaking permission instruction; and sending the channel identification of the audio and video data to the conference room.
In the embodiment of the invention, the second mobile terminal can push the audio and video data to the streaming media server according to the speaking permission instruction, and send the channel identifier of the audio and video data to the conference room.
Specifically, the second mobile terminal may push audio and video data to the streaming media server through a socket connection established with the streaming media server according to the instruction allowing the speech, and send a channel identifier of the audio and video data to the conference room through a websocket connection established with the WEB server.
In a preferred embodiment of the present invention, the method further comprises:
acquiring a channel identifier of at least one audio/video data in the conference room; transmitting a watching request to the streaming media server according to the channel identifier of the at least one piece of audio and video data; receiving multi-channel audio and video data pushed by the streaming media server in response to the watching request; and displaying a plurality of conference pictures according to the multi-path audio and video data.
The second mobile terminal can acquire a channel identifier of at least one piece of audio and video data in the conference room, wherein the channel identifier of the audio and video data in the conference room can be sent to the conference room by the second mobile terminal serving as a participant when speaking, and can also be sent to the conference room by the first mobile terminal serving as a host.
The second mobile terminal may send a viewing request to the streaming media server according to the obtained at least one channel identifier of the audio/video data, and specifically, may send a viewing request to the streaming media server by using a socket, where the viewing request carries the at least one channel identifier of the audio/video data.
After receiving the watching request, the streaming media server can extract a channel identifier of at least one piece of audio/video data in the watching request, then corresponding audio/video data are obtained according to the channel identifiers respectively, if the channel identifiers are multiple, the obtained audio/video data are multiple, the streaming media server mixes the obtained multiple pieces of audio/video data to generate multiple paths of audio/video data, and the multiple paths of audio/video data are pushed to the second mobile terminal; if only one path of audio and video data is available, the audio and video data does not need to be mixed, and the audio and video data can be directly pushed to the second mobile terminal. The second mobile terminal can receive the multi-channel audio and video data pushed by the streaming media server and display a plurality of conference pictures according to the multi-channel audio and video data.
In a preferred embodiment of the present invention, the displaying a plurality of conference pictures according to the multi-channel audio/video data includes:
carrying out shunting processing according to the channel identification of the multi-path audio/video data to obtain a plurality of target audio/video data; determining a plurality of target display areas corresponding to the plurality of target audio and video data; and displaying a plurality of conference pictures on the plurality of target display areas.
In the embodiment of the invention, the multi-channel audio and video data can have the channel identifications corresponding to the audio and video data one by one. The second mobile terminal can perform shunting processing according to the channel identification of the multi-channel audio and video data to obtain a plurality of target audio and video data, further determine a plurality of target display areas corresponding to the plurality of target audio and video data according to the channel identification, and display a plurality of conference pictures on the plurality of target display areas.
It should be noted that, for simplicity of description, the method embodiments are described as a series of acts or combination of acts, but those skilled in the art will recognize that the present invention is not limited by the illustrated order of acts, as some steps may occur in other orders or concurrently in accordance with the embodiments of the present invention. Further, those skilled in the art will appreciate that the embodiments described in the specification are presently preferred and that no particular act is required to implement the invention.
Referring to fig. 4, a block diagram of a structure of an embodiment of an apparatus for implementing a conference is shown, and is applied to a first mobile terminal, where the apparatus may specifically include the following modules:
a conference room creation request sending module 401, configured to send a conference room creation request to the streaming media server; the meeting room establishing request carries a meeting room identifier;
a conference room entering module 402, configured to enter a conference room corresponding to the conference room identifier; wherein the conference room is created by the streaming server in response to the conference room creation request;
a connection module 403, configured to establish a connection with a WEB server;
a conference control instruction sending module 404, configured to send a conference control instruction to at least one second mobile terminal entering the conference room through the WEB server, so as to control the at least one second mobile terminal to speak or end speaking.
In a preferred embodiment of the invention, the conference control instruction comprises a speak allow instruction and a speak end instruction;
the conference control instruction sending module 404 includes:
the first instruction sending submodule is used for acquiring a speaking request sent by at least one second mobile terminal in the conference room through the WEB server; generating a speaking permission instruction according to the speaking request; sending the speaking permission instruction to the at least one second mobile terminal so as to control the at least one second mobile terminal to speak;
or the like, or, alternatively,
and the second instruction sending submodule is used for sending the speaking ending instruction to the at least one second mobile terminal through the WEB server so as to control the at least one second mobile terminal to end speaking.
In a preferred embodiment of the present invention, the method further comprises:
a conference starting request sending module, configured to send a conference starting request to the streaming media server;
a permission instruction receiving module, configured to receive a permission instruction returned by the streaming media server in response to the conference starting request;
the audio and video data pushing module is used for pushing audio and video data to the streaming media server according to the permission instruction;
and the channel identifier sending module is used for sending the channel identifier of the audio and video data to the conference room.
In a preferred embodiment of the present invention, the apparatus further comprises:
the channel identifier acquisition module is used for acquiring a channel identifier of at least one piece of audio and video data in the conference room;
a watching request sending module, configured to send a watching request to the streaming media server according to the channel identifier of the at least one piece of audio/video data;
the multi-channel audio and video data receiving module is used for receiving the multi-channel audio and video data pushed by the streaming media server responding to the watching request;
and the conference picture display module is used for displaying a plurality of conference pictures according to the multi-path audio and video data.
In a preferred embodiment of the present invention, the conference screen display module includes:
the distribution submodule is used for carrying out distribution processing according to the channel identification of the multi-path audio/video data to obtain a plurality of target audio/video data;
the display area determining submodule is used for determining a plurality of target display areas corresponding to the plurality of target audio and video data;
and the conference picture display submodule is used for displaying a plurality of conference pictures on the target display areas.
Referring to fig. 5, a block diagram of a structure of an embodiment of an apparatus for implementing a conference is shown, and is applied to a second mobile terminal, where the apparatus may specifically include the following modules:
a conference joining request sending module 501, configured to send a conference joining request to a streaming media server; the conference joining request carries a conference room identifier;
a conference room entering module 502, configured to enter a conference room corresponding to the conference room identifier;
the connection module 503 is configured to establish a connection with a WEB server;
and a conference control instruction receiving module 504, configured to receive, through the WEB server, a conference control instruction sent by a first mobile terminal entering the conference room, and speak or end speaking according to the conference control instruction.
In a preferred embodiment of the invention, the conference control instruction comprises a speak allow instruction and a speak end instruction; the conference control instruction receiving module 504 includes:
the first instruction receiving module is used for sending a speaking request to a first mobile terminal entering the conference room through the WEB server; receiving an speaking permission instruction returned by the first mobile terminal in response to the speaking request; speaking according to the speaking permission instruction;
or the like, or, alternatively,
the second instruction receiving module is used for receiving a speech ending instruction sent by a first mobile terminal entering the conference room through the WEB server; and ending the speaking according to the speaking ending instruction.
In a preferred embodiment of the present invention, the first instruction receiving module includes:
the audio and video data pushing submodule is used for pushing the audio and video data to the streaming media server according to the speaking permission instruction;
and the channel identifier sending submodule is used for sending the channel identifier of the audio and video data to the conference room.
In a preferred embodiment of the present invention, the method further comprises:
the channel identifier acquisition module is used for acquiring a channel identifier of at least one piece of audio and video data in the conference room;
a watching request sending module, configured to send a watching request to the streaming media server according to the channel identifier of the at least one piece of audio/video data;
the audio and video data receiving module is used for receiving the multi-channel audio and video data pushed by the streaming media server responding to the watching request;
and the conference picture display module is used for displaying a plurality of conference pictures according to the multi-path audio and video data.
In a preferred embodiment of the present invention, the conference screen display module includes:
the distribution submodule is used for carrying out distribution processing according to the channel identification of the multi-path audio/video data to obtain a plurality of target audio/video data;
the display area determining submodule is used for determining a plurality of target display areas corresponding to the plurality of target audio and video data;
and the conference picture display submodule is used for displaying a plurality of conference pictures on the target display areas.
For the device embodiment, since it is basically similar to the method embodiment, the description is simple, and for the relevant points, refer to the partial description of the method embodiment.
An embodiment of the present invention further provides an electronic device, including:
one or more processors; and
one or more machine-readable media having instructions stored thereon, which when executed by the one or more processors, cause the electronic device to perform one or more of the method steps as described in embodiments of the invention.
Embodiments of the present invention also provide a computer-readable storage medium storing a computer program for causing a processor to execute the steps of the method according to the embodiments of the present invention.
The video networking is an important milestone for network development, is a real-time network, can realize high-definition video real-time transmission, and pushes a plurality of internet applications to high-definition video, and high-definition faces each other.
The video networking adopts a real-time high-definition video exchange technology, can integrate required services such as dozens of services of video, voice, pictures, characters, communication, data and the like on a system platform on a network platform, such as high-definition video conference, video monitoring, intelligent monitoring analysis, emergency command, digital broadcast television, delayed television, network teaching, live broadcast, VOD on demand, television mail, Personal Video Recorder (PVR), intranet (self-office) channels, intelligent video broadcast control, information distribution and the like, and realizes high-definition quality video broadcast through a television or a computer.
To better understand the embodiments of the present invention, the following description refers to the internet of view:
some of the technologies applied in the video networking are as follows:
network Technology (Network Technology)
Network technology innovation in video networking has improved the traditional Ethernet (Ethernet) to face the potentially huge first video traffic on the network. Unlike pure network Packet Switching (Packet Switching) or network Circuit Switching (Circuit Switching), the Packet Switching is adopted by the technology of the video networking to meet the Streaming requirement. The video networking technology has the advantages of flexibility, simplicity and low price of packet switching, and simultaneously has the quality and safety guarantee of circuit switching, thereby realizing the seamless connection of the whole network switching type virtual circuit and the data format.
Switching Technology (Switching Technology)
The video network adopts two advantages of asynchronism and packet switching of the Ethernet, eliminates the defects of the Ethernet on the premise of full compatibility, has end-to-end seamless connection of the whole network, is directly communicated with a user terminal, and directly bears an IP data packet. The user data does not require any format conversion across the entire network. The video networking is a higher-level form of the Ethernet, is a real-time exchange platform, can realize the real-time transmission of the whole-network large-scale high-definition video which cannot be realized by the existing Internet, and pushes a plurality of network video applications to high-definition and unification.
Server Technology (Server Technology)
The server technology on the video networking and unified video platform is different from the traditional server, the streaming media transmission of the video networking and unified video platform is established on the basis of connection orientation, the data processing capacity of the video networking and unified video platform is independent of flow and communication time, and a single network layer can contain signaling and data transmission. For voice and video services, the complexity of video networking and unified video platform streaming media processing is much simpler than that of data processing, and the efficiency is greatly improved by more than one hundred times compared with that of a traditional server.
Storage Technology (Storage Technology)
The super-high speed storage technology of the unified video platform adopts the most advanced real-time operating system in order to adapt to the media content with super-large capacity and super-large flow, the program information in the server instruction is mapped to the specific hard disk space, the media content is not passed through the server any more, and is directly sent to the user terminal instantly, and the general waiting time of the user is less than 0.2 second. The optimized sector distribution greatly reduces the mechanical motion of the magnetic head track seeking of the hard disk, the resource consumption only accounts for 20% of that of the IP internet of the same grade, but concurrent flow which is 3 times larger than that of the traditional hard disk array is generated, and the comprehensive efficiency is improved by more than 10 times.
Network Security Technology (Network Security Technology)
The structural design of the video network completely eliminates the network security problem troubling the internet structurally by the modes of independent service permission control each time, complete isolation of equipment and user data and the like, generally does not need antivirus programs and firewalls, avoids the attack of hackers and viruses, and provides a structural carefree security network for users.
Service Innovation Technology (Service Innovation Technology)
The unified video platform integrates services and transmission, and is not only automatically connected once whether a single user, a private network user or a network aggregate. The user terminal, the set-top box or the PC are directly connected to the unified video platform to obtain various multimedia video services in various forms. The unified video platform adopts a menu type configuration table mode to replace the traditional complex application programming, can realize complex application by using very few codes, and realizes infinite new service innovation.
Networking of the video network is as follows:
the video network is a centralized control network structure, and the network can be a tree network, a star network, a ring network and the like, but on the basis of the centralized control node, the whole network is controlled by the centralized control node in the network.
As shown in fig. 6, the video network is divided into an access network and a metropolitan network.
The devices of the access network part can be mainly classified into 3 types: node server, access switch, terminal (including various set-top boxes, coding boards, memories, etc.). The node server is connected to an access switch, which may be connected to a plurality of terminals and may be connected to an ethernet network.
The node server is a node which plays a centralized control function in the access network and can control the access switch and the terminal. The node server can be directly connected with the access switch or directly connected with the terminal.
Similarly, devices of the metropolitan network portion may also be classified into 3 types: a metropolitan area server, a node switch and a node server. The metro server is connected to a node switch, which may be connected to a plurality of node servers.
The node server is a node server of the access network part, namely the node server belongs to both the access network part and the metropolitan area network part.
The metropolitan area server is a node which plays a centralized control function in the metropolitan area network and can control a node switch and a node server. The metropolitan area server can be directly connected with the node switch or directly connected with the node server.
Therefore, the whole video network is a network structure with layered centralized control, and the network controlled by the node server and the metropolitan area server can be in various structures such as tree, star and ring.
The access network part can form a unified video platform (the part in the dotted circle), and a plurality of unified video platforms can form a video network; each unified video platform may be interconnected via metropolitan area and wide area video networking.
Video networking device classification
1.1 devices in the video network of the embodiment of the present invention can be mainly classified into 3 types: server, exchanger (including Ethernet protocol conversion gateway), terminal (including various set-top boxes, code board, memory, etc.). The video network as a whole can be divided into a metropolitan area network (or national network, global network, etc.) and an access network.
1.2 wherein the devices of the access network part can be mainly classified into 3 types: node server, access exchanger (including Ethernet protocol conversion gateway), terminal (including various set-top boxes, coding board, memory, etc.).
The specific hardware structure of each access network device is as follows:
a node server:
as shown in fig. 7, the system mainly includes a network interface module 601, a switching engine module 602, a CPU module 603, and a disk array module 604;
the network interface module 601, the CPU module 603, and the disk array module 604 all enter the switching engine module 602; the switching engine module 602 performs an operation of looking up the address table 605 on the incoming packet, thereby obtaining the direction information of the packet; and stores the packet in a queue of a corresponding packet buffer 606 according to the packet's steering information; if the queue of the packet buffer 606 is nearly full, it is discarded; the switching engine module 602 polls all packet buffer queues for forwarding if the following conditions are met: 1) the port send buffer is not full; 2) the queue packet counter is greater than zero. The disk array module 604 mainly implements control over the hard disk, including initialization, read-write, and other operations of the hard disk; the CPU module 603 is mainly responsible for protocol processing with an access switch and a terminal (not shown in the figure), configuring an address table 605 (including a downlink protocol packet address table, an uplink protocol packet address table, and a data packet address table), and configuring the disk array module 604.
The access switch:
as shown in fig. 8, the network interface module (downlink network interface module 701, uplink network interface module 702), the switching engine module 703 and the CPU module 704 are mainly included;
wherein, the packet (uplink data) coming from the downlink network interface module 701 enters the packet detection module 705; the packet detection module 705 detects whether the Destination Address (DA), the Source Address (SA), the packet type, and the packet length of the packet meet the requirements, if so, allocates a corresponding stream identifier (stream-id) and enters the switching engine module 703, otherwise, discards the stream identifier; the packet (downstream data) coming from the upstream network interface module 702 enters the switching engine module 703; the incoming data packet from the CPU module 704 enters the switching engine module 703; the switching engine module 703 performs an operation of looking up the address table 706 on the incoming packet, thereby obtaining the direction information of the packet; if a packet entering the switching engine module 703 goes from the downstream network interface to the upstream network interface, the packet is stored in the queue of the corresponding packet buffer 707 in association with a stream-id; if the queue of the packet buffer 707 is close to full, discard; if the packet entering the switching engine module 703 does not go from the downlink network interface to the uplink network interface, the packet is stored in the queue of the corresponding packet buffer 707 according to the packet steering information; if the queue of the packet buffer 707 is close to full, it is discarded.
The switching engine module 703 polls all packet buffer queues and may include two cases:
if the queue is from the downlink network interface to the uplink network interface, the following conditions are met for forwarding: 1) the port send buffer is not full; 2) the queued packet counter is greater than zero; 3) obtaining a token generated by a code rate control module;
if the queue is not from the downlink network interface to the uplink network interface, the following conditions are met for forwarding: 1) the port send buffer is not full; 2) the queue packet counter is greater than zero.
The rate control module 708 is configured by the CPU module 704 to generate tokens for packet buffer queues going to the upstream network interface for all downstream network interfaces at programmable intervals to control the rate of upstream forwarding.
The CPU module 704 is mainly responsible for protocol processing with the node server, configuration of the address table 706, and configuration of the code rate control module 708.
Ethernet protocol conversion gateway
As shown in fig. 9, the apparatus mainly includes a network interface module (a downlink network interface module 801 and an uplink network interface module 802), a switch engine module 803, a CPU module 804, a packet detection module 805, a rate control module 808, an address table 806, a packet buffer 807, a MAC adding module 809, and a MAC deleting module 810.
Wherein, the data packet coming from the downlink network interface module 801 enters the packet detection module 805; the packet detection module 805 detects whether the ethernet MAC DA, the ethernet MAC SA, the ethernet length or frame type, the video network destination address DA, the video network source address SA, the video network packet type, and the packet length of the packet meet the requirements, and if so, allocates a corresponding stream identifier (stream-id); then, the MAC deleting module 810 subtracts MAC DA, MAC SA, length or frame type (2byte), and enters the corresponding receiving buffer, otherwise, discards it;
the downlink network interface module 801 detects the sending buffer of the port, and if there is a packet, acquires the ethernet MAC DA of the corresponding terminal according to the destination address DA of the packet, adds the ethernet MAC DA of the terminal, the MACSA of the ethernet coordination gateway, and the ethernet length or frame type, and sends the packet.
The other modules in the ethernet protocol gateway function similarly to the access switch.
A terminal:
the system mainly comprises a network interface module, a service processing module and a CPU module; for example, the set-top box mainly comprises a network interface module, a video and audio coding and decoding engine module and a CPU module; the coding board mainly comprises a network interface module, a video and audio coding engine module and a CPU module; the memory mainly comprises a network interface module, a CPU module and a disk array module.
1.3 devices of the metropolitan area network part can be mainly classified into 2 types: node server, node exchanger, metropolitan area server. The node switch mainly comprises a network interface module, a switching engine module and a CPU module; the metropolitan area server mainly comprises a network interface module, a switching engine module and a CPU module.
2. Video networking packet definition
2.1 Access network packet definition
The data packet of the access network mainly comprises the following parts: destination Address (DA), Source Address (SA), reserved bytes, payload (pdu), CRC.
As shown in the following table, the data packet of the access network mainly includes the following parts:
DA SA Reserved Payload CRC
wherein:
the Destination Address (DA) is composed of 8 bytes (byte), the first byte represents the type of the data packet (such as various protocol packets, multicast data packets, unicast data packets, etc.), there are 256 possibilities at most, the second byte to the sixth byte are metropolitan area network addresses, and the seventh byte and the eighth byte are access network addresses;
the Source Address (SA) is also composed of 8 bytes (byte), defined as the same as the Destination Address (DA);
the reserved byte consists of 2 bytes;
the payload part has different lengths according to different types of datagrams, and is 64 bytes if the datagram is various types of protocol packets, and is 32+1024 or 1056 bytes if the datagram is a unicast packet, of course, the length is not limited to the above 2 types;
the CRC consists of 4 bytes and is calculated in accordance with the standard ethernet CRC algorithm.
2.2 metropolitan area network packet definition
The topology of a metropolitan area network is a graph and there may be 2, or even more than 2, connections between two devices, i.e., there may be more than 2 connections between a node switch and a node server, a node switch and a node switch, and a node switch and a node server. However, the metro network address of the metro network device is unique, and in order to accurately describe the connection relationship between the metro network devices, parameters are introduced in the embodiment of the present invention: a label to uniquely describe a metropolitan area network device.
In this specification, the definition of the Label is similar to that of the Label of MPLS (Multi-Protocol Label Switch), and assuming that there are two connections between the device a and the device B, there are 2 labels for the packet from the device a to the device B, and 2 labels for the packet from the device B to the device a. The label is classified into an incoming label and an outgoing label, and assuming that the label (incoming label) of the packet entering the device a is 0x0000, the label (outgoing label) of the packet leaving the device a may become 0x 0001. The network access process of the metro network is a network access process under centralized control, that is, address allocation and label allocation of the metro network are both dominated by the metro server, and the node switch and the node server are both passively executed, which is different from label allocation of MPLS, and label allocation of MPLS is a result of mutual negotiation between the switch and the server.
As shown in the following table, the data packet of the metro network mainly includes the following parts:
DA SA Reserved label (R) Payload CRC
Namely Destination Address (DA), Source Address (SA), Reserved byte (Reserved), tag, payload (pdu), CRC. The format of the tag may be defined by reference to the following: the tag is 32 bits with the upper 16 bits reserved and only the lower 16 bits used, and its position is between the reserved bytes and payload of the packet.
The embodiments in the present specification are described in a progressive manner, each embodiment focuses on differences from other embodiments, and the same and similar parts among the embodiments are referred to each other.
As will be appreciated by one skilled in the art, embodiments of the present invention may be provided as a method, apparatus, or computer program product. Accordingly, embodiments of the present invention may take the form of an entirely hardware embodiment, an entirely software embodiment or an embodiment combining software and hardware aspects. Furthermore, embodiments of the present invention may take the form of a computer program product embodied on one or more computer-usable storage media (including, but not limited to, disk storage, CD-ROM, optical storage, and the like) having computer-usable program code embodied therein.
Embodiments of the present invention are described with reference to flowchart illustrations and/or block diagrams of methods, terminal devices (systems), and computer program products according to embodiments of the invention. It will be understood that each flow and/or block of the flow diagrams and/or block diagrams, and combinations of flows and/or blocks in the flow diagrams and/or block diagrams, can be implemented by computer program instructions. These computer program instructions may be provided to a processor of a general purpose computer, special purpose computer, embedded processor, or other programmable data processing terminal to produce a machine, such that the instructions, which execute via the processor of the computer or other programmable data processing terminal, create means for implementing the functions specified in the flowchart flow or flows and/or block diagram block or blocks.
These computer program instructions may also be stored in a computer-readable memory that can direct a computer or other programmable data processing terminal to function in a particular manner, such that the instructions stored in the computer-readable memory produce an article of manufacture including instruction means which implement the function specified in the flowchart flow or flows and/or block diagram block or blocks.
These computer program instructions may also be loaded onto a computer or other programmable data processing terminal to cause a series of operational steps to be performed on the computer or other programmable terminal to produce a computer implemented process such that the instructions which execute on the computer or other programmable terminal provide steps for implementing the functions specified in the flowchart flow or flows and/or block diagram block or blocks.
While preferred embodiments of the present invention have been described, additional variations and modifications of these embodiments may occur to those skilled in the art once they learn of the basic inventive concepts. Therefore, it is intended that the appended claims be interpreted as including preferred embodiments and all such alterations and modifications as fall within the scope of the embodiments of the invention.
Finally, it should also be noted that, herein, relational terms such as first and second, and the like may be used solely to distinguish one entity or action from another entity or action without necessarily requiring or implying any actual such relationship or order between such entities or actions. Also, the terms "comprises," "comprising," or any other variation thereof, are intended to cover a non-exclusive inclusion, such that a process, method, article, or terminal that comprises a list of elements does not include only those elements but may include other elements not expressly listed or inherent to such process, method, article, or terminal. Without further limitation, an element defined by the phrase "comprising an … …" does not exclude the presence of other like elements in a process, method, article, or terminal that comprises the element.
The method for implementing the conference and the device for implementing the conference provided by the invention are described in detail above, and the principle and the implementation mode of the invention are explained by applying specific examples, and the description of the above examples is only used for helping to understand the method and the core idea of the invention; meanwhile, for a person skilled in the art, according to the idea of the present invention, there may be variations in the specific embodiments and the application scope, and in summary, the content of the present specification should not be construed as a limitation to the present invention.

Claims (14)

1. A method for realizing a conference is applied to a first mobile terminal, and the method comprises the following steps:
sending a conference room establishing request to a streaming media server; the meeting room establishing request carries a meeting room identifier;
entering a meeting room corresponding to the meeting room identification; wherein the conference room is created by the streaming server in response to the conference room creation request;
establishing connection with a WEB server;
and sending a conference control instruction to at least one second mobile terminal entering the conference room through the WEB server so as to control the at least one second mobile terminal to speak or finish speaking.
2. The method of claim 1, wherein the conference control directives include a allow talk directive and an end talk directive;
the sending a conference control instruction to at least one second mobile terminal entering the conference room through the WEB server to control the at least one second mobile terminal to speak or finish speaking includes:
acquiring a speaking request sent by at least one second mobile terminal in the conference room through the WEB server; generating a speaking permission instruction according to the speaking request; sending the speaking permission instruction to the at least one second mobile terminal so as to control the at least one second mobile terminal to speak;
or the like, or, alternatively,
and sending the speaking ending instruction to the at least one second mobile terminal through the WEB server so as to control the at least one second mobile terminal to end speaking.
3. The method of claim 1, further comprising:
sending a conference starting request to the streaming media server;
receiving a permission instruction returned by the streaming media server in response to the conference starting request;
pushing audio and video data to the streaming media server according to the permission instruction;
and sending the channel identification of the audio and video data to the conference room.
4. The method of claim 1, further comprising:
acquiring a channel identifier of at least one audio/video data in the conference room;
transmitting a watching request to the streaming media server according to the channel identifier of the at least one piece of audio and video data;
receiving multi-channel audio and video data pushed by the streaming media server in response to the watching request;
and displaying a plurality of conference pictures according to the multi-path audio and video data.
5. The method of claim 4, wherein said presenting a plurality of meeting frames based on said multi-channel audio-video data comprises:
carrying out shunting processing according to the channel identification of the multi-path audio/video data to obtain a plurality of target audio/video data;
determining a plurality of target display areas corresponding to the plurality of target audio and video data;
and displaying a plurality of conference pictures on the plurality of target display areas.
6. A method for realizing a conference is applied to a second mobile terminal, and the method comprises the following steps:
sending a conference entering request to a streaming media server; the conference joining request carries a conference room identifier;
entering a meeting room corresponding to the meeting room identification;
establishing connection with a WEB server;
and receiving a conference control instruction sent by a first mobile terminal entering the conference room through the WEB server, and speaking or ending speaking according to the conference control instruction.
7. The method of claim 6, wherein the conference control directives include a allow talk directive and an end talk directive;
the receiving, by the WEB server, a conference control instruction sent by a first mobile terminal entering the conference room, and speaking or ending the speech according to the conference control instruction, includes:
sending a speaking request to a first mobile terminal entering the conference room through the WEB server; receiving an speaking permission instruction returned by the first mobile terminal in response to the speaking request; speaking according to the speaking permission instruction;
or the like, or, alternatively,
receiving a speech ending instruction sent by a first mobile terminal entering the conference room through the WEB server; and ending the speaking according to the speaking ending instruction.
8. The method of claim 7, wherein speaking according to the allow-to-speak instruction comprises:
audio and video data are pushed to the streaming media server according to the speaking permission instruction;
and sending the channel identification of the audio and video data to the conference room.
9. The method of claim 7, further comprising:
acquiring a channel identifier of at least one audio/video data in the conference room;
transmitting a watching request to the streaming media server according to the channel identifier of the at least one piece of audio and video data;
receiving multi-channel audio and video data pushed by the streaming media server in response to the watching request;
and displaying a plurality of conference pictures according to the multi-path audio and video data.
10. The method of claim 9, wherein said presenting a plurality of meeting frames based on said multi-channel audio-video data comprises:
carrying out shunting processing according to the channel identification of the multi-path audio/video data to obtain a plurality of target audio/video data;
determining a plurality of target display areas corresponding to the plurality of target audio and video data;
and displaying a plurality of conference pictures on the plurality of target display areas.
11. An apparatus for implementing a conference, applied to a first mobile terminal, the apparatus comprising:
a conference room creation request sending module, configured to send a conference room creation request to the streaming media server; the meeting room establishing request carries a meeting room identifier;
the conference room entering module is used for entering a conference room corresponding to the conference room identifier; wherein the conference room is created by the streaming server in response to the conference room creation request;
the connection module is used for establishing connection with a WEB server;
and the conference control instruction sending module is used for sending a conference control instruction to at least one second mobile terminal entering the conference room through the WEB server so as to control the at least one second mobile terminal to speak or finish speaking.
12. An apparatus for implementing a conference, applied to a second mobile terminal, the apparatus comprising:
the conference joining request sending module is used for sending a conference joining request to the streaming media server; the conference joining request carries a conference room identifier;
the conference room entering module is used for entering a conference room corresponding to the conference room identifier;
the connection module is used for establishing connection with a WEB server;
and the conference control instruction receiving module is used for receiving a conference control instruction sent by a first mobile terminal entering the conference room through the WEB server and making a speech or ending the speech according to the conference control instruction.
13. An electronic device, comprising:
one or more processors; and
one or more machine readable media having instructions stored thereon, which when executed by the one or more processors, cause the electronic device to perform the steps of the method of any of claims 1-5 or 6-10.
14. A computer-readable storage medium, characterized in that it stores a computer program for causing a processor to perform the steps of the method according to any one of claims 1 to 5 or 6 to 10.
CN202010312487.2A 2020-04-20 2020-04-20 Method and device for realizing conference Pending CN111641800A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202010312487.2A CN111641800A (en) 2020-04-20 2020-04-20 Method and device for realizing conference

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202010312487.2A CN111641800A (en) 2020-04-20 2020-04-20 Method and device for realizing conference

Publications (1)

Publication Number Publication Date
CN111641800A true CN111641800A (en) 2020-09-08

Family

ID=72330777

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202010312487.2A Pending CN111641800A (en) 2020-04-20 2020-04-20 Method and device for realizing conference

Country Status (1)

Country Link
CN (1) CN111641800A (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112954256A (en) * 2021-01-29 2021-06-11 深圳壹秘科技有限公司 Conference control method, device, system and computer readable storage medium
CN113489736A (en) * 2021-07-14 2021-10-08 海南视联通信技术有限公司 Method, device, equipment and storage medium for realizing multimedia conference
CN113938336A (en) * 2021-11-15 2022-01-14 网易(杭州)网络有限公司 Conference control method and device and electronic equipment

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20130147900A1 (en) * 2011-12-07 2013-06-13 Reginald Weiser Systems and methods for providing video conferencing services via an ethernet adapter
CN103974093A (en) * 2013-01-30 2014-08-06 云联(北京)信息技术有限公司 Method and system for processing streaming media
CN205408020U (en) * 2015-12-11 2016-07-27 中国农业银行股份有限公司 Teleconferencing system
CN110166728A (en) * 2019-05-29 2019-08-23 视联动力信息技术股份有限公司 A kind of view networking meeting method and device
CN110855926A (en) * 2018-08-21 2020-02-28 视联动力信息技术股份有限公司 Video conference processing method and device

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20130147900A1 (en) * 2011-12-07 2013-06-13 Reginald Weiser Systems and methods for providing video conferencing services via an ethernet adapter
CN103974093A (en) * 2013-01-30 2014-08-06 云联(北京)信息技术有限公司 Method and system for processing streaming media
CN205408020U (en) * 2015-12-11 2016-07-27 中国农业银行股份有限公司 Teleconferencing system
CN110855926A (en) * 2018-08-21 2020-02-28 视联动力信息技术股份有限公司 Video conference processing method and device
CN110166728A (en) * 2019-05-29 2019-08-23 视联动力信息技术股份有限公司 A kind of view networking meeting method and device

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
李俊;王静;吴巍;陈立水;: "基于Red5网页版视频会议系统设计与实现", 无线电通信技术, no. 06, pages 1 - 4 *

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112954256A (en) * 2021-01-29 2021-06-11 深圳壹秘科技有限公司 Conference control method, device, system and computer readable storage medium
CN113489736A (en) * 2021-07-14 2021-10-08 海南视联通信技术有限公司 Method, device, equipment and storage medium for realizing multimedia conference
CN113938336A (en) * 2021-11-15 2022-01-14 网易(杭州)网络有限公司 Conference control method and device and electronic equipment

Similar Documents

Publication Publication Date Title
CN108574688B (en) Method and device for displaying participant information
CN109640028B (en) Method and device for carrying out conference combining on multiple video networking terminals and multiple Internet terminals
CN110620896B (en) Conference establishing method, system and device
CN110049271B (en) Video networking conference information display method and device
CN110493554B (en) Method and system for switching speaking terminal
CN109547728B (en) Recorded broadcast source conference entering and conference recorded broadcast method and system
CN109120879B (en) Video conference processing method and system
CN110475090B (en) Conference control method and system
CN109660816B (en) Information processing method and device
CN110138728B (en) Video data sharing method and device
CN110191304B (en) Data processing method, device and storage medium
CN110049273B (en) Video networking-based conference recording method and transfer server
CN110545395B (en) Video networking conference switching method and device
CN109862307B (en) Video conference initiating method and device
CN109040656B (en) Video conference processing method and system
CN111641800A (en) Method and device for realizing conference
CN110855926A (en) Video conference processing method and device
CN111131754A (en) Control split screen method and device of conference management system
CN111478884B (en) Method, device and storage medium for data transmission among different video networks
CN111654713A (en) Live broadcast interaction method and device
CN110049268B (en) Video telephone connection method and device
CN109963108B (en) One-to-many talkback method and device
CN109286775B (en) Multi-person conference control method and system
CN111327868A (en) Method, terminal, server, device and medium for setting conference speaking party role
CN109005378B (en) Video conference processing method and system

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination