CN110062194B - Audio and video data processing method, device and storage medium - Google Patents

Audio and video data processing method, device and storage medium Download PDF

Info

Publication number
CN110062194B
CN110062194B CN201910319090.3A CN201910319090A CN110062194B CN 110062194 B CN110062194 B CN 110062194B CN 201910319090 A CN201910319090 A CN 201910319090A CN 110062194 B CN110062194 B CN 110062194B
Authority
CN
China
Prior art keywords
audio
video data
terminals
terminal
polling
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201910319090.3A
Other languages
Chinese (zh)
Other versions
CN110062194A (en
Inventor
史少军
王艳辉
亓娜
袁占涛
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Visionvera Information Technology Co Ltd
Original Assignee
Visionvera Information Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Visionvera Information Technology Co Ltd filed Critical Visionvera Information Technology Co Ltd
Priority to CN201910319090.3A priority Critical patent/CN110062194B/en
Publication of CN110062194A publication Critical patent/CN110062194A/en
Application granted granted Critical
Publication of CN110062194B publication Critical patent/CN110062194B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N7/00Television systems
    • H04N7/14Systems for two-way working
    • H04N7/15Conference systems
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N7/00Television systems
    • H04N7/14Systems for two-way working
    • H04N7/15Conference systems
    • H04N7/152Multipoint control units therefor

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Data Exchanges In Wide-Area Networks (AREA)
  • Two-Way Televisions, Distribution Of Moving Picture Or The Like (AREA)

Abstract

The embodiment of the invention provides a method and a device for processing audio and video data, a device and a computer readable storage medium, wherein the method comprises the following steps: the method comprises the steps that a first terminal responds to configuration operation of a user and acquires a plurality of pieces of configuration information aiming at a plurality of second terminals; the first terminal stores a plurality of pieces of configuration information into a preset polling task list; the method comprises the steps that a first terminal starts one or more preset polling threads, sends a playing instruction of audio and video data to a node server according to a preset period by utilizing the one or more polling threads, the node server obtains the audio and video data corresponding to the playing instruction according to the playing instruction and returns the audio and video data to a plurality of second terminals, and the plurality of second terminals play the received audio and video data according to a plurality of configuration information. The embodiment of the invention increases the number of paths for simultaneously displaying the audio and video data and improves the efficiency for displaying the audio and video data.

Description

Audio and video data processing method, device and storage medium
Technical Field
The present invention relates to the field of video networking technologies, and in particular, to a method and an apparatus for processing audio and video data, and a computer-readable storage medium.
Background
The video network is a special network for transmitting high-definition video and a special protocol at high speed based on Ethernet hardware, is a higher-level form of the Internet and is a real-time network.
In the video conference based on the video network, a main conference place can poll and display audio and video data of part of participants in the video conference through an 8-channel television wall. The 8-channel television wall can be composed of 4 sub-cores, each sub-core can receive 2-channel audio and video data, and the 8-channel television wall can simultaneously display the 8-channel audio and video data of the participants.
If too many pieces of audio and video data of the participants are displayed through polling of one 8-channel television wall, the 8-channel television wall is required to poll for many times, and the display efficiency of the audio and video data is low.
Disclosure of Invention
In view of the above problems, embodiments of the present invention are proposed to provide a method and an apparatus for processing audio-video data, and an apparatus and a computer-readable storage medium, which overcome or at least partially solve the above problems.
In order to solve the above problems, an embodiment of the present invention discloses a method for processing audio/video data, where the method is applied to a video network, and the video network includes a first terminal, a plurality of second terminals, and a node server, where the node server is in communication connection with the first terminal and the plurality of second terminals, respectively; the method comprises the following steps: the first terminal responds to configuration operation of a user and acquires a plurality of pieces of configuration information aiming at the plurality of second terminals; the first terminal stores the plurality of configuration information into a preset polling task list; the first terminal starts one or more preset polling threads, and sends a playing instruction of audio and video data to the node server according to a preset period by using the one or more polling threads, the node server acquires the audio and video data corresponding to the playing instruction according to the playing instruction and returns the audio and video data to the second terminals, and the second terminals play the received audio and video data according to the configuration information.
Optionally, the step of starting, by the first terminal, one or more preset polling threads includes: the first terminal counts the number of the plurality of configuration information in the polling task list; and the first terminal starts the one or more polling threads according to the number.
Optionally, the step of starting, by the first terminal, the one or more polling threads according to the number includes: when the number of the first terminals is less than or equal to a preset terminal threshold value, starting the polling thread; and the first terminal starts the plurality of polling threads when the number of the first terminals is larger than the terminal threshold value.
Optionally, the video network further includes a plurality of third terminals, where the plurality of third terminals include a source terminal of the audio and video data, and the plurality of configuration information includes polling interval information and identification information of the source terminal.
Optionally, the play instruction includes identification information of the source terminal; and the plurality of second terminals circularly play the received audio and video data according to the polling interval information.
The embodiment of the invention also discloses a processing device of audio and video data, which is applied to a video network, wherein the video network comprises a plurality of second terminals and a node server, and the node server is respectively in communication connection with the processing device and the plurality of second terminals; the processing device comprises: an obtaining module, configured to obtain, in response to a configuration operation by a user, a plurality of pieces of configuration information for the plurality of second terminals; the storage module is used for storing the plurality of configuration information into a preset polling task list; the starting module is used for starting one or more preset polling threads; the transmitting module is used for transmitting a playing instruction of audio and video data to the node server according to a preset period by using the one or more polling threads, the node server acquires the audio and video data corresponding to the playing instruction according to the playing instruction and returns the audio and video data to the plurality of second terminals, and the plurality of second terminals play the received audio and video data according to the plurality of configuration information.
Optionally, the starting module includes: the counting submodule is used for counting the quantity of the plurality of configuration information in the polling task list; and the starting sub-module is used for starting the one or more polling threads according to the number.
Optionally, the starting sub-module is configured to start the polling thread when the number is less than or equal to a preset terminal threshold; when the number is larger than the terminal threshold value, starting the plurality of polling threads; the video network further comprises a plurality of third terminals, the plurality of third terminals comprise source terminals of the audio and video data, and the plurality of configuration information comprise polling interval information and identification information of the source terminals; the playing instruction comprises identification information of the source terminal; and the plurality of second terminals circularly play the received audio and video data according to the polling interval information.
The embodiment of the invention also discloses a device, which comprises: one or more processors; and one or more machine readable media having instructions stored thereon, which when executed by the one or more processors, cause the apparatus to perform a method of processing audio-visual data as described in embodiments of the invention.
The embodiment of the invention also discloses a computer readable storage medium, and a stored computer program enables a processor to execute the audio and video data processing method.
The embodiment of the invention has the following advantages:
the embodiment of the invention is applied to the video network, wherein the video network can comprise a first terminal, a plurality of second terminals and a node server, and the node server can be respectively in communication connection with the first terminal and the plurality of second terminals. In a video conference application scenario, the first terminal may be a control terminal of a video conference, the second terminal may be an 8-channel video wall, and the node server may be a video conference management server.
In the embodiment of the invention, a user can input configuration information for the second terminal in the first terminal through configuration operation, and the first terminal acquires the configuration information in response to the configuration operation of the user and stores the configuration information into a preset polling task list. The method comprises the steps that a first terminal starts one or more preset polling threads, one or more polling threads are utilized to send a playing instruction of audio and video data to a node server according to a preset period, the node server obtains the audio and video data corresponding to the playing instruction according to the playing instruction and returns the audio and video data to a second terminal, and the second terminal plays the received audio and video data according to respective configuration information.
The technical scheme provided by the embodiment of the invention is applied to a video conference application scene, configuration information can be input for a plurality of 8-channel television walls, one or more polling threads send a playing instruction to a video conference management server, so that the plurality of 8-channel television walls can work simultaneously, and compared with the situation that only one 8-channel television wall displays audio and video data of a participant, the number of channels for simultaneously displaying the audio and video data is increased, and the efficiency for displaying the audio and video data is improved.
Drawings
FIG. 1 is a schematic networking diagram of a video network of the present invention;
FIG. 2 is a schematic diagram of a hardware architecture of a node server according to the present invention;
fig. 3 is a schematic diagram of a hardware structure of an access switch of the present invention;
fig. 4 is a schematic diagram of a hardware structure of an ethernet protocol conversion gateway according to the present invention;
fig. 5 is a flowchart illustrating steps of an embodiment of a method for processing audio/video data according to an embodiment of the present invention;
fig. 6 is a design architecture diagram of a polling method for a plurality of 8-channel video walls according to an embodiment of the present invention;
FIG. 7 is a schematic diagram illustrating a polling method for a plurality of 8-channel video walls according to an embodiment of the present invention;
fig. 8 is a block diagram of an embodiment of an audio/video data processing apparatus according to an embodiment of the present invention.
Detailed Description
In order to make the aforementioned objects, features and advantages of the present invention comprehensible, embodiments accompanied with figures are described in further detail below.
The video networking is an important milestone for network development, is a real-time network, can realize high-definition video real-time transmission, and pushes a plurality of internet applications to high-definition video, and high-definition faces each other.
The video networking adopts a real-time high-definition video exchange technology, can integrate required services such as dozens of services of video, voice, pictures, characters, communication, data and the like on a system platform on a network platform, such as high-definition video conference, video monitoring, intelligent monitoring analysis, emergency command, digital broadcast television, delayed television, network teaching, live broadcast, VOD on demand, television mail, Personal Video Recorder (PVR), intranet (self-office) channels, intelligent video broadcast control, information distribution and the like, and realizes high-definition quality video broadcast through a television or a computer.
To better understand the embodiments of the present invention, the following description refers to the internet of view:
some of the technologies applied in the video networking are as follows:
network Technology (Network Technology)
Network technology innovation in video networking has improved the traditional Ethernet (Ethernet) to face the potentially huge first video traffic on the network. Unlike pure network Packet Switching (Packet Switching) or network Circuit Switching (Circuit Switching), the Packet Switching is adopted by the technology of the video networking to meet the Streaming requirement. The video networking technology has the advantages of flexibility, simplicity and low price of packet switching, and simultaneously has the quality and safety guarantee of circuit switching, thereby realizing the seamless connection of the whole network switching type virtual circuit and the data format.
Switching Technology (Switching Technology)
The video network adopts two advantages of asynchronism and packet switching of the Ethernet, eliminates the defects of the Ethernet on the premise of full compatibility, has end-to-end seamless connection of the whole network, is directly communicated with a user terminal, and directly bears an IP data packet. The user data does not require any format conversion across the entire network. The video networking is a higher-level form of the Ethernet, is a real-time exchange platform, can realize the real-time transmission of the whole-network large-scale high-definition video which cannot be realized by the existing Internet, and pushes a plurality of network video applications to high-definition and unification.
Server Technology (Server Technology)
The server technology on the video networking and unified video platform is different from the traditional server, the streaming media transmission of the video networking and unified video platform is established on the basis of connection orientation, the data processing capacity of the video networking and unified video platform is independent of flow and communication time, and a single network layer can contain signaling and data transmission. For voice and video services, the complexity of video networking and unified video platform streaming media processing is much simpler than that of data processing, and the efficiency is greatly improved by more than one hundred times compared with that of a traditional server.
Storage Technology (Storage Technology)
The super-high speed storage technology of the unified video platform adopts the most advanced real-time operating system in order to adapt to the media content with super-large capacity and super-large flow, the program information in the server instruction is mapped to the specific hard disk space, the media content is not passed through the server any more, and is directly sent to the user terminal instantly, and the general waiting time of the user is less than 0.2 second. The optimized sector distribution greatly reduces the mechanical motion of the magnetic head track seeking of the hard disk, the resource consumption only accounts for 20% of that of the IP internet of the same grade, but concurrent flow which is 3 times larger than that of the traditional hard disk array is generated, and the comprehensive efficiency is improved by more than 10 times.
Network Security Technology (Network Security Technology)
The structural design of the video network completely eliminates the network security problem troubling the internet structurally by the modes of independent service permission control each time, complete isolation of equipment and user data and the like, generally does not need antivirus programs and firewalls, avoids the attack of hackers and viruses, and provides a structural carefree security network for users.
Service Innovation Technology (Service Innovation Technology)
The unified video platform integrates services and transmission, and is not only automatically connected once whether a single user, a private network user or a network aggregate. The user terminal, the set-top box or the PC are directly connected to the unified video platform to obtain various multimedia video services in various forms. The unified video platform adopts a menu type configuration table mode to replace the traditional complex application programming, can realize complex application by using very few codes, and realizes infinite new service innovation.
Networking of the video network is as follows:
the video network is a centralized control network structure, and the network can be a tree network, a star network, a ring network and the like, but on the basis of the centralized control node, the whole network is controlled by the centralized control node in the network.
As shown in fig. 1, the video network is divided into an access network and a metropolitan network.
The devices of the access network part can be mainly classified into 3 types: node server, access switch, terminal (including various set-top boxes, coding boards, memories, etc.). The node server is connected to an access switch, which may be connected to a plurality of terminals and may be connected to an ethernet network.
The node server is a node which plays a centralized control function in the access network and can control the access switch and the terminal. The node server can be directly connected with the access switch or directly connected with the terminal.
Similarly, devices of the metropolitan network portion may also be classified into 3 types: a metropolitan area server, a node switch and a node server. The metro server is connected to a node switch, which may be connected to a plurality of node servers.
The node server is a node server of the access network part, namely the node server belongs to both the access network part and the metropolitan area network part.
The metropolitan area server is a node which plays a centralized control function in the metropolitan area network and can control a node switch and a node server. The metropolitan area server can be directly connected with the node switch or directly connected with the node server.
Therefore, the whole video network is a network structure with layered centralized control, and the network controlled by the node server and the metropolitan area server can be in various structures such as tree, star and ring.
The access network part can form a unified video platform (the part in the dotted circle), and a plurality of unified video platforms can form a video network; each unified video platform may be interconnected via metropolitan area and wide area video networking.
Video networking device classification
1.1 devices in the video network of the embodiment of the present invention can be mainly classified into 3 types: server, exchanger (including Ethernet protocol conversion gateway), terminal (including various set-top boxes, code board, memory, etc.). The video network as a whole can be divided into a metropolitan area network (or national network, global network, etc.) and an access network.
1.2 wherein the devices of the access network part can be mainly classified into 3 types: node server, access exchanger (including Ethernet protocol conversion gateway), terminal (including various set-top boxes, coding board, memory, etc.).
The specific hardware structure of each access network device is as follows:
a node server:
as shown in fig. 2, the system mainly includes a network interface module 201, a switching engine module 202, a CPU module 203, and a disk array module 204;
the packets coming from the network interface module 201, the CPU module 203, and the disk array module 204 all enter the switching engine module 202; the switching engine module 202 performs an operation of looking up the address table 205 on the incoming packet, thereby obtaining the direction information of the packet; and stores the packet in a queue of the corresponding packet buffer 206 based on the packet's steering information; if the queue of the packet buffer 206 is nearly full, it is discarded; the switching engine module 202 polls all packet buffer queues for forwarding if the following conditions are met: 1) the port send buffer is not full; 2) the queue packet counter is greater than zero. The disk array module 204 mainly implements control over the hard disk, including initialization, read-write, and other operations on the hard disk; the CPU module 203 is mainly responsible for protocol processing with an access switch and a terminal (not shown in the figure), configuring an address table 205 (including a downlink protocol packet address table, an uplink protocol packet address table, and a data packet address table), and configuring the disk array module 204.
The access switch:
as shown in fig. 3, the network interface module mainly includes a network interface module (a downlink network interface module 301 and an uplink network interface module 302), a switching engine module 303 and a CPU module 304;
wherein, the packet (uplink data) coming from the downlink network interface module 301 enters the packet detection module 305; the packet detection module 305 detects whether the Destination Address (DA), the Source Address (SA), the packet type, and the packet length of the packet meet the requirements, and if so, allocates a corresponding stream identifier (stream-id) and enters the switching engine module 303, otherwise, discards the stream identifier; the packet (downstream data) coming from the upstream network interface module 302 enters the switching engine module 303; the incoming data packet of the CPU module 304 enters the switching engine module 303; the switching engine module 303 performs an operation of looking up the address table 306 on the incoming packet, thereby obtaining the direction information of the packet; if the packet entering the switching engine module 303 is from the downstream network interface to the upstream network interface, the packet is stored in the queue of the corresponding packet buffer 307 in association with the stream-id; if the queue of the packet buffer 307 is nearly full, it is discarded; if the packet entering the switching engine module 303 is not from the downlink network interface to the uplink network interface, the data packet is stored in the queue of the corresponding packet buffer 307 according to the guiding information of the packet; if the queue of the packet buffer 307 is nearly full, it is discarded.
The switching engine module 303 polls all packet buffer queues and may include two cases:
if the queue is from the downlink network interface to the uplink network interface, the following conditions are met for forwarding: 1) the port send buffer is not full; 2) the queued packet counter is greater than zero; 3) obtaining a token generated by a code rate control module;
if the queue is not from the downlink network interface to the uplink network interface, the following conditions are met for forwarding: 1) the port send buffer is not full; 2) the queue packet counter is greater than zero.
The rate control module 308 is configured by the CPU module 304, and generates tokens for packet buffer queues from all downstream network interfaces to upstream network interfaces at programmable intervals to control the rate of upstream forwarding.
The CPU module 304 is mainly responsible for protocol processing with the node server, configuration of the address table 306, and configuration of the code rate control module 308.
Ethernet protocol conversion gateway
As shown in fig. 4, the apparatus mainly includes a network interface module (a downlink network interface module 401 and an uplink network interface module 402), a switching engine module 403, a CPU module 404, a packet detection module 405, a rate control module 408, an address table 406, a packet buffer 407, a MAC adding module 409, and a MAC deleting module 410.
Wherein, the data packet coming from the downlink network interface module 401 enters the packet detection module 405; the packet detection module 405 detects whether the ethernet MAC DA, the ethernet MAC SA, the ethernet length or frame type, the video network destination address DA, the video network source address SA, the video network packet type, and the packet length of the packet meet the requirements, and if so, allocates a corresponding stream identifier (stream-id); then, the MAC deletion module 410 subtracts MAC DA, MAC SA, length or frame type (2byte) and enters the corresponding receiving buffer, otherwise, discards it;
the downlink network interface module 401 detects the sending buffer of the port, and if there is a packet, obtains the ethernet MAC DA of the corresponding terminal according to the destination address DA of the packet, adds the ethernet MAC DA of the terminal, the MAC SA of the ethernet protocol gateway, and the ethernet length or frame type, and sends the packet.
The other modules in the ethernet protocol gateway function similarly to the access switch.
A terminal:
the system mainly comprises a network interface module, a service processing module and a CPU module; for example, the set-top box mainly comprises a network interface module, a video and audio coding and decoding engine module and a CPU module; the coding board mainly comprises a network interface module, a video and audio coding engine module and a CPU module; the memory mainly comprises a network interface module, a CPU module and a disk array module.
1.3 devices of the metropolitan area network part can be mainly classified into 2 types: node server, node exchanger, metropolitan area server. The node switch mainly comprises a network interface module, a switching engine module and a CPU module; the metropolitan area server mainly comprises a network interface module, a switching engine module and a CPU module.
2. Video networking packet definition
2.1 Access network packet definition
The data packet of the access network mainly comprises the following parts: destination Address (DA), Source Address (SA), reserved bytes, payload (pdu), CRC.
As shown in the following table, the data packet of the access network mainly includes the following parts:
DA SA Reserved Payload CRC
wherein:
the Destination Address (DA) is composed of 8 bytes (byte), the first byte represents the type of the data packet (such as various protocol packets, multicast data packets, unicast data packets, etc.), there are 256 possibilities at most, the second byte to the sixth byte are metropolitan area network addresses, and the seventh byte and the eighth byte are access network addresses;
the Source Address (SA) is also composed of 8 bytes (byte), defined as the same as the Destination Address (DA);
the reserved byte consists of 2 bytes;
the payload part has different lengths according to different types of datagrams, and is 64 bytes if the datagram is various types of protocol packets, and is 32+1024 or 1056 bytes if the datagram is a unicast packet, of course, the length is not limited to the above 2 types;
the CRC consists of 4 bytes and is calculated in accordance with the standard ethernet CRC algorithm.
2.2 metropolitan area network packet definition
The topology of a metropolitan area network is a graph and there may be 2, or even more than 2, connections between two devices, i.e., there may be more than 2 connections between a node switch and a node server, a node switch and a node switch, and a node switch and a node server. However, the metro network address of the metro network device is unique, and in order to accurately describe the connection relationship between the metro network devices, parameters are introduced in the embodiment of the present invention: a label to uniquely describe a metropolitan area network device.
In this specification, the definition of the Label is similar to that of the Label of MPLS (Multi-Protocol Label Switch), and assuming that there are two connections between the device a and the device B, there are 2 labels for the packet from the device a to the device B, and 2 labels for the packet from the device B to the device a. The label is classified into an incoming label and an outgoing label, and assuming that the label (incoming label) of the packet entering the device a is 0x0000, the label (outgoing label) of the packet leaving the device a may become 0x 0001. The network access process of the metro network is a network access process under centralized control, that is, address allocation and label allocation of the metro network are both dominated by the metro server, and the node switch and the node server are both passively executed, which is different from label allocation of MPLS, and label allocation of MPLS is a result of mutual negotiation between the switch and the server.
As shown in the following table, the data packet of the metro network mainly includes the following parts:
DA SA Reserved label (R) Payload CRC
Namely Destination Address (DA), Source Address (SA), Reserved byte (Reserved), tag, payload (pdu), CRC. The format of the tag may be defined by reference to the following: the tag is 32 bits with the upper 16 bits reserved and only the lower 16 bits used, and its position is between the reserved bytes and payload of the packet.
Referring to fig. 5, a flowchart illustrating steps of an embodiment of a method for processing audio and video data according to an embodiment of the present invention is shown, where the method may be applied to a video network, and the video network may include a first terminal, a plurality of second terminals, and a node server, where the node server may be in communication connection with the first terminal and the plurality of second terminals, respectively, and the method may specifically include the following steps:
step 501, a first terminal responds to configuration operation of a user to acquire a plurality of pieces of configuration information aiming at a plurality of second terminals.
In the embodiment of the invention, a plurality of second terminals can be preset, and the second terminals are used for playing audio and video data of a third terminal in the video network. The audio and video data of the third terminal may be the audio and video data locally stored by the third terminal, or the audio and video data collected by the third terminal. In a video conference application scene, the audio and video data of the third terminal can be real-time video conference data acquired by the participants.
The user may configure corresponding configuration information for each second terminal, and in practical application, the configuration information may include polling interval information and identification information of the source terminal. The polling interval information may indicate an interval time for the second terminal to play multiple sets of audio and video data of the third terminal, for example, if the polling interval information is 1 minute, the second terminal displays the audio and video data of the next set of third terminal 1 minute after displaying the audio and video data of the current set of third terminal. The identification information of the source terminal can represent the identification information of a third terminal where the audio and video data to be displayed are located, the identification information can be any information with unique certainty, such as a video networking number, a physical address of a network card and the like, and the embodiment of the invention does not specifically limit the type, content, format and the like of the identification information.
Step 502, the first terminal stores a plurality of configuration information into a preset polling task list.
A polling task list for storing configuration information of each second terminal may be preset in the first terminal. A configuration information may be considered a polling task. That is, how many second terminals the user configures the configuration information for, how many polling tasks are present in the polling task list.
Step 503, the first terminal starts one or more preset polling threads, and sends a play instruction of the audio and video data to the node server according to a preset period by using the one or more polling threads, so that the node server obtains the audio and video data corresponding to the play instruction according to the play instruction and returns the audio and video data to the plurality of second terminals, and the plurality of second terminals play the received audio and video data according to the plurality of configuration information.
In the embodiment of the invention, the first terminal can communicate with the node server by starting the polling thread and further utilizing the polling thread so as to complete the task in the polling task list. When the first terminal starts one or more polling threads, the first terminal may count the number of configuration information in the polling task list, and then start one or more polling threads according to the counted number of configuration information. In practical application, if the number of configuration information in the polling task list is less than or equal to a preset terminal threshold, the first terminal may start only one polling thread; if the number of the configuration information in the polling task list is greater than a preset terminal threshold, the first terminal may start a plurality of polling threads. For example, the preset termination threshold is 4, and the termination threshold indicates that only 4 polling tasks can be processed by one polling thread at most. When there are 4 or less than 4 configuration information in the polling task list, the first terminal may start only one polling thread, and the one polling thread may be responsible for processing 4 or less than 4 polling tasks in the polling task list. When 5 pieces of configuration information exist in the polling task list, the first terminal needs to start two polling threads, wherein one polling thread is responsible for processing 4 polling tasks in the polling task list, and the other polling thread is responsible for processing the remaining 1 polling task in the polling task list. When 10 pieces of configuration information exist in the polling task list, the first terminal needs to start three polling threads, wherein one polling thread is responsible for processing 4 polling tasks in the polling task list, the other polling thread is responsible for processing the other 4 polling tasks in the polling task list, and the other polling thread is responsible for processing the remaining 2 polling tasks in the polling task list.
After the first terminal starts the one or more polling threads, a playing instruction of the audio and video data can be sent to the node server by using the started one or more polling threads. In practical application, the polling thread may send a play instruction to the node server according to a preset period, where the period may be a period in which the first terminal issues one or more pieces of configuration information to the polling thread. The polling thread sends a play instruction to the node server after receiving one or more pieces of configuration information sent by the first terminal. And after receiving the next wave of one or more pieces of configuration information sent by the first terminal, the polling thread sends the playing instruction to the node server again. It should be noted that the first wave configuration information sent by the first terminal to the polling thread is configuration information of the second terminal that plays the audio and video data at the same time. For example, the first terminal simultaneously transmits the configuration information of the second terminals Z1, Z2, and Z4 to the polling thread, and the polling thread may transmit a play instruction to the node server immediately after receiving the configuration information, or the polling thread may transmit a play instruction to the node server after receiving the configuration information for a certain period of time.
The play instruction sent by the polling thread to the node server may include identification information of a source terminal of the audio and video data to be played. In a preferred embodiment of the present invention, the playing instruction may further include identification information of a second terminal that is to play the audio/video data. The identification information of the second terminal of the audio and video data to be played can be carried in the configuration information of the second terminal. For example, the configuration information configured by the user for the second terminal Z2 includes identification information b2 of the second terminal Z2, polling interval information t2, and identification information b5 of the third terminal Z5. The polling thread sends a play instruction to the node server, which may include identification information b2 and identification information b 5. The node server may obtain the audio and video data of the third terminal Z5 according to the identification information b5, and the node server returns the audio and video data obtained from the third terminal Z5 to the second terminal Z2 according to the identification information b 2. The second terminal Z2 circularly plays the audio and video data of the third terminal Z5 according to the polling interval information t 2.
Based on the above description of the embodiment of the method for processing audio and video data, a polling method for multiple 8-channel video walls is introduced below, and as shown in fig. 6, in a video conference application scenario, multiple 8-channel video walls are set so as to display audio and video data of participants. The polling module of the 8-channel television wall can be arranged in the control application program of the video conference, and the polling module of the 8-channel television wall can control the polling process of a plurality of 8-channel television walls. In a specific implementation process, before a video conference starts, a plurality of 8-channel television walls can be set according to conditions such as the number of participants and the like, a terminal list of the participants to be polled is specified for each 8-channel television wall, then a polling thread is used for sending an instruction for displaying audio and video data to each sub-core of each 8-channel television wall to a conference management server at regular time, and the conference management server can return the audio and video data to be displayed to each sub-core of each 8-channel television wall according to the instruction.
As shown in fig. 7, if 80 participants exist in the video conference, video data of 80 participants need to be sequentially displayed at a main meeting place of the video conference, 5 tv walls with 8 channels may be configured for the video conference, a first tv wall with 8 channels polls and displays video data of the first 1 st to 16 th participants, a second tv wall with 8 channels polls and displays video data of the second 17 th to 32 th participants, a third tv wall with 8 channels polls and displays video data of the third 33 th to 48 th participants, a fourth tv wall with 8 channels polls and displays video data of the fourth 49 th to 64 th participants, and a fifth tv wall with 8 channels polls and displays video data of the fifth to 80 th participants. The polling module of the 8-channel television wall may start a polling thread 01 for a first 8-channel television wall, a second 8-channel television wall, a third 8-channel television wall, and a fourth 8-channel television wall, where the polling thread 01 may send an instruction to the conference management server, so that the conference management server returns video data of 1 st to 16 th participants to the first 8-channel television wall, returns video data of 17 th to 32 th participants to the second 8-channel television wall, returns video data of 33 th to 48 th participants to the third 8-channel television wall, and returns video data of 49 th to 64 th participants to the fourth 8-channel television wall. The polling module of the 8-channel video wall may start a polling thread 02 for the fifth 8-channel video wall, and the polling thread 02 may send an instruction to the conference management server, so that the conference management server returns the video data of the 65 th to the 80 th participants to the fifth 8-channel video wall.
The embodiment of the invention is applied to the video network, wherein the video network can comprise a first terminal, a plurality of second terminals and a node server, and the node server can be respectively in communication connection with the first terminal and the plurality of second terminals. In a video conference application scenario, the first terminal may be a control terminal of a video conference, the second terminal may be an 8-channel video wall, and the node server may be a video conference management server.
In the embodiment of the invention, a user can input configuration information for the second terminal in the first terminal through configuration operation, and the first terminal acquires the configuration information in response to the configuration operation of the user and stores the configuration information into a preset polling task list. The method comprises the steps that a first terminal starts one or more preset polling threads, one or more polling threads are utilized to send a playing instruction of audio and video data to a node server according to a preset period, the node server obtains the audio and video data corresponding to the playing instruction according to the playing instruction and returns the audio and video data to a second terminal, and the second terminal plays the received audio and video data according to respective configuration information.
The technical scheme provided by the embodiment of the invention is applied to a video conference application scene, configuration information can be input for a plurality of 8-channel television walls, one or more polling threads send a playing instruction to a video conference management server, so that the plurality of 8-channel television walls can work simultaneously, and compared with the situation that only one 8-channel television wall displays audio and video data of a participant, the number of channels for simultaneously displaying the audio and video data is increased, and the efficiency for displaying the audio and video data is improved.
In a video conference application scene, the embodiment of the invention does not need to set a timer in a control application program of the video conference and use the timer to control an 8-channel television wall to poll the video data of the participants, thereby reducing the operating pressure of the control application program of the video conference.
Moreover, when more 8-channel television walls execute polling tasks in a video conference application scene, a plurality of polling threads can be started, and the plurality of polling threads are utilized to communicate with the conference management server, so that the parallel processing capacity of the control terminal of the video conference is improved.
If a main meeting place and branch meeting places exist in the video conference application scene, 8 paths of television walls can be arranged in the branch meeting places, so that audio and video data of participants can be displayed in the branch meeting places.
It should be noted that, for simplicity of description, the method embodiments are described as a series of acts or combination of acts, but those skilled in the art will recognize that the present invention is not limited by the illustrated order of acts, as some steps may occur in other orders or concurrently in accordance with the embodiments of the present invention. Further, those skilled in the art will appreciate that the embodiments described in the specification are presently preferred and that no particular act is required to implement the invention.
Referring to fig. 8, a block diagram of an embodiment of a processing apparatus for audio and video data according to an embodiment of the present invention is shown, where the processing apparatus is applied to a video network, and the video network includes a plurality of second terminals and a node server, where the node server is in communication connection with the processing apparatus and the plurality of second terminals, respectively; the apparatus may include the following modules:
an obtaining module 801, configured to obtain, in response to a configuration operation by a user, a plurality of pieces of configuration information for the plurality of second terminals; a storage module 802, configured to store the multiple pieces of configuration information in a preset polling task list; a starting module 803, configured to start one or more preset polling threads; the sending module 804 is configured to send a playing instruction of audio and video data to the node server according to a preset period by using the one or more polling threads, the node server obtains the audio and video data corresponding to the playing instruction according to the playing instruction and returns the audio and video data to the plurality of second terminals, and the plurality of second terminals play the received audio and video data according to the plurality of configuration information.
In a preferred embodiment of the present invention, the starting module 803 includes: a statistic submodule 8031, configured to count the number of the plurality of pieces of configuration information in the polling task list; a start-up sub-module 8032 configured to start the one or more polling threads according to the number.
In a preferred embodiment of the present invention, the starting module 8032 is configured to start the polling thread when the number is smaller than or equal to a preset terminal threshold; when the number is larger than the terminal threshold value, starting the plurality of polling threads; the video network further comprises a plurality of third terminals, the plurality of third terminals comprise source terminals of the audio and video data, and the plurality of configuration information comprise polling interval information and identification information of the source terminals; the playing instruction comprises identification information of the source terminal; and the plurality of second terminals circularly play the received audio and video data according to the polling interval information.
For the embodiment of the processing device of the audio and video data, because the embodiment is basically similar to the embodiment of the processing method of the audio and video data, the description is relatively simple, and relevant points can be referred to partial description of the embodiment of the processing method of the audio and video data.
An embodiment of the present invention further provides an apparatus, including:
one or more processors; and
one or more machine-readable media having instructions stored thereon, which when executed by the one or more processors, cause the apparatus to perform a method of processing audiovisual data as described in an embodiment of the invention.
The embodiment of the invention also provides a computer-readable storage medium, and a stored computer program enables a processor to execute the processing method of the audio and video data according to the embodiment of the invention.
The embodiments in the present specification are described in a progressive manner, each embodiment focuses on differences from other embodiments, and the same and similar parts among the embodiments are referred to each other.
As will be appreciated by one skilled in the art, embodiments of the present invention may be provided as a method, apparatus, or computer program product. Accordingly, embodiments of the present invention may take the form of an entirely hardware embodiment, an entirely software embodiment or an embodiment combining software and hardware aspects. Furthermore, embodiments of the present invention may take the form of a computer program product embodied on one or more computer-usable storage media (including, but not limited to, disk storage, CD-ROM, optical storage, and the like) having computer-usable program code embodied therein.
Embodiments of the present invention are described with reference to flowchart illustrations and/or block diagrams of methods, terminal devices (systems), and computer program products according to embodiments of the invention. It will be understood that each flow and/or block of the flow diagrams and/or block diagrams, and combinations of flows and/or blocks in the flow diagrams and/or block diagrams, can be implemented by computer program instructions. These computer program instructions may be provided to a processor of a general purpose computer, special purpose computer, embedded processor, or other programmable data processing terminal to produce a machine, such that the instructions, which execute via the processor of the computer or other programmable data processing terminal, create means for implementing the functions specified in the flowchart flow or flows and/or block diagram block or blocks.
These computer program instructions may also be stored in a computer-readable memory that can direct a computer or other programmable data processing terminal to function in a particular manner, such that the instructions stored in the computer-readable memory produce an article of manufacture including instruction means which implement the function specified in the flowchart flow or flows and/or block diagram block or blocks.
These computer program instructions may also be loaded onto a computer or other programmable data processing terminal to cause a series of operational steps to be performed on the computer or other programmable terminal to produce a computer implemented process such that the instructions which execute on the computer or other programmable terminal provide steps for implementing the functions specified in the flowchart flow or flows and/or block diagram block or blocks.
While preferred embodiments of the present invention have been described, additional variations and modifications of these embodiments may occur to those skilled in the art once they learn of the basic inventive concepts. Therefore, it is intended that the appended claims be interpreted as including preferred embodiments and all such alterations and modifications as fall within the scope of the embodiments of the invention.
Finally, it should also be noted that, herein, relational terms such as first and second, and the like may be used solely to distinguish one entity or action from another entity or action without necessarily requiring or implying any actual such relationship or order between such entities or actions. Also, the terms "comprises," "comprising," or any other variation thereof, are intended to cover a non-exclusive inclusion, such that a process, method, article, or terminal that comprises a list of elements does not include only those elements but may include other elements not expressly listed or inherent to such process, method, article, or terminal. Without further limitation, an element defined by the phrase "comprising an … …" does not exclude the presence of other like elements in a process, method, article, or terminal that comprises the element.
The method and the device for processing audio and video data, the device and the computer-readable storage medium provided by the invention are described in detail, and a specific example is applied in the text to explain the principle and the implementation of the invention, and the description of the above embodiment is only used to help understand the method and the core idea of the invention; meanwhile, for a person skilled in the art, according to the idea of the present invention, there may be variations in the specific embodiments and the application scope, and in summary, the content of the present specification should not be construed as a limitation to the present invention.

Claims (9)

1. The method for processing the audio and video data is characterized by being applied to a video conference of a video network, wherein the video network comprises a first terminal, a plurality of second terminals and a node server, and the node server is in communication connection with the first terminal and the plurality of second terminals respectively; the method comprises the following steps:
the first terminal responds to configuration operation of a user and acquires a plurality of pieces of configuration information aiming at the plurality of second terminals;
the first terminal stores the plurality of configuration information into a preset polling task list;
the first terminal starts one or more polling threads according to the quantity of the configuration information, and sends a playing instruction of audio and video data to the node server according to a preset period by using the one or more polling threads, the node server acquires the audio and video data corresponding to the playing instruction according to the playing instruction and returns the audio and video data to the second terminals, and the second terminals play the received audio and video data according to the configuration information;
the video network further comprises a plurality of third terminals, the third terminals comprise source terminals of the audio and video data, and the configuration information comprises polling interval information and identification information of the source terminals.
2. The method for processing audio/video data according to claim 1, wherein the step of starting one or more polling threads by the first terminal according to the amount of the configuration information includes:
the first terminal counts the number of the plurality of configuration information in the polling task list;
and the first terminal starts the one or more polling threads according to the number.
3. The method for processing audio/video data according to claim 2, wherein the step of starting the one or more polling threads by the first terminal according to the number includes:
when the number of the first terminals is less than or equal to a preset terminal threshold value, starting the polling thread;
and the first terminal starts the plurality of polling threads when the number of the first terminals is larger than the terminal threshold value.
4. The method for processing audio/video data according to claim 1, wherein the playing instruction includes identification information of the source terminal;
and the plurality of second terminals circularly play the received audio and video data according to the polling interval information.
5. The processing device for the audio and video data is characterized in that the processing device is applied to a video conference of a video network, wherein the video network comprises a plurality of second terminals and a node server, and the node server is respectively in communication connection with the processing device and the plurality of second terminals; the processing device comprises:
an obtaining module, configured to obtain, in response to a configuration operation by a user, a plurality of pieces of configuration information for the plurality of second terminals;
the storage module is used for storing the plurality of configuration information into a preset polling task list;
the starting module is used for starting one or more polling threads according to the quantity of the configuration information;
the transmitting module is used for transmitting a playing instruction of audio and video data to the node server according to a preset period by using the one or more polling threads, the node server acquires the audio and video data corresponding to the playing instruction according to the playing instruction and returns the audio and video data to the plurality of second terminals, and the plurality of second terminals play the received audio and video data according to the plurality of configuration information; the video network further comprises a plurality of third terminals, the third terminals comprise source terminals of the audio and video data, and the configuration information comprises polling interval information and identification information of the source terminals.
6. The device for processing audio-video data according to claim 5, wherein the starting module comprises:
the counting submodule is used for counting the quantity of the plurality of configuration information in the polling task list;
and the starting sub-module is used for starting the one or more polling threads according to the number.
7. The device for processing audio/video data according to claim 6, wherein the starting sub-module is configured to start the polling thread when the number is less than or equal to a preset terminal threshold; when the number is larger than the terminal threshold value, starting the plurality of polling threads;
the playing instruction comprises identification information of the source terminal;
and the plurality of second terminals circularly play the received audio and video data according to the polling interval information.
8. An apparatus, comprising:
one or more processors; and
one or more machine-readable media having instructions stored thereon, which when executed by the one or more processors, cause the apparatus to perform a method of processing audiovisual data as claimed in any of claims 1 to 4.
9. A computer-readable storage medium, characterized in that it stores a computer program causing a processor to execute the processing method of audio-visual data according to any one of claims 1 to 4.
CN201910319090.3A 2019-04-19 2019-04-19 Audio and video data processing method, device and storage medium Active CN110062194B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910319090.3A CN110062194B (en) 2019-04-19 2019-04-19 Audio and video data processing method, device and storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201910319090.3A CN110062194B (en) 2019-04-19 2019-04-19 Audio and video data processing method, device and storage medium

Publications (2)

Publication Number Publication Date
CN110062194A CN110062194A (en) 2019-07-26
CN110062194B true CN110062194B (en) 2022-01-11

Family

ID=67319760

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910319090.3A Active CN110062194B (en) 2019-04-19 2019-04-19 Audio and video data processing method, device and storage medium

Country Status (1)

Country Link
CN (1) CN110062194B (en)

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111711791A (en) * 2020-06-12 2020-09-25 西安万像电子科技有限公司 Data processing method and device
CN111988570A (en) * 2020-08-25 2020-11-24 西安万像电子科技有限公司 Image data display method and device and image data display system
CN113542370B (en) * 2021-06-29 2023-11-03 北京达佳互联信息技术有限公司 Method and device for processing active broadcast information, electronic equipment and storage medium

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101783919A (en) * 2009-12-10 2010-07-21 上海华平信息技术股份有限公司 Round-robin method based on video conference
CN104254014A (en) * 2014-09-25 2014-12-31 福建星网锐捷安防科技有限公司 Video on-wall configuration method and system for analog TV wall
CN105208358A (en) * 2015-11-04 2015-12-30 武汉微创光电股份有限公司 Video monitoring system for video-wall configuration
CN107809609A (en) * 2017-11-16 2018-03-16 王磊 Video monitoring conference system based on touch apparatus
CN109640029A (en) * 2018-11-16 2019-04-16 视联动力信息技术股份有限公司 The method and apparatus that wall is shown on a kind of video flowing

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8660913B2 (en) * 2009-12-15 2014-02-25 A Big Blessed Family, Llc Remote servicing system

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101783919A (en) * 2009-12-10 2010-07-21 上海华平信息技术股份有限公司 Round-robin method based on video conference
CN104254014A (en) * 2014-09-25 2014-12-31 福建星网锐捷安防科技有限公司 Video on-wall configuration method and system for analog TV wall
CN105208358A (en) * 2015-11-04 2015-12-30 武汉微创光电股份有限公司 Video monitoring system for video-wall configuration
CN107809609A (en) * 2017-11-16 2018-03-16 王磊 Video monitoring conference system based on touch apparatus
CN109640029A (en) * 2018-11-16 2019-04-16 视联动力信息技术股份有限公司 The method and apparatus that wall is shown on a kind of video flowing

Also Published As

Publication number Publication date
CN110062194A (en) 2019-07-26

Similar Documents

Publication Publication Date Title
CN108965224B (en) Video-on-demand method and device
CN109462753B (en) System and method for testing multiple video conferences
CN109168064B (en) Synchronous display method and system of electronic data
CN109120879B (en) Video conference processing method and system
CN110190973B (en) Online state detection method and device
CN109547728B (en) Recorded broadcast source conference entering and conference recorded broadcast method and system
CN110475090B (en) Conference control method and system
CN109587002B (en) State detection method and system for video network monitoring equipment
CN110049273B (en) Video networking-based conference recording method and transfer server
CN110062194B (en) Audio and video data processing method, device and storage medium
CN109040656B (en) Video conference processing method and system
CN111654713A (en) Live broadcast interaction method and device
CN109743284B (en) Video processing method and system based on video network
CN110289974B (en) Data stream processing method, system and device and storage medium
CN110446058B (en) Video acquisition method, system, device and computer readable storage medium
CN109963107B (en) Audio and video data display method and system
CN110022500B (en) Packet loss processing method and device
CN109698953B (en) State detection method and system for video network monitoring equipment
CN109474661B (en) Method and system for processing network request event
CN110113555B (en) Video conference processing method and system based on video networking
CN110852692A (en) Conference scoring method and device, electronic equipment and storage medium
CN110798450B (en) Audio and video data processing method and device and storage medium
CN110557411A (en) video stream processing method and device based on video network
CN110417933B (en) Business object processing method and device and storage medium
CN110139060B (en) Video conference method and device

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant