CN108932948A - Audio data processing method, device, computer equipment and computer readable storage medium - Google Patents

Audio data processing method, device, computer equipment and computer readable storage medium Download PDF

Info

Publication number
CN108932948A
CN108932948A CN201710386977.5A CN201710386977A CN108932948A CN 108932948 A CN108932948 A CN 108932948A CN 201710386977 A CN201710386977 A CN 201710386977A CN 108932948 A CN108932948 A CN 108932948A
Authority
CN
China
Prior art keywords
audio
data
communication
asynchronous
state
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201710386977.5A
Other languages
Chinese (zh)
Other versions
CN108932948B (en
Inventor
赵晓强
罗程
李斌
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Tencent Technology Shenzhen Co Ltd
Original Assignee
Tencent Technology Shenzhen Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Tencent Technology Shenzhen Co Ltd filed Critical Tencent Technology Shenzhen Co Ltd
Priority to CN201710386977.5A priority Critical patent/CN108932948B/en
Publication of CN108932948A publication Critical patent/CN108932948A/en
Application granted granted Critical
Publication of CN108932948B publication Critical patent/CN108932948B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/16Vocoder architecture
    • G10L19/167Audio streaming, i.e. formatting and decoding of an encoded audio signal representation into a data stream for transmission or storage purposes
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/16Vocoder architecture
    • G10L19/18Vocoders using multiple modes
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/16Vocoder architecture
    • G10L19/18Vocoders using multiple modes
    • G10L19/20Vocoders using multiple modes using sound class specific coding, hybrid encoders or object based coding

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Telephonic Communication Services (AREA)

Abstract

The present invention relates to a kind of audio data processing method, device, computer equipment and storage mediums, including:Voice data stream is obtained by unified interface, obtains the corresponding present video communications status of voice data stream;From the acquisition of Unified coding module and the matched target code algorithm of present video communications status, the voice data stream is encoded using target code algorithm to obtain coded audio data;Following steps are executed by the way that module is uniformly processed:Matched target audio tupe is determined from optional audio processing mode according to present video communications status, optional audio processing mode includes audio file mode and real-time audio frame pattern, is handled according to target audio tupe coded audio data to obtain audio data to be sent;Target network channel is determined from optional network channel according to present video communications status, by audio data to be sent by the target network channel transfer, is reduced cost and is improved efficiency.

Description

Audio data processing method, device, computer equipment and computer readable storage medium
Technical field
The present invention relates to field of computer technology, set more particularly to a kind of audio data processing method, device, computer Standby and computer readable storage medium.
Background technique
With the development of computer technology, the application that net torpedo technology be combined with each other is in people's daily life It is increasingly common.For better exchange and interdynamic, user can be by microphone input voice, music etc., to carry out Instant Messenger Believe session, amusement, working and learning.
Traditional voice applications, when realizing asynchronous audio message and Audio communication function, by introducing two sets not The problem of same voice infrastructure scheme is realized, there are more set codes while accessing system audio resource, causing complex management, and it is real Existing higher cost.
Summary of the invention
Based on this, it is necessary in view of the above technical problems, provide it is a kind of asynchronous audio and real-time audio are passed through it is unified Technical Architecture realize audio data processing method, device, computer equipment and computer readable storage medium, reduce cost and Improve the efficiency of audio resource management.
A kind of audio data processing method, the method includes:
Voice data stream is obtained by unified interface, obtains the corresponding present video communications status of the voice data stream;
From the acquisition of Unified coding module and the matched target code algorithm of the present video communications status, using the mesh Mark encryption algorithm encodes the voice data stream to obtain coded audio data;
Following steps are executed by the way that module is uniformly processed:
Determine that matched target audio handles mould from optional audio processing mode according to the present video communications status Formula, the optional audio processing mode include audio file mode and real-time audio frame pattern, are handled according to the target audio Mode is handled to obtain audio data to be sent to the coded audio data;
Target network channel is determined from optional network channel according to the present video communications status, it will be described to be sent Audio data passes through the target network channel transfer.
A kind of audio-frequency data processing device, described device include:
Module is obtained, for obtaining voice data stream by unified interface, it is corresponding current to obtain the voice data stream Voice communication state;
Unified coding module, for acquisition and the matched target code algorithm of the present video communications status, using institute Target code algorithm is stated the voice data stream is encoded to obtain coded audio data;
Module is uniformly processed, including:
Packaged unit, for determining matched mesh from optional audio processing mode according to the present video communications status Audio processing mode is marked, the optional audio processing mode includes audio file mode and real-time audio frame pattern, according to described Target audio tupe is handled to obtain audio data to be sent to the coded audio data;
Transmission unit, for determining that target network is logical from optional network channel according to the present video communications status The audio data to be sent is passed through the target network channel transfer by road.
A kind of computer equipment, which is characterized in that including memory and processor, store computer in the memory Readable instruction, when the computer-readable instruction is executed by the processor, so that the processor executes any of the above-described reality The step of applying the example audio data processing method.
A kind of computer readable storage medium, which is characterized in that calculating is stored on the computer readable storage medium Machine executable instruction, when the computer executable instructions are executed by processor, so that the processor executes any of the above-described Described in embodiment the step of audio data processing method.
Above-mentioned audio data processing method, device, computer equipment and computer readable storage medium, pass through unified interface Obtain voice data stream, obtain the corresponding present video communications status of voice data stream, from Unified coding module obtain with it is described The matched target code algorithm of present video communications status encode to the voice data stream using target code algorithm To coded audio data, following steps are executed by the way that module is uniformly processed:According to present video communications status from optional audio Matched target audio tupe is determined in reason mode, optional audio processing mode includes audio file mode and real-time audio Frame pattern is handled to obtain audio data to be sent according to target audio tupe to the coded audio data, according to Present video communications status determines target network channel from optional network channel, and audio data to be sent is passed through target network Channel transfer generates audio data to be sent from obtaining voice data stream, coding, being packaged for different voice communication states It to transmission, is all handled using unified interface and module, unification is carried out to the audio data under different voice communication states Management and distribution, realize the unification of framework, improve the convenience of audio resource management, it is only necessary to use a set of code energy It realizes the audio data processing under different voice communication states, greatly reduces the fusion cost between isomery voice system.
Detailed description of the invention
Fig. 1 is the applied environment figure of one embodiment sound intermediate frequency data processing method;
Fig. 2 is the internal structure chart of terminal in Fig. 1 in one embodiment;
Fig. 3 is the flow chart of one embodiment sound intermediate frequency data processing method;
Fig. 4 is the flow chart of another embodiment sound intermediate frequency data processing method;
Fig. 5 is the flow chart for switching adjustment audio status in one embodiment according to voice communication state;
Fig. 6 is the flow chart in one embodiment according to voice communication state processing and transmission;
Fig. 7 is one embodiment sound intermediate frequency communication interface schematic diagram;
Fig. 8 is one embodiment sound intermediate frequency data processing system configuration diagram;
Fig. 9 is the structural block diagram of one embodiment sound intermediate frequency data processing equipment;
Figure 10 is the structural block diagram of another embodiment sound intermediate frequency data processing equipment;
Figure 11 is the structural block diagram of one embodiment sound intermediate frequency communications status switching module;
Figure 12 is the structural block diagram of another embodiment sound intermediate frequency communications status switching module;
Figure 13 is the structural block diagram that module is uniformly processed in one embodiment;
Figure 14 is the structural block diagram of further embodiment sound intermediate frequency data processing equipment;
Figure 15 is the structural block diagram of another embodiment sound intermediate frequency data processing equipment.
Specific embodiment
Fig. 1 is the applied environment figure of one embodiment sound intermediate frequency data processing method operation.As shown in Figure 1, this applies ring Border includes first terminal 110, server 120, second terminal 130, wherein first terminal 110, server 120, second terminal 130 It is communicated by network, wherein first terminal 110 can send realaudio data or asynchronous announcement frequency by server 120 According to can realize seamless switching under Audio communication state and asynchronous audio communications status.According to different present videos Communications status, first terminal 110 realize the selection of encryption algorithm, audio processing by unified code and unified voice infrastructure The selection of mode, the selection according to target audio tupe to audio data packing and network channel are logical according to target network Audio data to be sent is sent to server by road, realizes unified management and calling to audio resource, so that server will be to It sends audio data and is sent to target terminal second terminal 130.First terminal 110 can receive the real-time of the transmission of second terminal 130 Audio data or asynchronous audio data, and played out in first terminal.
First terminal 110 and second terminal 130 can be smart phone, tablet computer, laptop, desktop computer Deng however, it is not limited to this.First terminal 110, second terminal 130 can send audio forwarding to server 120 by network and ask It asks, the request that server 120 can respond first terminal 110, second terminal 130 is sent returns to corresponding audio resource.First Terminal 110, second terminal 130 can be one or more, and server 120 can be individual server or server cluster.
In one embodiment, the internal structure of the first terminal 110 in Fig. 1 is as shown in Fig. 2, the first terminal 110 wraps Include processor, graphics processing unit, storage medium, memory, network interface, display screen and the input connected by system bus Equipment.Wherein, the storage medium of first terminal 110 is stored with operating system, further includes audio-frequency data processing device, which uses In realizing a kind of audio data processing method suitable for terminal.The processor supports whole for providing calculating and control ability The operation of a first terminal 110.Graphics processing unit in first terminal 110 is at least providing the drafting energy of display interface Power, inside save as the audio-frequency data processing device in storage medium operation provide environment, network interface be used for server 120 into Row network communication.Display screen such as shows real time communication interface, input equipment is for receiving user for showing application interface etc. The order of input or audio data etc., input equipment include microphone.For the first terminal 110 with touch screen, screen is shown It can be touch screen with input equipment.Structure shown in Fig. 1, the only block diagram of part-structure relevant to application scheme, The restriction for the terminal being applied thereon to application scheme is not constituted, specific terminal may include more than as shown in the figure Or less component, perhaps combine certain components or with different component layouts.
In one embodiment, as shown in figure 3, providing a kind of audio data processing method, to be applied to above-mentioned application First terminal or second terminal in environment come for example, including the following steps:
Step S210 obtains voice data stream by unified interface, obtains the corresponding present video communication of voice data stream State.
Specifically, voice data stream is the voice data or audio file recorded by the scene that audio collecting device acquires The music data etc. of broadcasting, voice data stream can be wave data of the primary voice data Jing Guo basic coding, raw tone Data are the analog signals of system hardware acquisition, and it is digital signal that voice data stream, which is byte stream,.Unified interface is to carry out logic Corresponding function can be completed without concerned with internal realization by unified interface in the method for calling externally provided after encapsulation Can, unified interface may include audio recording interface, recording rights interface, audio frequency play interface etc., either asynchronous audio data Or realaudio data is all obtained by unified interface, guarantees the audio under different voice communication states using unified data Stream format improves the convenience of different voice communication state subaudio frequency resource managements.In one embodiment, voice data stream For pcm encoder data flow, whether asynchronous audio or real-time audio, are all acquired by unified interface using identical data Process obtains pcm encoder data flow.
Wherein, present video communications status refers to the type of present video communication, including Audio communication and asynchronous announcement Frequency communicates.Wherein Audio communication acquired in real time by voice, real-time coding, data transmission, decoding, the technologies hand such as noise reduction Section realizes voice data real-time Transmission and broadcasting, and transmission is audio byte stream, and Audio communication is generally required by drop Make an uproar, quantify after encoded again, increase the compression ratio of data.It needed when Audio communication through calling, connection, connect and establish Real-time communication link simultaneously keeps connection status, if voice call communication is Audio communication.Asynchronous audio passes through voice Acquisition, voice coding, voice document transmission, voice play, and voice data asynchronous transmission are realized, first complete audio data recording At audio file, then by audio file transmissions to opposite end, transmission is to record the audio file completed.
Present video communications status can be determined and in different voice communication states by acting on the operation of communication interface Between switch over.Such as in text communication interface, present video communications status is cut by closed state by the first predetermined registration operation It is changed to Audio communication, by the second predetermined registration operation at Audio communication interface by present video communications status by real-time Voice communication is switched to asynchronous audio communication.The first predetermined registration operation, second predetermined registration operation can with gesture operation, touch screen operation, Voice command etc..Present video communications status can be distinguished by preset characters, and such as 0 indicates Audio communication, and 1 indicates different Voice communication is walked, present video communications status can be associated with the voice data stream of input, to quickly determine voice data stream pair The present video communications status answered.Can also be by real-time detection interface state, such as whether detection asynchronous audio key is pressed inferior, obtains To present video communications status, the corresponding present video communications status of voice data stream is determined.
In one embodiment, audio data processing method by interface layer and realizes that the framework of layer is realized, is a set of system One code, wherein interface layer includes that present video communications status determines interface, realizes that layer is used for by determining according to interface layer Present video communications status realize a series of processing of Different Logic to completing different codings, packing and transmission.
Step S220 is obtained and the matched target code algorithm of present video communications status, use from Unified coding module Target code algorithm encodes voice data stream to obtain coded audio data.
Specifically, target code algorithm can be configured by interface layer, as asynchronous audio communication is calculated with MP3 encoding and decoding Method matching, Audio communication and silk encoding and decoding algorithmic match.The matched target code algorithm of present video communications status can It is customized in advance as needed, real-time network state, audio data characteristics Dynamic Matching can also be passed through.Unified coding module is The module that can independently extend is integrated with optional encoder under different voice communication states, realizes system by expansible form One coding module, when needing to add encoder, only need to be realized and be added by expansible interface for newly-increased code requirement Add encoder.The encoder under different voice communication states is managed collectively and is distributed by Unified coding module, is realized The unification of framework.
Step S230 and step S240 is executed by the way that module is uniformly processed:
Step S230 is determined from matched target audio according to present video communications status from optional audio processing mode Reason mode, optional audio processing mode includes audio file mode and real-time audio frame pattern, according to target audio tupe Coded audio data are handled to obtain audio data to be sent.
Specifically, audio processing mode is used to carrying out coded audio data into processing to generate and present video communications status The audio data to be sent matched can be arranged according to the corresponding demand of present video communications status for different voice communication states Different audio processing modes, if asynchronous audio communicates corresponding audio file mode, Audio communication corresponds to real-time audio frame Mode.Audio file mode, which refers to, complete coded audio data to be written in the audio file of preset format, real-time audio frame Mode is that coded audio data is packaged into audio frame in real time and is sent immediately, and guarantee is packaged and sent using streaming in real time Property.
In one embodiment, current network state and coded audio data characteristics, such as length are obtained, mesh is dynamically determined The corresponding processing parameter of audio processing mode is marked, for real-time audio frame pattern, processing parameter includes the frame length, superfluous of audio frame Wrong data policy parameter etc. includes compressing file rate etc. for audio file mode.
Step S240 determines target network channel according to present video communications status from optional network channel, will be pending Audio data is sent to pass through target network channel transfer.
Specifically, network channel refers to the channel of transmitting network data, and different network channels corresponds to different networks and passes Matching relationship can be arranged in advance in voice communication state and network channel by defeated agreement, as asynchronous audio communicates corresponding first network Channel, corresponding second network channel of Audio communication, to can be obtained according to configuration relation according to present video communications status Corresponding target network channel.For audio data to be sent by target network channel transfer, different voice communication states are corresponding Audio data to be sent is transmitted by different network channels.
Since step S230 and step S240 are handled by the way that module is uniformly processed, guarantee Audio communication and different It walks voice communication to realize using unified code, audio resource is called by unified code, convenient for management, asynchronous audio is led to Letter and Audio communication are realized by unified Technical Architecture.
In one embodiment, be uniformly processed in module includes that the first data acquisition readjustment Processing Interface and the second data are adopted Collection readjustment Processing Interface, the first data acquisition readjustment Processing Interface is corresponding with Audio communication, and the second data acquire at readjustment It is corresponding with asynchronous audio communication to manage interface, the first data acquisition readjustment Processing Interface defines and the matched sound of Audio communication Frequency tupe and network channel define in the second data acquisition readjustment Processing Interface and communicate matched audio with asynchronous audio Tupe and network channel, to need to only determine that corresponding data acquisition readjustment processing connects according to present video communications status Mouthful, so that it may readjustment Processing Interface is acquired according to data and calls corresponding process flow, is improved between different voice communication states Manage the independence of process.
In the present embodiment, voice data stream is obtained by unified interface, it is logical to obtain the corresponding present video of voice data stream Letter state is compiled from the acquisition of Unified coding module and the matched target code algorithm of the present video communications status using target Code algorithm encodes the voice data stream to obtain coded audio data, executes following steps by the way that module is uniformly processed: Matched target audio tupe is determined from optional audio processing mode according to present video communications status, at optional audio Reason mode includes audio file mode and real-time audio frame pattern, according to target audio tupe to the coded audio data It is handled to obtain audio data to be sent, determines that target network is logical from optional network channel according to present video communications status Road, by audio data to be sent by target network channel transfer, for different voice communication states, from acquisition audio data Stream, coding are packaged generation audio data to be sent to transmitting, and are all handled using unified interface and module, to not unisonance Audio data under frequency communications status is managed collectively and is distributed, and the unification of framework is realized, and improves audio resource management Convenience, it is only necessary to using a set of code can be achieved with the audio data under different voice communication states processing, greatly reduce Fusion cost between isomery voice system.
In one embodiment, as shown in figure 4, further including before step S210:
Step S310 detects voice communication state, when detecting the presence of the switching of voice communication state, passes through unified sound Frequency configuration management interface determines audio status management parameters according to voice communication state switch data, is joined according to audio status management Number adjustment present video state.
It specifically, can be by identifying that user's operation, such as gesture operation, touch operation, preset sound order judge audio Whether communications status switches, the mesh for then recording the former voice communication state before switching in case of switching and needing to be switched to Mark with phonetic symbols frequency communications status generates voice communication state switch data.Audio configuration management interface is for determining audio status management So as to adjust audio status, audio configuration management includes audio rights management, records management, plays management and audio processing parameter Management, audio rights management include device authorization management including recording authorization etc., and recording management includes recording audio type, record Audio frequency parameter setting processed, recording state etc., playing management includes playing type, and asynchronous audio file in this way plays, or in real time Audio stream broadcasting, broadcast state, such as whether stopping the management of broadcasting etc., audio processing is managed mainly according to current voice communication State determines readjustment Processing Interface accordingly, is such as that uniform management module distribution present video communicates shape by audio processing management The corresponding first data acquisition readjustment Processing Interface of state.Corresponding audio status management parameters are determined by each administrative section, Such as sample rate, broadcast state parameter, if 0 is plays, 1 is stops broadcasting etc., to be worked as according to the adjustment of audio status management parameters Preceding audio status will such as receive audio code stream and be adjusted to abandon audio code stream, be realized not by the adjustment to present video state With the conversion between audio communication type.
In the present embodiment, audio is determined according to voice communication state switch data by unified audio configuration management interface Condition managing parameter provides the unified management of system audio resource when converting between different voice communication types, avoids system sound The complexity of frequency resource management.
In one embodiment, as shown in figure 5, voice communication state is detected in step S310, when detecting the presence of audio When communications status switches, audio status is determined according to voice communication state switch data by unified audio configuration management interface The step of management parameters includes:
Step S311 keeps real-time sound when voice communication state is switched to asynchronous audio communication by Audio communication Frequency communication link is connection status, modifies play parameter, decoded state parameter and record by unified audio configuration management interface Sound configuration parameter.
Specifically, Audio communication can be switched to asynchronous announcement by acting on the operation at Audio communication interface Frequency communicates.It may specify that the destinations traffic user of asynchronous audio communication defaults current Audio communication session if do not specified In all users be destinations traffic user.Holding Audio communication link is connection status, it is ensured that logical from asynchronous audio Believe seamless return Audio communication again.
In one embodiment, the grouping different for the user setting in Audio communication session, by real-time audio Communication determines current asynchronous voice communication corresponding group's mark while being switched to asynchronous audio communication, so as to according to group Mark quickly determines corresponding destinations traffic user.It can be wherein grouped, be led to by user gradation for the different grouping of user setting Good friend's close relationship degree grouping etc. is crossed, can be preconfigured fixed grouping, can also be Audio communication session according to history The grouping of communication behavior data dynamic setting.
Play parameter is determined as real-time audio and stops playing, decoded state parameter is determined as real-time sound by step S312 Frequency is updated to recording configuration parameter to communicate matched state with asynchronous audio according to stopping decoding.
Specifically, it due to being to be switched to asynchronous audio communication from Audio communication, needs to terminate decoded real-time The broadcasting of audio stops playing so play parameter is determined as real-time audio, needs to stop the received real-time sound of coding The decoding of frequency evidence needs the configuration parameter that will record so decoded state parameter, which is determined as realaudio data, stops decoding It modifies, guarantees the accurate recording of asynchronous audio data, so recording configuration parameter is updated to communicate with asynchronous audio The state matched.
Include according to the step of audio status management parameters adjustment present video state in step S310:
Step S313 stops having decoded the broadcasting of realaudio data, stops realaudio data according to play parameter The realaudio data to be decoded of receipt of subsequent is abandoned, starts asynchronous audio according to updated recording configuration parameter by decoding The acquisition of data.
Specifically, the broadcasting for stopping having decoded realaudio data can avoid when recording asynchronous audio data, real-time sound It is interfered caused by frequency sound, and stops the decoding of realaudio data, the realaudio data to be decoded of receipt of subsequent is abandoned, To avoid and the unmatched invalidation of present video communications status.And started according to updated recording configuration parameter asynchronous The acquisition of audio data, to start asynchronous audio communication.
In the present embodiment, by unified audio configuration management interface, realizes and lead to from Audio communication to asynchronous audio The switching of letter, whole process is by audio status management parameters adjust automatically, in the state of without terminating real-time audio connection, Realize seamless switching.
In one embodiment, after the step of adjusting present video state according to audio status management parameters, further include:
When voice communication state is switched to Audio communication by asynchronous audio communication, it is in Audio communication link When connection status, play parameter, decoded state parameter and recording configuration parameter are repaired by unified audio configuration management interface It is changed to the parameter to match with Audio communication, adjusts present video state according to audio status management parameters, restores real-time Voice communication.
Specifically, when switching between different audio status, in switching, it can determine whether state switching meets switching It is logical directly cannot to be switched to real-time audio from asynchronous audio communication such as when Audio communication link is off-state for condition Letter, because the connection of Audio communication link needs the entire response process from calling, connection, connection.Only in real-time sound When frequency communication link is connection status, it could be communicated from asynchronous audio and switch seamlessly to Audio communication.Pass through unified sound Play parameter, decoded state parameter and recording configuration parameter are revised as matching with Audio communication by frequency configuration management interface Parameter, play parameter is such as revised as real-time audio and starts to play, decoded state parameter is revised as realaudio data and is opened Begin decoding, will recording configuration parameter be updated to the matched state of Audio communication, start the acquisition of realaudio data, together When stop asynchronous audio broadcasting, stop asynchronous audio data downloading and decoding, restore Audio communication.
In the present embodiment, by unified audio configuration management interface, realize logical from asynchronous audio communication to real-time audio The switching of letter, whole process realize seamless switching by audio status management parameters adjust automatically.
In one embodiment, as shown in fig. 6, step S230 and step S240 include:
Step S230a, when present video communications status is that asynchronous audio communicates, audio processing mode is determined as audio text Audio file is written in coded audio data by part mode, and by first network channel transfer, first network channel includes HTTP At least one of protocol channel, Transmission Control Protocol channel.
Specifically, asynchronous audio communication process is complete audio data, and whether detection asynchronous audio recording terminates, such as Fruit terminates, then is encoded complete audio data to obtain coded audio data, and audio file is written in coded audio data It is middle to generate audio data to be sent, the audio file of generation can also be compressed.Audio file is passed through into http protocol channel Or Transmission Control Protocol channel is sent to server.Server produces the corresponding address URL of audio file, the corresponding mesh of audio data Mark communication terminal receives the address URL, can download audio file by the breakpoint transmission of http.
Step S230b, when present video communications status is Audio communication, audio processing mode is determined as real-time sound Audio frame is passed through the second network tunnel transports, the second network by coded audio data assembling at audio frame by frequency frame pattern in real time Channel includes udp protocol channel.
Specifically, when Audio communication, received audio data is subjected to real-time coding, as acquisition time is continuous Coded audio data are generated, so that coded audio data are assembled into audio frame by preset algorithm, it will be continuous with the time The audio frame of generation passes through the second network tunnel transports to server immediately, such as carries out real-time Transmission, server by socket Audio frame is forwarded to target communications terminal in real time, is a kind of data stream type transmission.
In one embodiment, method further includes:Encoded audio data is obtained, the processing of encoded audio data is generated Voice data stream, encoded audio data includes at least one of files-audio data and real-time audio frame data, according to The acquisition network channel of coded audio data obtains matched decoding algorithm from unified decoder module, according to decoding algorithm to audio Data stream obtains original audio data.
Specifically, it if it is Audio communication, then directly receives the audio frame that server is sent in real time and has been compiled Code audio data, encoded audio data is exactly voice data stream, is communicated if it is asynchronous audio, can be by the address URL from clothes Business device is downloaded to obtain audio file, and audio file is carried out the processing acquisition voice data stream such as to decompress.Guarantee different voice communications Audio decoder under state uses unified data stream format, improves different voice communication state subaudio frequency resource managements just Benefit.
If it is Audio communication, then communicating pair has arranged encryption algorithm by communication protocol and corresponding decoding is calculated Method is communicated if it is asynchronous network, and encryption algorithm can be carried in coded data by the preset characters of preset byte, thus from Unified decoder module obtains and the matched decoding algorithm of encryption algorithm.Unified decoder module is the module that can independently extend, and is integrated Optional decoder under different voice communication states realizes unified decoder module by expansible form, for newly-increased Code requirement only need to be realized by expansible interface when needing to add corresponding decoder and add decoder.Pass through system One decoder module is managed collectively and is distributed to the decoder under different voice communication states, and the unification of framework is realized.
In one embodiment, method further includes:Key is communicated by asynchronous audio at Audio communication interface to obtain Asynchronous audio starts to operate, and present video communications status is switched to asynchronous audio communication from Audio communication, in real-time sound Frequency communication link is kept under connection status, is communicated key by asynchronous audio and is obtained asynchronous audio end operation, by present video Communications status from asynchronous audio communication recovery be Audio communication.
Specifically, Audio communication interface can be terminal screen interface, be also possible to exist by virtual reality device The three-dimensional Audio communication space interface that three-dimensional space is formed.Asynchronous audio communication key can be on terminal screen interface Virtual key or physical button are also possible to the three-dimensional key at three-dimensional Audio communication interface shape interface.It can pass through The first predetermined registration operation generates asynchronous audio sign on, and key pressing such as is pressed in asynchronous audio communication, asynchronous audio is communicated key The operation pressed is that asynchronous audio starts to operate, and present video communications status is switched to asynchronous audio from Audio communication and is led to Letter, starting asynchronous audio acquisition can be generated different in the case where Audio communication link keeps connection status by second predetermined registration operation Audio END instruction is walked, when being bounced such as asynchronous audio communication key, asynchronous audio END instruction is generated, then communicates asynchronous audio The operation that key bounces be asynchronous audio end operation, by present video communications status from asynchronous audio communication recovery be real-time sound Frequency communicates.The first predetermined registration operation, second predetermined registration operation can be gesture operation or touch operation etc..As shown in fig. 7, being one three Virtual session voice communication interface is tieed up, it includes Audio communication triggering key 320 and asynchronous audio communication that audio, which is led on interface, Key 330 is triggered, key 320 is triggered by Audio communication and enters Audio communication, it can at Audio communication interface Press asynchronous audio communications triggered key 330 start asynchronous audio acquisition, acquisition complete, then bounce asynchronous audio communications triggered by Key 330 is used as asynchronous audio end operation, and present video communications status is by asynchronous communication completion from asynchronous audio communication recovery Audio communication.
It in the present embodiment, by different interface operations, can be switched over without being sewn between different voice communication states, letter Folk prescription is just.
In one embodiment, audio data processing method is applied to multi-conference scene, and step S230a includes:It will compile Audio file is written in code audio data user information correlation corresponding with the target user in multi-conference, and passes through first network Channel transfer is to server, so that server determines intended recipient terminal according to the user information in the audio file, by sound Frequency file is sent to the target user in multi-conference.
Specifically, multi-conference refers to that there are the scenes of multiple communication parties in session, in Audio communication, wherein one The audio data of a session subscriber can be sent to other session subscribers all in multi-conference.If the first session subscriber merely desires to It is communicated with the second session subscriber, needs to shield other session subscribers, then Audio communication can be switched to asynchronous audio Audio file, user is written in coded audio data user information correlation corresponding with the target user in multi-conference by communication Information can be user identifier etc. for determining target user.Server determines that target is used according to the user information in audio file The corresponding intended recipient terminal in family, is sent to the target user in multi-conference for audio file, since other users cannot be received To asynchronous audio data, asynchronous audio communications status is switched seamlessly in Audio communication to realize, shields other Session subscriber only sends the function of asynchronous audio to target user.And the connection status of Audio communication link is kept, it can be After sending asynchronous audio to target user, fast quick-recovery Audio communication continues real time communication, it can be achieved that in real-time audio meeting The function of secret words is sent in view to target user.
In a specific embodiment, audio data processing method passes through audio-frequency data processing system as shown in Figure 8 Framework is realized, including interface layer 410 and realization layer 420, and wherein interface layer 410 provides exposure external interface, including asynchronous language Sound starts interface 411, asynchronous voice terminates interface 412, asynchronous voice playback interface 413, real-time voice open interface 414, real Shi Yuyin down interface 415.
Realize that layer 420 includes audio configuration management 421, codec 422, schema management 423 and network is uniformly processed Module 424.
Audio configuration management 421 provides to system audio rights management, records management, broadcasting management and audio processing pipe Reason, externally provides unified interface, is easy to use, and the unified mode for using PCM data stream obtains system recording data.Pass through system One audio configuration management interface determines audio status management parameters according to voice communication state switch data, thus according to audio Condition managing parameter adjusts present video state.
Codec 422 provides one group of expansible codec and realizes, is selected according to system requirements, passes through volume Original audio data stream process is coded audio data by code.For the format needed support, it is only necessary to add new volume Decoder.It include ARM codec, MP3 codec and SILK codec in the present embodiment.
It includes mode manager, audio file mode module and real-time audio frame pattern mould that schema management 423, which is uniformly processed, Block.Schema management 423 is uniformly processed, voice communication mode management is provided, controls voice communication state, communicated according to present video State determines matched target audio tupe from optional audio processing mode, according to target audio tupe to acquisition Coded audio data handled, such as audio file mode, the code stream after coding is written to the sound of preset format Sound file is then assembled into the audio frame of preset format for real-time audio frame pattern, so as to subsequent transmission or reading.
Network module 424 provides network and receives and dispatches managerial ability, including tactical management, transmission management, reception management, will be pending It send audio data by target network channel transfer, the upload to asynchronous voice document, downloading and real-time audio frame stream is provided Send and receive processing.
In above-mentioned specific embodiment, integrated system audio resource management is realized, under different voice communication states, is led to It crosses a set of unified code and management is realized to system audio resource, avoid covering codes more while accessing system audio resource causing shape The complex management of state.The acquisition of unified audio data is realized, whether asynchronous voice or real-time voice, all using identical Data acquisition flow, all acquisition raw PCM data stream, are uniformly processed mode manager further according to built-in, flow into PCM data Row processing.Unified coding/decoding model is realized, by built-in a variety of codecs, is united as desired to PCM data stream One coding and decoding, to the data after coding, is written to audio file according to demand or is assembled into speech frame, enter subsequent Process flow.Coding/decoding module is one group of module that can independently extend, and for newly-increased encoding and decoding demand, need to only be realized new Codec.Unified playing flow is realized, what is no matter received is Real-time voice data or asynchronous voice data, It is all first processed into the form of audio data stream, further according to mode manager, corresponding decoding algorithm is determined, voice data is flowed into Decoded data are submitted to system plays device and carry out unified broadcasting by row decoding, to avoid playing race problem.
In one embodiment, as shown in figure 9, providing a kind of audio-frequency data processing device, including:
Module 510 is obtained, for obtaining voice data stream by unified interface, obtains the corresponding current sound of voice data stream Frequency communications status.
Unified coding module 520, for acquisition and the matched target code algorithm of present video communications status, using target Encryption algorithm encodes voice data stream to obtain coded audio data.
Module 530 is uniformly processed, including:
Packaged unit 531, for determining matched mesh from optional audio processing mode according to present video communications status Audio processing mode is marked, optional audio processing mode includes audio file mode and real-time audio frame pattern, according to target audio Tupe handles coded audio data to obtain audio data to be sent.
Transmission unit 532, for determining target network channel from optional network channel according to present video communications status, Audio data to be sent is passed through into target network channel transfer.
In one embodiment, as shown in Figure 10, device further includes:
Voice communication state switching module 540, for detecting voice communication state, when detecting the presence of voice communication state When switching, determine that audio status management is joined according to voice communication state switch data by unified audio configuration management interface Number adjusts present video state according to audio status management parameters.
In one embodiment, as shown in figure 11, voice communication state switching module 540 includes:
Switch asynchronous unit 541 in real time, for leading to when voice communication state is switched to asynchronous audio by Audio communication When letter, holding Audio communication link is connection status, modifies play parameter, solution by unified audio configuration management interface Play parameter is determined as real-time audio and stops playing, decoded state parameter is determined by code state parameter and recording configuration parameter Stop decoding for realaudio data, is updated to the recording configuration parameter to communicate matched state with asynchronous audio, according to Play parameter stops having decoded the broadcasting of realaudio data, stops the solution of realaudio data according to the decoded state parameter The realaudio data to be decoded of receipt of subsequent is abandoned, starts asynchronous announcement frequency according to updated recording configuration parameter by code According to acquisition.
In one embodiment, as shown in figure 12, voice communication state switching module 540 includes:
Asynchronised handover Real time capable module 542, for leading to when voice communication state is switched to real-time audio by asynchronous audio communication When letter, Audio communication link be connection status when, by unified audio configuration management interface by play parameter, decoding State parameter and recording configuration parameter are revised as the parameter to match with Audio communication, are joined according to the audio status management Number adjustment present video state, restores Audio communication.
In one embodiment, as shown in figure 13, module 530, which is uniformly processed, includes:
Asynchronous audio processing unit 533 is used for when present video communications status is that asynchronous audio communicates, audio processing mould Formula is determined as audio file mode, audio file is written in coded audio data, and pass through first network channel transfer, the first net Network channel includes at least one of http protocol channel, Transmission Control Protocol channel.
Real time audio processing unit 534 is used for when present video communications status is Audio communication, audio processing mould Formula is determined as real-time audio frame pattern, by coded audio data assembling at audio frame, audio frame is passed through to the second network in real time and is led to Road transmission, the second network channel includes udp protocol channel.
In one embodiment, as shown in figure 14, device further includes:
The processing of encoded audio data is generated audio number for obtaining encoded audio data by unified decoder module 550 According to stream, encoded audio data includes at least one of files-audio data and real-time audio frame data, according to encoded sound The acquisition network channel of frequency evidence obtains matched decoding algorithm, is decoded to obtain original sound to voice data stream according to decoding algorithm Frequency evidence.
In one embodiment, as shown in figure 15, device further includes:
Interface operation module 560 obtains asynchronous announcement for communicating key by asynchronous audio at Audio communication interface Frequency starts to operate, and present video communications status is switched to asynchronous audio communication from Audio communication, in Audio communication Link is kept under connection status, is communicated key by asynchronous audio and is obtained asynchronous audio end operation, present video is communicated shape State from asynchronous audio communication recovery be Audio communication.
In one embodiment, device is applied to multi-conference scene, and asynchronous audio processing unit 533 is also used to encode Audio file is written in audio data user information correlation corresponding with the target user in multi-conference, and logical by first network Road is transmitted to server, so that server determines intended recipient terminal according to the user information in the audio file, by audio File is sent to the target user in multi-conference.
In one embodiment, a kind of computer equipment, including memory and processor are provided, is stored in memory Computer-readable instruction, when computer-readable instruction is executed by processor, so that processor executes following steps:By uniformly connecing Mouth obtains voice data stream, obtains the corresponding present video communications status of voice data stream, obtains and works as from Unified coding module The preceding matched target code algorithm of voice communication state, encodes voice data stream using target code algorithm Audio data executes following steps by the way that module is uniformly processed:According to present video communications status from optional audio processing mode The middle matched target audio tupe of determination, optional audio processing mode include audio file mode and real-time audio frame mould Formula is handled to obtain audio data to be sent, according to current according to target audio tupe to the coded audio data Voice communication state determines target network channel from optional network channel, and audio data to be sent is passed through target network channel Transmission.
In one embodiment, computer-readable instruction makes processor execution receive voice data stream by unified interface Before, following steps are also executed:Voice communication state is detected, when detecting the presence of the switching of voice communication state, passes through unification Audio configuration management interface audio status management parameters are determined according to voice communication state switch data, according to audio status pipe It manages parameter and adjusts present video state.
In one embodiment, voice communication state is detected, when detecting the presence of the switching of voice communication state, passes through system One audio configuration management interface determines audio status management parameters according to voice communication state switch data, including:Work as audio When communications status is switched to asynchronous audio communication by Audio communication, holding Audio communication link is connection status, is led to Unified audio configuration management interface modification play parameter, decoded state parameter and recording configuration parameter are crossed, the broadcasting is joined Number is determined as real-time audio and stops playing, and the decoded state parameter is determined as realaudio data and stops decoding, will be described Recording configuration parameter is updated to communicate matched state with asynchronous audio.
Present video state is adjusted according to the audio status management parameters, including:Stopped according to the play parameter The broadcasting for decoding realaudio data stops the decoding of realaudio data according to the decoded state parameter, by receipt of subsequent Realaudio data to be decoded abandon, the acquisitions of asynchronous audio data is started according to updated recording configuration parameter.
In one embodiment, computer-readable instruction executes processor according to the audio status management parameters tune After whole present video state, following steps are also executed:When voice communication state is switched to real-time audio by asynchronous audio communication When communication, Audio communication link be connection status when, by unified audio configuration management interface by play parameter, solution Code state parameter and recording configuration parameter are revised as the parameter to match with Audio communication, according to audio status management parameters Present video state is adjusted, Audio communication is restored.
In one embodiment, according to the present video communications status, determination is matched from optional audio processing mode Target audio tupe, the optional audio processing mode includes audio file mode and real-time audio frame pattern, according to institute It states target audio tupe the coded audio data are handled to obtain audio data to be sent, according to the current sound Frequency communications status determines target network channel from optional network channel, and the audio data to be sent is passed through the target network Network channel transfer, including:When present video communications status is that asynchronous audio communicates, audio processing mode is determined as audio file Audio file is written in coded audio data by mode, and by first network channel transfer, first network channel includes HTTP association At least one of channel, Transmission Control Protocol channel are discussed, when present video communications status is Audio communication, audio processing mould Formula is determined as real-time audio frame pattern, by coded audio data assembling at audio frame, audio frame is passed through to the second network in real time and is led to Road transmission, the second network channel includes udp protocol channel.
In one embodiment, computer-readable instruction makes processor also execute following steps:Obtain encoded audio Data, by encoded audio data processing generate voice data stream, encoded audio data include files-audio data and in real time At least one of audio frame number evidence obtains matching from unified decoder module according to the acquisition network channel of encoded audio data Decoding algorithm, voice data stream is decoded according to decoding algorithm to obtain original audio data.
In one embodiment, computer-readable instruction makes processor also execute following steps:In Audio communication Interface, which by asynchronous audio communicates key and obtains asynchronous audio, to be started to operate, by present video communications status from Audio communication It is switched to asynchronous audio communication, in the case where Audio communication link keeps connection status, key is communicated by asynchronous audio and is obtained Asynchronous audio end operation, by present video communications status from asynchronous audio communication recovery be Audio communication.
In one embodiment, it is applied to multi-conference scene, when the Current Communications Status is that asynchronous audio communicates, Audio processing mode is determined as audio file mode, audio file is written in the coded audio data, and pass through first network Channel transfer, including:Audio is written into coded audio data user information correlation corresponding with the target user in multi-conference File, and by first network channel transfer to server, so that server determines mesh according to the user information in audio file Tag splice receives terminal, target user audio file being sent in multi-conference.
In one embodiment, a kind of computer readable storage medium is provided, is stored on computer readable storage medium Computer executable instructions, when computer executable instructions are executed by processor, so that processor executes following steps:Pass through system One interface obtains voice data stream, obtains the corresponding present video communications status of voice data stream, obtains from Unified coding module With the matched target code algorithm of present video communications status, voice data stream is encoded to obtain using target code algorithm Coded audio data execute following steps by the way that module is uniformly processed:According to present video communications status from optional audio processing Matched target audio tupe is determined in mode, optional audio processing mode includes audio file mode and real-time audio frame Mode is handled to obtain audio data to be sent according to target audio tupe to the coded audio data, according to working as Preceding voice communication state determines target network channel from optional network channel, and audio data to be sent is led to by target network Road transmission.
In one embodiment, computer-readable instruction makes processor execution receive voice data stream by unified interface Before, following steps are also executed:Voice communication state is detected, when detecting the presence of the switching of voice communication state, passes through unification Audio configuration management interface audio status management parameters are determined according to voice communication state switch data, according to audio status pipe It manages parameter and adjusts present video state.
In one embodiment, voice communication state is detected, when detecting the presence of the switching of voice communication state, passes through system One audio configuration management interface determines audio status management parameters according to voice communication state switch data, including:Work as audio When communications status is switched to asynchronous audio communication by Audio communication, holding Audio communication link is connection status, is led to Unified audio configuration management interface modification play parameter, decoded state parameter and recording configuration parameter are crossed, the broadcasting is joined Number is determined as real-time audio and stops playing, and the decoded state parameter is determined as realaudio data and stops decoding, will be described Recording configuration parameter is updated to communicate matched state with asynchronous audio.
Present video state is adjusted according to the audio status management parameters, including:Stopped according to the play parameter The broadcasting for decoding realaudio data stops the decoding of realaudio data according to the decoded state parameter, by receipt of subsequent Realaudio data to be decoded abandon, the acquisitions of asynchronous audio data is started according to updated recording configuration parameter.
In one embodiment, computer-readable instruction executes processor according to the audio status management parameters tune After whole present video state, following steps are also executed:When voice communication state is switched to real-time audio by asynchronous audio communication When communication, Audio communication link be connection status when, by unified audio configuration management interface by play parameter, solution Code state parameter and recording configuration parameter are revised as the parameter to match with Audio communication, according to audio status management parameters Present video state is adjusted, Audio communication is restored.
In one embodiment, according to the present video communications status, determination is matched from optional audio processing mode Target audio tupe, the optional audio processing mode includes audio file mode and real-time audio frame pattern, according to institute It states target audio tupe the coded audio data are handled to obtain audio data to be sent, according to the current sound Frequency communications status determines target network channel from optional network channel, and the audio data to be sent is passed through the target network Network channel transfer, including:When present video communications status is that asynchronous audio communicates, audio processing mode is determined as audio file Audio file is written in coded audio data by mode, and by first network channel transfer, first network channel includes HTTP association At least one of channel, Transmission Control Protocol channel are discussed, when present video communications status is Audio communication, audio processing mould Formula is determined as real-time audio frame pattern, by coded audio data assembling at audio frame, audio frame is passed through to the second network in real time and is led to Road transmission, the second network channel includes udp protocol channel.
In one embodiment, computer-readable instruction makes processor also execute following steps:Obtain encoded audio Data, by encoded audio data processing generate voice data stream, encoded audio data include files-audio data and in real time At least one of audio frame number evidence obtains matching from unified decoder module according to the acquisition network channel of encoded audio data Decoding algorithm, voice data stream is decoded according to decoding algorithm to obtain original audio data.
In one embodiment, computer-readable instruction makes processor also execute following steps:In Audio communication Interface, which by asynchronous audio communicates key and obtains asynchronous audio, to be started to operate, by present video communications status from Audio communication It is switched to asynchronous audio communication, in the case where Audio communication link keeps connection status, key is communicated by asynchronous audio and is obtained Asynchronous audio end operation, by present video communications status from asynchronous audio communication recovery be Audio communication.
In one embodiment, it is applied to multi-conference scene, when the Current Communications Status is that asynchronous audio communicates, Audio processing mode is determined as audio file mode, audio file is written in the coded audio data, and pass through first network Channel transfer, including:Audio is written into coded audio data user information correlation corresponding with the target user in multi-conference File, and by first network channel transfer to server, so that server determines mesh according to the user information in audio file Tag splice receives terminal, target user audio file being sent in multi-conference.
Those of ordinary skill in the art will appreciate that realizing all or part of the process in above-described embodiment method, being can be with Relevant hardware is instructed to complete by computer program, described program can be stored in a computer-readable storage medium In, in the embodiment of the present invention, which be can be stored in the storage medium of computer system, and by the computer system At least one processor executes, and includes the process such as the embodiment of above-mentioned each method with realization.Wherein, the storage medium can be Magnetic disk, CD, read-only memory (Read-Only Memory, ROM) or random access memory (Random Access Memory, RAM) etc..
Each technical characteristic of embodiment described above can be combined arbitrarily, for simplicity of description, not to above-mentioned reality It applies all possible combination of each technical characteristic in example to be all described, as long as however, the combination of these technical characteristics is not deposited In contradiction, all should be considered as described in this specification.
The embodiments described above only express several embodiments of the present invention, and the description thereof is more specific and detailed, but simultaneously It cannot therefore be construed as limiting the scope of the patent.It should be pointed out that coming for those of ordinary skill in the art It says, without departing from the inventive concept of the premise, various modifications and improvements can be made, these belong to protection of the invention Range.Therefore, the scope of protection of the patent of the invention shall be subject to the appended claims.

Claims (18)

1. a kind of audio data processing method, the method includes:
Voice data stream is obtained by unified interface, obtains the corresponding present video communications status of the voice data stream;
From the acquisition of Unified coding module and the matched target code algorithm of the present video communications status, compiled using the target Code algorithm encodes the voice data stream to obtain coded audio data;
Following steps are executed by the way that module is uniformly processed:
Matched target audio tupe, institute are determined from optional audio processing mode according to the present video communications status Stating optional audio processing mode includes audio file mode and real-time audio frame pattern, according to the target audio tupe pair The coded audio data are handled to obtain audio data to be sent;
Target network channel is determined from optional network channel according to the present video communications status, by the audio to be sent Data pass through the target network channel transfer.
2. the method according to claim 1, wherein described the step of receiving voice data stream by unified interface Further include before:
Voice communication state is detected, when detecting the presence of the switching of voice communication state, is connect by unified audio configuration management Mouth determines audio status management parameters according to voice communication state switch data;
Present video state is adjusted according to the audio status management parameters.
3. according to the method described in claim 2, it is characterized in that, the detection voice communication state, when detecting the presence of sound When frequency communications status switches, audio shape is determined according to voice communication state switch data by unified audio configuration management interface The step of state management parameters includes:
When voice communication state is switched to asynchronous audio communication by Audio communication, keeping Audio communication link is to connect State is connect, play parameter, decoded state parameter and recording configuration parameter are modified by unified audio configuration management interface;
The play parameter is determined as real-time audio to stop playing, the decoded state parameter is determined as realaudio data Stop decoding, is updated to the recording configuration parameter to communicate matched state with asynchronous audio;
It is described according to the audio status management parameters adjust present video state the step of include:
Stop having decoded the broadcasting of realaudio data according to the play parameter;
The decoding for stopping realaudio data according to the decoded state parameter, by the realaudio data to be decoded of receipt of subsequent It abandons;
Start the acquisition of asynchronous audio data according to updated recording configuration parameter.
4. according to the method described in claim 3, it is characterized in that, described adjust currently according to the audio status management parameters After the step of audio status, further include:
It is connection in Audio communication link when voice communication state is switched to Audio communication by asynchronous audio communication When state, play parameter, decoded state parameter and recording configuration parameter are revised as by unified audio configuration management interface The parameter to match with Audio communication;
Present video state is adjusted according to the audio status management parameters, restores Audio communication.
5. the method according to claim 1, wherein it is described according to the present video communications status from optional sound Determine matched target audio tupe in frequency tupe, the optional audio processing mode include audio file mode and Real-time audio frame pattern is handled to obtain sound to be sent according to the target audio tupe to the coded audio data Frequency evidence determines target network channel according to the present video communications status from optional network channel, will be described to be sent Audio data includes by the step of target network channel transfer:
When the present video communications status is that asynchronous audio communicates, audio processing mode is determined as audio file mode, will Audio file is written in the coded audio data, and by first network channel transfer, the first network channel includes HTTP At least one of protocol channel, Transmission Control Protocol channel;
When the present video communications status is Audio communication, audio processing mode is determined as real-time audio frame pattern, By the coded audio data assembling at audio frame, the audio frame is passed through into the second network tunnel transports in real time, described second Network channel includes udp protocol channel.
6. the method according to claim 1, wherein the method also includes:
Encoded audio data is obtained, the encoded audio data processing is generated into voice data stream, the encoded audio Data include at least one of files-audio data and real-time audio frame data;
Matched decoding algorithm is obtained from unified decoder module according to the acquisition network channel of the encoded audio data, according to The decoding algorithm decodes to obtain original audio data to the voice data stream.
7. the method according to claim 1, wherein the method also includes:
Key acquisition asynchronous audio is communicated by asynchronous audio at Audio communication interface to start to operate, and present video is communicated State is switched to asynchronous audio communication from Audio communication;
In the case where Audio communication link keeps connection status, communicating key acquisition asynchronous audio by asynchronous audio terminates to grasp Make, by present video communications status from asynchronous audio communication recovery be Audio communication.
8. according to the method described in claim 5, it is characterized in that, the method be applied to multi-conference scene, it is described to work as institute State Current Communications Status be asynchronous audio communication when, audio processing mode is determined as audio file mode, by the coded audio Audio file is written in data, and includes by the step of first network channel transfer:
Audio file is written into coded audio data user information correlation corresponding with the target user in multi-conference, and By first network channel transfer to server, so that the server determines mesh according to the user information in the audio file Tag splice receives terminal, and the audio file is sent to the target user in the multi-conference.
9. a kind of audio-frequency data processing device, which is characterized in that described device includes:
Module is obtained, for obtaining voice data stream by unified interface, obtains the corresponding present video of the voice data stream Communications status;
Unified coding module, for acquisition and the matched target code algorithm of the present video communications status, using the mesh Mark encryption algorithm encodes the voice data stream to obtain coded audio data;
Module is uniformly processed, including:
Packaged unit, for determining matched target sound from optional audio processing mode according to the present video communications status Frequency tupe, the optional audio processing mode includes audio file mode and real-time audio frame pattern, according to the target Audio processing mode is handled to obtain audio data to be sent to the coded audio data;
Transmission unit will for determining target network channel from optional network channel according to the present video communications status The audio data to be sent passes through the target network channel transfer.
10. device according to claim 9, which is characterized in that described device further includes:
Voice communication state switching module, for detecting voice communication state, when detecting the presence of the switching of voice communication state, Audio status management parameters are determined according to voice communication state switch data by unified audio configuration management interface, according to institute State audio status management parameters adjustment present video state.
11. device according to claim 10, which is characterized in that the voice communication state switching module includes:
Switch asynchronous unit in real time, for protecting when voice communication state is switched to asynchronous audio communication by Audio communication Holding Audio communication link is connection status, modifies play parameter, decoded state by unified audio configuration management interface The play parameter is determined as real-time audio and stops playing by parameter and recording configuration parameter, and the decoded state parameter is true It is set to realaudio data and stops decoding, is updated to the recording configuration parameter to communicate matched state, root with asynchronous audio The broadcasting for stopping having decoded realaudio data according to the play parameter stops real-time audio number according to the decoded state parameter According to decoding, the realaudio data to be decoded of receipt of subsequent is abandoned, is started according to updated recording configuration parameter asynchronous The acquisition of audio data.
12. device according to claim 11, which is characterized in that the voice communication state switching module includes:
Asynchronised handover Real time capable module, for when voice communication state by asynchronous audio communication be switched to Audio communication when, When Audio communication link is connection status, play parameter, decoded state are joined by unified audio configuration management interface Number and recording configuration parameter are revised as the parameter to match with Audio communication, are adjusted according to the audio status management parameters Present video state restores Audio communication.
13. device according to claim 9, which is characterized in that the module that is uniformly processed includes:
Asynchronous audio processing unit is used for when the present video communications status is that asynchronous audio communicates, audio processing mode It is determined as audio file mode, audio file is written into the coded audio data, and by first network channel transfer, it is described First network channel includes at least one of http protocol channel, Transmission Control Protocol channel;
Real time audio processing unit is used for when the present video communications status is Audio communication, audio processing mode It is determined as real-time audio frame pattern, by the coded audio data assembling at audio frame, the audio frame is passed through second in real time Network tunnel transports, second network channel include udp protocol channel.
14. device according to claim 9, which is characterized in that described device further includes:
The encoded audio data processing is generated audio data for obtaining encoded audio data by unified decoder module Stream, the encoded audio data include at least one of files-audio data and real-time audio frame data, according to it is described The acquisition network channel of coded audio data obtains matched decoding algorithm, according to the decoding algorithm to the voice data stream Decoding obtains original audio data.
15. device according to claim 9, which is characterized in that described device further includes:
Interface operation module starts to grasp for communicating key acquisition asynchronous audio by asynchronous audio at Audio communication interface Make, present video communications status is switched to asynchronous audio communication from Audio communication, is kept in Audio communication link Under connection status, key is communicated by asynchronous audio and obtains asynchronous audio end operation, by present video communications status from asynchronous Voice communication reverts to Audio communication.
16. device according to claim 13, which is characterized in that described device is applied to multi-conference scene, described different Step audio treatment unit is also used to close coded audio data user information corresponding with the target user in multi-conference Connection write-in audio file, and by first network channel transfer to server, so that the server is according to the audio file In user information determine intended recipient terminal, the audio file is sent to the target user in the multi-conference.
17. a kind of computer equipment, which is characterized in that including memory and processor, store computer in the memory Readable instruction, when the computer-readable instruction is executed by the processor, so that the processor perform claim requires 1 to 8 Any one of the method the step of.
18. a kind of computer readable storage medium, which is characterized in that be stored with computer on the computer readable storage medium Executable instruction, when the computer executable instructions are executed by processor, so that the processor perform claim requires 1 to 8 Any one of the method the step of.
CN201710386977.5A 2017-05-26 2017-05-26 Audio data processing method and device, computer equipment and computer readable storage medium Active CN108932948B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201710386977.5A CN108932948B (en) 2017-05-26 2017-05-26 Audio data processing method and device, computer equipment and computer readable storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201710386977.5A CN108932948B (en) 2017-05-26 2017-05-26 Audio data processing method and device, computer equipment and computer readable storage medium

Publications (2)

Publication Number Publication Date
CN108932948A true CN108932948A (en) 2018-12-04
CN108932948B CN108932948B (en) 2021-12-14

Family

ID=64451598

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201710386977.5A Active CN108932948B (en) 2017-05-26 2017-05-26 Audio data processing method and device, computer equipment and computer readable storage medium

Country Status (1)

Country Link
CN (1) CN108932948B (en)

Cited By (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110865619A (en) * 2019-11-26 2020-03-06 国核自仪系统工程有限公司 DCS system signal flow configuration module
CN112581993A (en) * 2020-12-22 2021-03-30 北京字节跳动网络技术有限公司 Audio recording method and device, readable medium and electronic equipment
CN113488065A (en) * 2021-07-01 2021-10-08 上海卓易科技股份有限公司 Audio output method and device based on cloud mobile phone, computer equipment and storage medium
CN114095451A (en) * 2021-11-17 2022-02-25 腾讯科技(深圳)有限公司 Data processing method and device and computer readable storage medium
CN114205633A (en) * 2020-08-31 2022-03-18 腾讯科技(深圳)有限公司 Live broadcast interaction method and device, storage medium and electronic equipment
CN114520687A (en) * 2022-02-17 2022-05-20 深圳震有科技股份有限公司 Audio data processing method, device and equipment applied to satellite system
CN114846810A (en) * 2019-12-31 2022-08-02 华为技术有限公司 Communication method and device
WO2023000894A1 (en) * 2021-07-21 2023-01-26 腾讯科技(深圳)有限公司 Data transmission method and apparatus, and server, storage medium and program product

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1599353A (en) * 2003-09-19 2005-03-23 华为技术有限公司 Telephone station system based on network interconnection protocat and its talking method
US20070130117A1 (en) * 1999-05-25 2007-06-07 Silverbrook Research Pty Ltd Method of Providing Information via a Printed Substrate with Every Interaction
CN101687547A (en) * 2007-07-05 2010-03-31 空中客车运作有限责任公司 System and method for transmitting audio data
WO2011084966A1 (en) * 2010-01-11 2011-07-14 Alcatel-Lucent Usa Inc. SINGLE CHANNEL EVRCx, ISLP AND G. 711 TRANSCODING IN PACKET NETWORKS
CN104142868A (en) * 2013-05-10 2014-11-12 腾讯科技(深圳)有限公司 Connection establishment method and device

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20070130117A1 (en) * 1999-05-25 2007-06-07 Silverbrook Research Pty Ltd Method of Providing Information via a Printed Substrate with Every Interaction
CN1599353A (en) * 2003-09-19 2005-03-23 华为技术有限公司 Telephone station system based on network interconnection protocat and its talking method
CN101687547A (en) * 2007-07-05 2010-03-31 空中客车运作有限责任公司 System and method for transmitting audio data
WO2011084966A1 (en) * 2010-01-11 2011-07-14 Alcatel-Lucent Usa Inc. SINGLE CHANNEL EVRCx, ISLP AND G. 711 TRANSCODING IN PACKET NETWORKS
CN104142868A (en) * 2013-05-10 2014-11-12 腾讯科技(深圳)有限公司 Connection establishment method and device

Cited By (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110865619A (en) * 2019-11-26 2020-03-06 国核自仪系统工程有限公司 DCS system signal flow configuration module
CN114846810A (en) * 2019-12-31 2022-08-02 华为技术有限公司 Communication method and device
CN114846810B (en) * 2019-12-31 2023-08-22 华为技术有限公司 Communication method, device and storage medium
CN114205633B (en) * 2020-08-31 2024-03-08 腾讯科技(深圳)有限公司 Live interaction method and device, storage medium and electronic equipment
CN114205633A (en) * 2020-08-31 2022-03-18 腾讯科技(深圳)有限公司 Live broadcast interaction method and device, storage medium and electronic equipment
CN112581993A (en) * 2020-12-22 2021-03-30 北京字节跳动网络技术有限公司 Audio recording method and device, readable medium and electronic equipment
CN113488065A (en) * 2021-07-01 2021-10-08 上海卓易科技股份有限公司 Audio output method and device based on cloud mobile phone, computer equipment and storage medium
CN113488065B (en) * 2021-07-01 2024-05-14 上海卓易科技股份有限公司 Audio output method and device based on cloud mobile phone, computer equipment and storage medium
WO2023000894A1 (en) * 2021-07-21 2023-01-26 腾讯科技(深圳)有限公司 Data transmission method and apparatus, and server, storage medium and program product
CN114095451A (en) * 2021-11-17 2022-02-25 腾讯科技(深圳)有限公司 Data processing method and device and computer readable storage medium
CN114095451B (en) * 2021-11-17 2024-05-21 腾讯科技(深圳)有限公司 Data processing method, device and computer readable storage medium
CN114520687B (en) * 2022-02-17 2023-11-03 深圳震有科技股份有限公司 Audio data processing method, device and equipment applied to satellite system
CN114520687A (en) * 2022-02-17 2022-05-20 深圳震有科技股份有限公司 Audio data processing method, device and equipment applied to satellite system

Also Published As

Publication number Publication date
CN108932948B (en) 2021-12-14

Similar Documents

Publication Publication Date Title
CN108932948A (en) Audio data processing method, device, computer equipment and computer readable storage medium
CN100466718C (en) Mixed-media telecommunication call set-up
CN101583009B (en) Video terminal and method thereof for realizing interface content sharing
CN109257646A (en) Method for processing video frequency, device, electronic equipment and computer-readable medium
CN109495761A (en) Video switching method and device
CN108881820B (en) A kind of acquisition methods and device of monitoring data
CN104837057B (en) Video file broadcasting method, device and system
CN108093197A (en) For the method, system and machine readable media of Information Sharing
CN108040264A (en) A kind of speaker sound control method and equipment for TV programme channel selection
CN108958762A (en) A kind of upgrade method and device of software
CN201238327Y (en) Amalgamation type network media telephone terminal
CN110650255A (en) Method and device for editing color ring back tone, color ring back tone editing unit and storage medium
CN107566168A (en) Remote configuring method, equipment configuration method and remote configuration facility method
CN113473395B (en) Message processing method, device, medium and electronic equipment
CN107547517A (en) Audio/video program method for recording and the network equipment and computer installation
CN105939392A (en) Wireless adaption control terminal, system thereof and control method
CN107124706A (en) The method of forward call, apparatus and system between a kind of mobile phone
CN107809409A (en) A kind of method and device of the transmission of speech data, reception and interaction
CN101925203A (en) Mobile terminal
CN114024787B (en) Remote control method, device, equipment and storage medium for smart home
CN108989737A (en) A kind of data playing method, device and electronic equipment
CN100461878C (en) Method for realizing media gateway control protocol playback
CN110366118A (en) A kind of radio station, application program and the method for realizing radio station function
CN101621847B (en) Method and terminal for acquiring 3G radio resource management information
CN102246502A (en) Multimedia provision service

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant