CN106128468A - Audio communication method and device - Google Patents

Audio communication method and device Download PDF

Info

Publication number
CN106128468A
CN106128468A CN201610539161.7A CN201610539161A CN106128468A CN 106128468 A CN106128468 A CN 106128468A CN 201610539161 A CN201610539161 A CN 201610539161A CN 106128468 A CN106128468 A CN 106128468A
Authority
CN
China
Prior art keywords
information
voice call
vocoded
terminal
server
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201610539161.7A
Other languages
Chinese (zh)
Other versions
CN106128468B (en
Inventor
卢林
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Tencent Technology Shenzhen Co Ltd
Original Assignee
Tencent Technology Shenzhen Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Tencent Technology Shenzhen Co Ltd filed Critical Tencent Technology Shenzhen Co Ltd
Priority to CN201610539161.7A priority Critical patent/CN106128468B/en
Publication of CN106128468A publication Critical patent/CN106128468A/en
Priority to PCT/CN2017/087317 priority patent/WO2018006678A1/en
Application granted granted Critical
Publication of CN106128468B publication Critical patent/CN106128468B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L51/00User-to-user messaging in packet-switching networks, transmitted according to store-and-forward or real-time protocols, e.g. e-mail
    • H04L51/07User-to-user messaging in packet-switching networks, transmitted according to store-and-forward or real-time protocols, e.g. e-mail characterised by the inclusion of specific contents
    • H04L51/10Multimedia information
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/0018Speech coding using phonetic or linguistical decoding of the source; Reconstruction using text-to-speech synthesis
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/16Vocoder architecture
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/16Vocoder architecture
    • G10L19/173Transcoding, i.e. converting between two coded representations avoiding cascaded coding-decoding

Abstract

The invention discloses a kind of audio communication method and device, belong to voice call technical field.Described method includes: receives the voice call request that calling terminal sends, carries the mark of terminal called in voice call request;Obtain the first vocoded information of calling terminal, and, the second vocoded information of terminal called;Receive calling terminal or the voice call information of terminal called transmission;According to the first vocoded information and the second vocoded information, voice call information is converted to the voice call information that another terminal is supported;The voice call information after conversion that sends is to another terminal;Solve in prior art when new type of coding occurs, transcoding could be realized after only voice call client being updated, the problem that flexibility ratio is poor;Reach background server and can directly carry out transcoding according to the vocoded information of call ends, and without voice call client is updated, improve the effect of flexibility ratio.

Description

Audio communication method and device
Technical field
The present embodiments relate to voice call technical field, particularly to a kind of audio communication method and device.
Background technology
Voice call has become as a kind of talking mode conventional during people's communication exchange.
Existing a kind of audio communication method includes: voice call client obtains the vocoded information of terminal called; Receive voice call information;If this voice call information is the voice call information from local terminal, then according to the language of terminal called This voice call information is converted to the voice call information that terminal called can be supported by sound coding information, and by the language after conversion Sound call-information sends to terminal called;If voice call information is the voice call information from terminal called, then by this language Sound call-information is converted to the voice call information that local terminal can be supported.
Inventor, during realizing the embodiment of the present invention, finds that prior art at least there is problems in that
In said method, voice call client needs to perform voice transcoding, and when new speech-encoded format occurs, In order to ensure voice call client can normal transcoding, need this voice call client is updated, flexibility ratio is poor.
Summary of the invention
The problem poor in order to solve flexibility ratio in prior art, embodiments provides a kind of audio communication method And device.Described technical scheme is as follows:
First aspect, it is provided that a kind of audio communication method, described method includes:
Receive the voice call request that calling terminal sends, institute's voice call request carries the mark of terminal called Know;
Obtain the first vocoded information of described calling terminal, and, the second voice coding letter of described terminal called Breath;
Receive described calling terminal or the voice call information of described terminal called transmission;
According to described first vocoded information and described second vocoded information, described voice call information is changed The voice call information supported by another terminal;
The described voice call information after conversion that sends is to another terminal described.
Second aspect, it is provided that a kind of voice call device, described method includes:
Receiver module, for receiving the voice call request that calling terminal sends, carries in institute's voice call request The mark of terminal called;
Acquisition module, for obtaining the first vocoded information of described calling terminal, and, the of described terminal called Two vocoded information;
Described receiver module, is additionally operable to receive the voice call letter of described calling terminal or the transmission of described terminal called Breath;
Modular converter, for according to described first vocoded information and described second vocoded information, by institute's predicate Sound call-information is converted to the voice call information that another terminal is supported;
Sending module, the described voice call information after sending the conversion of described modular converter is to another terminal described.
The beneficial effect that the technical scheme that the embodiment of the present invention provides is brought includes:
Background server, by after receiving the voice call request that calling terminal sends, obtains the of calling terminal One vocoded information and the second vocoded information of terminal called, and then receive calling terminal or called thereafter During the voice call information that terminal sends, according to this first vocoded information and the second vocoded information by this voice call Information is converted to the voice call information that another terminal is supported, the voice call information after conversion that sends is to another terminal;Solve In prior art of having determined when new type of coding occurs, after only voice call client being updated, transcoding could be realized, The problem that flexibility ratio is poor;Reach background server and can directly carry out transcoding according to the vocoded information of call ends, And without voice call client is updated, improve the effect of flexibility ratio.
Accompanying drawing explanation
For the technical scheme being illustrated more clearly that in the embodiment of the present invention, in embodiment being described below required for make Accompanying drawing be briefly described, it should be apparent that, below describe in accompanying drawing be only some embodiments of the present invention, for From the point of view of those of ordinary skill in the art, on the premise of not paying creative work, it is also possible to obtain other according to these accompanying drawings Accompanying drawing.
Fig. 1 is the schematic diagram of the implementation environment involved by each embodiment of the present invention;
Fig. 2 is the flow chart of the audio communication method that one embodiment of the invention provides;
Fig. 3 is the flow chart of the audio communication method that another embodiment of the present invention provides;
Fig. 4 A is the flow chart of the audio communication method that another embodiment of the present invention provides;
Fig. 4 B is the schematic diagram of the audio communication method that another embodiment of the present invention provides;
Fig. 4 C is another flow chart of the audio communication method that another embodiment of the present invention provides;
Fig. 4 D is the schematic diagram that the target terminal that another embodiment of the present invention provides updates vocoded information;
Fig. 5 is the structural representation of the voice call device that one embodiment of the invention provides;
Fig. 6 is the structural representation of the server that one embodiment of the invention provides.
Detailed description of the invention
For making the object, technical solutions and advantages of the present invention clearer, below in conjunction with accompanying drawing to embodiment party of the present invention Formula is described in further detail.
Refer to Fig. 1, it illustrates the implementation environment involved by audio communication method that each embodiment of the present invention provides Schematic diagram.As it is shown in figure 1, this implementation environment includes calling terminal 110, background server 120, telephone operator 130 and quilt It is terminal 140.
Calling terminal 110 can be the terminal possessing voice call ability, such as, can be mobile phone.Actual when realizing, main Crying and be provided with voice call client 111 in terminal 110, calling terminal 110 can be initiated by this voice call client 111 And the voice call between terminal called 140.Optionally, voice call client can pass through VoIP (Voice over Internet Protocol, the networking telephone) initiate the voice call between terminal called 140.Wherein, calling terminal 110 can To be connected with background server 120 by wireless network.
Background server 120 is used to voice call client 111 and provides the background server of service.This background service Device 120 can be connected with telephone operator 130 by wired or wireless network.Actual when realizing, this background server 120 can Think a station server, it is also possible to for the server cluster being made up of multiple servers.
As a example by background server 120 is as server cluster, this background server 120 can include RTP (Real-time Transport Protocol, RTP) server, transcoding server and call server.RTP server for Telephone operator 130 communicates, and transcoding server is for carrying out transcoding to voice call information, and call server is used for receiving caller The calling of terminal 110 is also initiated terminal called 130 to telephone operator 130.Optionally, background server 120 is all right Including other servers, this is not limited by the present embodiment.
Telephone operator 130 can be movement, UNICOM, telecommunications or other operators.
Terminal called 140 can also be the terminal possessing voice call ability, such as, can be mobile phone.It is actual when realizing, Terminal called 140 can be provided with voice call client, it is also possible to be not installed with voice call client, the present embodiment pair This does not limit.Further, this terminal called 140 can be PSTN (Public Switched Telephone Network, PSTN) in terminal.
Refer to Fig. 2, it illustrates the method flow diagram of the audio communication method that one embodiment of the invention provides, this reality Execute example to illustrate in the background server 120 shown in Fig. 1 with this audio communication method.As in figure 2 it is shown, this voice leads to Words method may include that
Step 201, receives the voice call request that calling terminal sends, carries terminal called in voice call request Mark.
Step 202, obtains the first vocoded information of calling terminal, and, the second voice coding letter of terminal called Breath.
Step 203, receives calling terminal or the voice call information of terminal called transmission.
Step 204, according to the first vocoded information and the second vocoded information, is converted to voice call information separately The voice call information that one terminal is supported.
Step 205, the voice call information after conversion that sends is to another terminal.
In sum, the audio communication method that the present embodiment provides, background server is by sending out receiving calling terminal After the voice call request sent, obtain the first vocoded information and second voice coding of terminal called of calling terminal Information, and then when receiving the voice call information of calling terminal or terminal called transmission thereafter, according to this first voice This voice call information is converted to the voice call information that another terminal is supported by coding information and the second vocoded information, The voice call information after conversion that sends is to another terminal;Solve in prior art when new type of coding occurs, only Transcoding could be realized, the problem that flexibility ratio is poor after voice call client is updated;Having reached background server can root Directly carry out transcoding according to the vocoded information of call ends, and without voice call client is updated, improve flexibly The effect of degree.
Refer to Fig. 3, it illustrates the method flow diagram of the audio communication method that one embodiment of the invention provides, this reality Execute example to illustrate in the background server 120 shown in Fig. 1 with this audio communication method.As it is shown on figure 3, this voice leads to Words method may include that
Step 301, receives the voice call request that calling terminal sends, carries terminal called in voice call request Mark.
Calling terminal is provided with voice call client, when user needs to carry out voice call with other users, uses The voice call request of calling terminal called can be initiated in family by this voice call client in calling terminal.
After voice call client in calling terminal initiates voice call request, background server can be corresponding Receive this voice call request.Wherein, voice call request carries the mark of terminal called.Such as, carry called The cell-phone number of terminal.
Actual when realizing, voice call request can also include the first vocoded information that calling terminal is supported. Wherein, the first vocoded information may include that the type of coding of encoder, or, type of coding and encoder are used Coding parameter.Type of coding can be: silk, g711a, g729a etc., and coding parameter can include sample rate, coding complexity At least one in the transmission interval of degree and transmission adjacent data bag.
Optionally, during due to actual realization, some encoders can't configuration codes parameter, therefore, for this kind of situation, Its vocoded information can only include type of coding.And if encoder configuration codes parameter, the most now, voice coding newly in bag Include type of coding and coding parameter.
Simply illustrate with type of coding and coding parameter respectively foregoing it should be noted that above-mentioned, optionally, Type of coding can also be other types, and coding parameter can also include other guide, and this is not limited by the present embodiment.
Step 302, extracts the first vocoded information carried in voice call request.
Step 303, according to the mark of the terminal called in voice call request, at the operator corresponding to terminal called Obtain the second vocoded information.
After background server receives voice call request, the terminal called carried in voice call request can be extracted Mark, determine the operator corresponding to terminal called according to the mark of terminal called, then obtain at the operator determined Second vocoded information of terminal called.Optionally, background server can send information acquisition request to operator, reception The second vocoded information that operator returns, this information acquisition request is for the second voice coding of acquisition request terminal called Information.
Such as, terminal called be designated 158616xxx12, then background server may determine that terminal called is mobile using Family, now, background server can send information acquisition request to mobile operator, the second language that reception mobile operator returns Sound coding information.
It should be noted that the vocoded information of each user in same operator can be identical or different, and not Vocoded information with each user in operator can also be identical or different, and this is not limited by the present embodiment.And And, when the vocoded information difference of each user in same operator, the information acquisition request that background server sends In can include the mark of terminal called, after operator receives this information acquisition request, the mark according to terminal called is true Determine the second vocoded information of terminal called, return the second vocoded information determined to background server.
Step 304, receives calling terminal or the voice call information of terminal called transmission.
Calling terminal is with the voice call process of terminal called, if calling terminal sends voice, then background server can The voice call information sent with corresponding voice call client in calling terminal;And if terminal called sends voice, Then voice call information is sent after operator by terminal called, and operator can forward this voice call information to take to backstage Business device, background server receives this voice call information accordingly.
Step 305, according to the first vocoded information and the second vocoded information, is converted to voice call information separately The voice call information that one terminal is supported.
After background server receives this voice call information, this voice call information can be turned by background server It is changed to the voice call information that another terminal is supported.
Such as, if voice call information is the information that calling terminal sends, then background server is by this voice call information Be converted to the voice call information corresponding to the second vocoded information of terminal called;And if voice call information is called end End send information, then background server this voice call information is converted to calling terminal first vocoded information institute right The voice call information answered.
If it should be noted that the first vocoded information is identical with the second vocoded information, then background server Without changing, directly forwarding, the present embodiment does not repeats them here.
Step 306, the voice call information after conversion that sends is to another terminal.
After converted, background server can send the voice call information after conversion to another terminal.
After another terminal receives the voice call information after conversion, another terminal can successfully resolve this voice call Information, it is ensured that being normally carried out of call.
Step 307, in voice call process, receives the coding information updating request that target terminal sends, target terminal For calling terminal or terminal called, coding information updating request carries the vocoded information after renewal.
In voice call process, along with the change of speech path network is it is possible that network delay, network jitter or net The problems such as network packet loss, and in order to avoid this problem, either one in both call sides can automatically update the voice coding letter of self Breath, and send coding information updating request to background server.Accordingly, background server can receive target terminal transmission Coding information updating request.
In communication process, both call sides can monitor call tone quality in real time, according between tonequality and vocoded information Corresponding relation, obtain the vocoded information corresponding to current tonequality, if the vocoded information got is different from currently The vocoded information used, then send coding information updating and ask to background server.
Owing to type of coding generally will not change, therefore, actual when realizing, need the vocoded information updated to be Coding parameter.Further, when coding parameter is encoder complexity, tonequality and encoder complexity correlation;When coding ginseng When number is spaced for giving out a contract for a project, tonequality and interval of giving out a contract for a project are in negative correlativing relation;When coding parameter includes sample rate, tonequality and sample rate Correlation.Optionally, a range of tonequality can corresponding identical vocoded information, the present embodiment is to this also Do not limit.
Step 308, updates the vocoded information corresponding to target terminal according to the vocoded information after updating.
After background server receives coding information updating request, update corresponding vocoded information.Hereafter, backstage Server can carry out transcoding according to the vocoded information after updating, and the present embodiment does not repeats them here.
It should be noted is that, step 307 and step 308 are optional step, and actual can execution when realizing can also Do not perform, and, the present embodiment is as a example by performing after step 306, and optionally, it can also be after step 302 Either step perform, the present embodiment does not repeats them here.
Needing explanation on the other hand, after the conversation is over, calling terminal can send end of conversation and instruct to backstage Server, after background server receives end of conversation instruction, the first voice coder of the calling terminal received before deletion Code information and the second vocoded information of terminal called.
In sum, the audio communication method that the present embodiment provides, background server is by sending out receiving calling terminal After the voice call request sent, obtain the first vocoded information and second voice coding of terminal called of calling terminal Information, and then when receiving the voice call information of calling terminal or terminal called transmission thereafter, according to this first voice This voice call information is converted to the voice call information that another terminal is supported by coding information and the second vocoded information, The voice call information after conversion that sends is to another terminal;Solve in prior art when new type of coding occurs, only Transcoding could be realized, the problem that flexibility ratio is poor after voice call client is updated;Having reached background server can root Directly carry out transcoding according to the vocoded information of call ends, and without voice call client is updated, improve flexibly The effect of degree.
Meanwhile, target terminal can update the vocoded information that in transcoding server, it is corresponding so that both call sides After receiving the voice call information of opposite end, all can successfully resolve, it is ensured that call can be normally carried out.
Above-described embodiment is simply used for background server with this audio communication method, and background server is a station server Illustrate.Optionally, this background server can also be for the clothes being made up of RTP server, transcoding server and call server Business device cluster, now, refer to Fig. 4 A, and this audio communication method may include that
Step 401, call server receives the voice call request that calling terminal sends.
After voice call client in calling terminal sends voice call request, call server can connect accordingly Receive this voice call request.Wherein, this voice call request carries calling terminal the first vocoded information and The mark of terminal called.Optionally, voice call client can pass through SIP (Session Initiation Protocol, Session initiation protocol) signaling sends this voice call request.
As shown in Figure 4 B, voice call client can access to call server by signaling.Accordingly, calling service Device receives this voice call request.
Step 402, call server sends the first vocoded information carried in voice call request and services to RTP Device.
Optionally, call server sends the first vocoded information to while RTP server, can send RTP clothes The address of business device is to calling terminal, in order to follow-up calling terminal can send voice call information according to the address of RTP server To this RTP server.
Step 403, RTP server receives the first vocoded information.
Step 404, call server obtains at operator according to the mark of the terminal called carried in voice call request Take the second vocoded information.
This step is similar with the step 303 in above-described embodiment, and the present embodiment does not repeats them here.
Step 405, the second vocoded information is synchronized to RTP server by call server.
Step 406, RTP server receives the second vocoded information.
Step 407, RTP server sends the first vocoded information and the second vocoded information to transcoding server.
After RTP server gets the first vocoded information and the second vocoded information, RTP server can To send this first vocoded information and the second vocoded information to transcoding server.
Simply first obtain the first vocoded information with RTP server it should be noted that above-mentioned, then obtain the second language As a example by sound coding information, optionally, RTP server can also obtain the first voice coding after first obtaining the second vocoded information Information, or, RTP server obtains both simultaneously, and this is not limited by the present embodiment.
Step 408, transcoding server feedback indicator information is to RTP server.
After transcoding server receives the first vocoded information and the second vocoded information, for this first voice coder Code information and the second vocoded information uniquely distribute an identification information, and feed back this identification information to RTP server.Its In, this identification information is for the corresponding relation between unique mark the first vocoded information and the second vocoded information.
Step 409, RTP server receives the identification information of transcoding server feedback.
Step 410, RTP server receives calling terminal or the voice call information of terminal called transmission.
In communication process, calling terminal or terminal called can send voice call information, and accordingly, RTP services Device can receive this voice call information.
Concrete, when calling terminal sends voice call information, the voice call client in calling terminal can be straight Receive and send this voice call information to RTP server.And when terminal called sends voice call information, terminal called can lead to Cross operator to send this voice call information to RTP server.
Step 411, RTP server sends voice call information and identification information to transcoding server.
After RTP server receives voice call information, RTP server can send this voice call information and Identification information is to transcoding server.
Step 412, voice call information, according to identification information, is converted to the language that another terminal is supported by transcoding server Sound call-information.
Step 413, transcoding server sends the voice call information after conversion to RTP server.
Step 414, the voice call information after conversion is sent to another terminal by RTP server.
Step 415, after the conversation is over, RTP server sends end of conversation and instructs to transcoding server, end of conversation Instruction includes identification information.
Step 416, transcoding server deletes the first vocoded information corresponding to identification information and the second voice coding Information.
After transcoding server receives end of conversation instruction, extract the identification information in the instruction of this end of conversation, delete The first vocoded information corresponding to this identification information and the second vocoded information, release storage above-mentioned information time institute The memory space needed.
It addition, similar to the above embodiments, calling terminal or terminal called can ask to update the voice of oneself Coding information, now, refer to Fig. 4 C, and this audio communication method can also comprise the steps:
Step 417, RTP server receives the coding information updating request that target terminal sends.
Optionally, when target terminal is calling terminal, access this coding information of transmission more at calling terminal by signaling Newly requested after call server, call server can forward this coding information updating request to take to RTP server, RTP Business device receives this coding information updating request that call server sends accordingly.And when target terminal is terminal called, quilt Making terminal can send this coding information updating to ask to call server, call server forwards this coding information updating to ask To RTP server, accordingly, RTP server receives this coding information updating request that call server forwards.
Step 418, RTP server forwards coding information updating to ask to transcoding server.
Step 419, transcoding server updates mesh according to the vocoded information after the renewal in coding information updating request The vocoded information of mark terminal.
Refer to Fig. 4 D, it illustrates the schematic diagram of vocoded information renewal process.
In sum, the audio communication method that the present embodiment provides, background server is by sending out receiving calling terminal After the voice call request sent, obtain the first vocoded information and second voice coding of terminal called of calling terminal Information, and then when receiving the voice call information of calling terminal or terminal called transmission thereafter, according to this first voice This voice call information is converted to the voice call information that another terminal is supported by coding information and the second vocoded information, The voice call information after conversion that sends is to another terminal;Solve in prior art when new type of coding occurs, only Transcoding could be realized, the problem that flexibility ratio is poor after voice call client is updated;Having reached background server can root Directly carry out transcoding according to the vocoded information of call ends, and without voice call client is updated, improve flexibly The effect of degree.
Transcoding server, after receiving the first vocoded information and the second vocoded information, distributes one and is used for Represent the identification information of both corresponding relations, feed back this identification information to RTP server so that RTP server receives one end Voice call information after, it is only necessary to voice call information and this identification information are sent and can realize turning to transcoding server Code, and without sending the first vocoded information and the second vocoded information to transcoding server every time, reduce transmission The transfer resource consumed needed for during.
Meanwhile, target terminal can update the vocoded information that in transcoding server, it is corresponding so that both call sides After receiving the voice call information of opposite end, all can successfully resolve, it is ensured that call can be normally carried out.
Refer to Fig. 5, it illustrates the structural representation of the voice call device that one embodiment of the invention provides, such as figure Shown in 5, this voice call device may include that receiver module 510, acquisition module 520, modular converter 530 and sending module 540。
Receiver module 510, for receiving the voice call request that calling terminal sends, carries in institute's voice call request There is the mark of terminal called;
Acquisition module 520, for obtaining the first vocoded information of described calling terminal, and, described terminal called The second vocoded information;
Described receiver module 510, is additionally operable to receive described calling terminal or the voice call of described terminal called transmission Information;
Modular converter 530, for according to described first vocoded information and described second vocoded information, by described Voice call information is converted to the voice call information that another terminal is supported;
Sending module 540, for send described modular converter 530 conversion after described voice call information to the most described another Terminal.
In sum, the voice call device that the present embodiment provides, by exhaling at the voice receiving calling terminal transmission After crying request, obtain the first vocoded information and second vocoded information of terminal called of calling terminal, and then When receiving the voice call information that calling terminal or terminal called send thereafter, according to this first vocoded information and This voice call information is converted to the voice call information that another terminal is supported by the second vocoded information, after sending conversion Voice call information to another terminal;Solve in prior art when new type of coding occurs, only to voice call Client could realize transcoding after updating, the problem that flexibility ratio is poor;Having reached background server can be according to call ends Vocoded information directly carry out transcoding, and without voice call client is updated, improve the effect of flexibility ratio.
The voice call device provided based on above-described embodiment, optionally, described acquisition module 520, it is additionally operable to extract institute Described first vocoded information carried in voice call request.
Optionally, described acquisition module 520, it is additionally operable to the mark according to the described terminal called in institute's voice call request Know, at the operator corresponding to described terminal called, obtain described second vocoded information.
Optionally, described device is in background server, and described background server includes: realtime transmission protocol RTP mould Block and transcoding module;
Described acquisition module 520, is additionally operable to:
Obtain described first vocoded information and described second vocoded information by described RTP module, send Described first vocoded information and described second vocoded information are to described transcoding module;
By described transcoding module feedback identification information to described RTP module, described identification information is for uniquely identifying institute State the corresponding relation between the first vocoded information and described second vocoded information;
Described receiver module 510, is additionally operable to receive described voice call information by described RTP module;
Described transcoding module 530, is additionally operable to:
Described voice call information and described identification information is sent to described transcoding module by described RTP module;
By described transcoding module according to described identification information, described voice call information is converted to another terminal described The voice call information supported.
Optionally, described sending module 540, it is additionally operable to after the conversation is over, sends call by described RTP server END instruction, to described transcoding server, comprises described identification information in the instruction of described end of conversation;
Described device also includes:
Removing module, for deleting described first voice coder corresponding to described identification information by described transcoding server Code information and described second vocoded information.
Optionally, described receiver module 510, it is additionally operable in voice call process, receives the coding that target terminal sends Information updating is asked, and described target terminal is described calling terminal or described terminal called, and described coding information updating is asked In carry the vocoded information after renewal;
Described device also includes:
More new module, for updating the voice corresponding to described target terminal according to the described vocoded information after updating Coding information.
It should be noted that the RTP module in the present embodiment can be formed as RTP server, transcoding module can be formed For transcoding server, this is not limited by the present embodiment.
It should be noted that the voice call device that above-described embodiment provides, only being partitioned into above-mentioned each functional module Row illustrates, and in actual application, can above-mentioned functions distribution be completed by different functional modules as desired, Ji Jiangshe Standby internal structure is divided into different functional modules, to complete all or part of function described above.It addition, above-mentioned reality The embodiment of the method for the voice call device and audio communication method of executing example offer belongs to same design, and it is detailed that it implements process See embodiment of the method, repeat no more here.
Refer to Fig. 6, it illustrates the structural representation of the server that one embodiment of the invention provides.This server is used In implementing the audio communication method of offer in above-described embodiment.Specifically:
Described server 600 includes CPU (CPU) 601, includes random access memory (RAM) 602 and only Read the system storage 604 of memorizer (ROM) 603, and connection system memorizer 604 and the system of CPU 601 Bus 605.Described server 600 also includes the basic input/output transmitting information between each device in help computer System (I/O system) 606, and deposit for storing the Large Copacity of operating system 613, application program 614 and other program modules 615 Storage equipment 607.
Described basic input/output 606 includes the display 608 for showing information and inputs letter for user The input equipment 609 of such as mouse, keyboard etc of breath.Wherein said display 608 and input equipment 609 all pass through to be connected to The IOC 610 of system bus 605 is connected to CPU 601.Described basic input/output 606 Can also include IOC 610 for receive and process from keyboard, mouse or electronic touch pen etc. multiple its The input of his equipment.Similarly, IOC 610 also provides output to display screen, printer or other kinds of defeated Go out equipment.
Described mass-memory unit 607 is by being connected to the bulk memory controller (not shown) of system bus 605 It is connected to CPU 601.Described mass-memory unit 607 and the computer-readable medium being associated thereof are server 600 provide non-volatile memories.It is to say, described mass-memory unit 607 can include such as hard disk or CD-ROM The computer-readable medium (not shown) of driver etc.
Without loss of generality, described computer-readable medium can include computer-readable storage medium and communication media.Computer Storage medium includes for storing the information such as such as computer-readable instruction, data structure, program module or other data Volatibility that any method or technology realize and medium non-volatile, removable and irremovable.Computer-readable storage medium includes RAM, ROM, EPROM, EEPROM, flash memory or its technology of other solid-state storage, CD-ROM, DVD or other optical storage, tape Box, tape, disk storage or other magnetic storage apparatus.Certainly, skilled person will appreciate that described computer-readable storage medium It is not limited to above-mentioned several.Above-mentioned system storage 604 and mass-memory unit 607 may be collectively referred to as memorizer.
According to various embodiments of the present invention, described server 600 can also be connected to by networks such as such as the Internets Remote computer on network runs.Namely server 600 can be by being connected to the network interface on described system bus 605 Unit 611 is connected to network 612, in other words, it is possible to use NIU 611 be connected to other kinds of network or Remote computer system (not shown).
Described memorizer also includes that one or more than one program, one or more than one program are stored in In memorizer, and it is configured to be performed by one or more than one processor.Said one or more than one program comprise For performing the instruction of the method for above-mentioned server side.
It should be appreciated that it is used in the present context, unless exceptional case, singulative " clearly supported in context Individual " (" a ", " an ", " the ") be intended to also include plural form.It is to be further understood that "and/or" used herein is Refer to include arbitrarily and likely combining of or more than one project listed explicitly.
The invention described above embodiment sequence number, just to describing, does not represent the quality of embodiment.
One of ordinary skill in the art will appreciate that all or part of step realizing above-described embodiment can pass through hardware Completing, it is also possible to instruct relevant hardware by program and complete, described program can be stored in a kind of computer-readable In storage medium, storage medium mentioned above can be read only memory, disk or CD etc..
The foregoing is only presently preferred embodiments of the present invention, not in order to limit the present invention, all spirit in the present invention and Within principle, any modification, equivalent substitution and improvement etc. made, should be included within the scope of the present invention.

Claims (12)

1. an audio communication method, it is characterised in that described method includes:
Receive the voice call request that calling terminal sends, institute's voice call request carries the mark of terminal called;
Obtain the first vocoded information of described calling terminal, and, the second vocoded information of described terminal called;
Receive described calling terminal or the voice call information of described terminal called transmission;
According to described first vocoded information and described second vocoded information, described voice call information is converted to separately The voice call information that one terminal is supported;
The described voice call information after conversion that sends is to another terminal described.
Method the most according to claim 1, it is characterised in that the first voice coding letter of the described calling terminal of described acquisition Breath, including:
Extract described first vocoded information carried in institute's voice call request.
Method the most according to claim 1, it is characterised in that the second voice coding letter of the described terminal called of described acquisition Breath, including:
According to the mark of the described terminal called in institute's voice call request, at the operator corresponding to described terminal called Obtain described second vocoded information.
Method the most according to claim 1, it is characterised in that described method is in background server, and described backstage takes Business device includes: realtime transmission protocol RTP server and transcoding server;
First vocoded information of the described calling terminal of described acquisition, and, the second voice coding letter of described terminal called Breath, including:
Obtain described first vocoded information and described second vocoded information by described RTP server, send institute State the first vocoded information and described second vocoded information to described transcoding server;
By described transcoding server feedback indicator information to described RTP server, described identification information is for uniquely identifying institute State the corresponding relation between the first vocoded information and described second vocoded information;
The voice call information that the described calling terminal of described reception or described terminal called send, including:
Described voice call information is received by described RTP server;
Described according to described first vocoded information with described second vocoded information, described voice call information is changed The voice call information supported by another terminal, including:
Described voice call information and described identification information is sent to described transcoding server by described RTP server;
By described transcoding server according to described identification information, described voice call information is converted to another terminal institute described The voice call information supported.
Method the most according to claim 4, it is characterised in that described method also includes:
After the conversation is over, end of conversation instruction is sent to described transcoding server, described call by described RTP server END instruction comprises described identification information;
Described first vocoded information and described second corresponding to described identification information is deleted by described transcoding server Vocoded information.
6. according to the arbitrary described method of claim 1 to 5, it is characterised in that described method also includes:
In voice call process, receiving the coding information updating request that target terminal sends, described target terminal is described master It is terminal or described terminal called, described coding information updating request carries the vocoded information after renewal;
The vocoded information corresponding to described target terminal is updated according to the described vocoded information after updating.
7. a voice call device, it is characterised in that described device includes:
Receiver module, for receiving the voice call request that calling terminal sends, carries called in institute's voice call request The mark of terminal;
Acquisition module, for obtaining the first vocoded information of described calling terminal, and, the second language of described terminal called Sound coding information;
Described receiver module, is additionally operable to receive described calling terminal or the voice call information of described terminal called transmission;
Modular converter, for according to described first vocoded information and described second vocoded information, leads to described voice Words information is converted to the voice call information that another terminal is supported;
Sending module, the described voice call information after sending the conversion of described modular converter is to another terminal described.
Device the most according to claim 7, it is characterised in that
Described acquisition module, is additionally operable to extract described first vocoded information carried in institute's voice call request.
Device the most according to claim 7, it is characterised in that
Described acquisition module, is additionally operable to the mark according to the described terminal called in institute's voice call request, from described called Described second vocoded information is obtained at operator corresponding to terminal.
Device the most according to claim 7, it is characterised in that described device is in background server, and described backstage takes Business device includes: realtime transmission protocol RTP module and transcoding module;
Described acquisition module, is additionally operable to:
Obtain described first vocoded information and described second vocoded information by described RTP module, send described First vocoded information and described second vocoded information are to described transcoding module;
By described transcoding module feedback identification information to described RTP module, described identification information is for unique mark described the Corresponding relation between one vocoded information and described second vocoded information;
Described receiver module, is additionally operable to receive described voice call information by described RTP module;
Described transcoding module, is additionally operable to:
Described voice call information and described identification information is sent to described transcoding module by described RTP module;
By described transcoding module according to described identification information, described voice call information is converted to another terminal described and is propped up The voice call information held.
11. devices according to claim 10, it is characterised in that
Described sending module, is additionally operable to after the conversation is over, sends end of conversation instruction to described by described RTP server Transcoding server, comprises described identification information in the instruction of described end of conversation;
Described device also includes:
Removing module, is believed for being deleted described first voice coding corresponding to described identification information by described transcoding server Breath and described second vocoded information.
12. according to the arbitrary described device of claim 7 to 11, it is characterised in that
Described receiver module, is additionally operable in voice call process, receives the coding information updating request that target terminal sends, institute Stating target terminal is described calling terminal or described terminal called, after carrying renewal in described coding information updating request Vocoded information;
Described device also includes:
More new module, for updating the voice coding corresponding to described target terminal according to the described vocoded information after updating Information.
CN201610539161.7A 2016-07-08 2016-07-08 Voice communication method and device Active CN106128468B (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
CN201610539161.7A CN106128468B (en) 2016-07-08 2016-07-08 Voice communication method and device
PCT/CN2017/087317 WO2018006678A1 (en) 2016-07-08 2017-06-06 Voice call method and apparatus

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201610539161.7A CN106128468B (en) 2016-07-08 2016-07-08 Voice communication method and device

Publications (2)

Publication Number Publication Date
CN106128468A true CN106128468A (en) 2016-11-16
CN106128468B CN106128468B (en) 2021-02-12

Family

ID=57283682

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201610539161.7A Active CN106128468B (en) 2016-07-08 2016-07-08 Voice communication method and device

Country Status (2)

Country Link
CN (1) CN106128468B (en)
WO (1) WO2018006678A1 (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2018006678A1 (en) * 2016-07-08 2018-01-11 腾讯科技(深圳)有限公司 Voice call method and apparatus
CN108986828A (en) * 2018-08-31 2018-12-11 北京中兴高达通信技术有限公司 Call establishment method and device, storage medium, electronic device
CN114760273A (en) * 2022-04-14 2022-07-15 深圳震有科技股份有限公司 Voice forwarding method, system, server and storage medium

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113923065B (en) * 2021-09-06 2023-11-24 贵阳语玩科技有限公司 Cross-version communication method, system, medium and server based on chat room audio

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101547343A (en) * 2009-03-06 2009-09-30 深圳市融创天下科技发展有限公司 System and method for remote video monitoring
JP2011166660A (en) * 2010-02-15 2011-08-25 Nec Access Technica Ltd Voice recording device, voice recording method, and voice recording program
CN103414697A (en) * 2013-07-22 2013-11-27 中国联合网络通信集团有限公司 VOIP self-adaptation speech coding method and system and SIP server
CN103428284A (en) * 2013-08-07 2013-12-04 合肥迈腾信息科技有限公司 Cloud technology based on-board Internet phoning method
CN103916678A (en) * 2012-12-31 2014-07-09 中国移动通信集团广东有限公司 Multimedia data transcoding method, transcoding device and multimedia data play system
CN105374359A (en) * 2014-08-29 2016-03-02 中国电信股份有限公司 Encoding method and system of speech data
CN105491044A (en) * 2015-12-11 2016-04-13 中青冠岳科技(北京)有限公司 Instant voice messaging method and device based on mobile terminal

Family Cites Families (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6529602B1 (en) * 1997-08-19 2003-03-04 Walker Digital, Llc Method and apparatus for the secure storage of audio signals
CN1937663A (en) * 2006-09-30 2007-03-28 华为技术有限公司 Method, system and device for realizing variable voice telephone business
US20080310612A1 (en) * 2007-06-15 2008-12-18 Sony Ericsson Mobile Communications Ab System, method and device supporting delivery of device-specific data objects
CN103581129A (en) * 2012-07-30 2014-02-12 中兴通讯股份有限公司 Conversation processing method and device
CN104125138B (en) * 2013-04-28 2017-07-25 腾讯科技(深圳)有限公司 A kind of speech communication and device, system
CN104580166B (en) * 2014-12-19 2018-08-31 大唐移动通信设备有限公司 A kind of method and apparatus based on the conversion of CSCF media coding formats
CN104994245A (en) * 2015-05-08 2015-10-21 小米科技有限责任公司 Conversation realization method and apparatus
CN106128468B (en) * 2016-07-08 2021-02-12 腾讯科技(深圳)有限公司 Voice communication method and device

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101547343A (en) * 2009-03-06 2009-09-30 深圳市融创天下科技发展有限公司 System and method for remote video monitoring
JP2011166660A (en) * 2010-02-15 2011-08-25 Nec Access Technica Ltd Voice recording device, voice recording method, and voice recording program
CN103916678A (en) * 2012-12-31 2014-07-09 中国移动通信集团广东有限公司 Multimedia data transcoding method, transcoding device and multimedia data play system
CN103414697A (en) * 2013-07-22 2013-11-27 中国联合网络通信集团有限公司 VOIP self-adaptation speech coding method and system and SIP server
CN103428284A (en) * 2013-08-07 2013-12-04 合肥迈腾信息科技有限公司 Cloud technology based on-board Internet phoning method
CN105374359A (en) * 2014-08-29 2016-03-02 中国电信股份有限公司 Encoding method and system of speech data
CN105491044A (en) * 2015-12-11 2016-04-13 中青冠岳科技(北京)有限公司 Instant voice messaging method and device based on mobile terminal

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2018006678A1 (en) * 2016-07-08 2018-01-11 腾讯科技(深圳)有限公司 Voice call method and apparatus
CN108986828A (en) * 2018-08-31 2018-12-11 北京中兴高达通信技术有限公司 Call establishment method and device, storage medium, electronic device
CN114760273A (en) * 2022-04-14 2022-07-15 深圳震有科技股份有限公司 Voice forwarding method, system, server and storage medium

Also Published As

Publication number Publication date
CN106128468B (en) 2021-02-12
WO2018006678A1 (en) 2018-01-11

Similar Documents

Publication Publication Date Title
CN103229495B (en) Improving multipoint conference scalability for co-located participants
US9351308B2 (en) Multi-modal communication priority over wireless networks
CN206807569U (en) Softphone device
US7257090B2 (en) Multi-site teleconferencing system
CN106128468A (en) Audio communication method and device
CN104284132A (en) Video communication method and device
CN104580247A (en) Information synchronization method and information synchronization device based on IMS multi-party calls
CN105682006A (en) Earphone and method for achieving teleconference based on earphones
CN105516176A (en) Call center system, communication connection method and device of call center system
CN100493123C (en) Teleconference system and controlling method
CN103595704B (en) A kind of enterprise communication towards VOIP applies a key method of calling
US8983043B2 (en) Data communication
CN109218542A (en) Method, apparatus and computer readable storage medium for call manager
CN104158989B (en) Fixed telephone roaming system and method
CN103997491A (en) Quantum secret communication telephone subscriber terminal extension gateway system
CN101106611A (en) Voip inter-network switching system based on H323 protocol
CN108605077A (en) For transmitting imminent event automatically by interface to the method for the endpoint for distributing to user and the conversion equipment constructed thus
CN106921615A (en) System, method and the mobile terminal of fixed network number communication are realized in the terminal
CN109379504A (en) A kind of ringing system of car networking
CN107395625B (en) Method for realizing steady-state call redundancy of soft switching equipment
WO2012052705A1 (en) Data communication
CN106453265B (en) IP call scheduling method and system, IPPBX and server
CN103532935A (en) Domain strategy-based P2P (Peer-to-Peer) streaming media transmission control method
CN110012180A (en) A kind of connection method, device, equipment and the medium of voip network phone
CN106303117A (en) The means of communication of IP based network and communication system

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant