Background technology
Media server is used for all medium relevant with audio frequency and video to be handled, and comprises that video and audio frequency RTP data flow to the mutual conversion of looking audio file.Simultaneously, also be responsible for to receive the DTMF input of user by terminal, play service the guiding voice, show dynamic guide picture.Mutual with the user that Session Initiation Protocol that it has and MSML/MOML ability make that it can finish the whole session process under the control of application server APP.
Media control unit (MSCU) is a significant element in the media server, mainly finishes with other entities and carries out capability negotiation, management, the maintenance of resource itself is provided and controls the function that complicated business is finished in other service resources unit.
Media store transmission of audio unit (MSTU-audio) is the service resources unit in the media server, finishes the voice data storage of magnanimity, comprises and realizes the audio file playing function.External network interface is arranged on the media storage unit, can be directly by the external network interface transmitting-receiving on the unit.
Media store transmission of video unit (MSTU-video) is the service resources unit in the media server, finishes the multimedia audio-video storage of magnanimity, comprises realizing the video file playing function.External network interface is arranged on the media storage unit, can be directly by the external network interface transmitting-receiving on the unit.
Now, the use broadcast of media server is very wide.Mainly can reduce audio frequency and video and play, collect the digits and function such as meeting.
From the function of Text To Speech (Text To Speech abbreviates TTS as) is that the text message that will import identifies, and is converted into voice messaging, and voice medium is sent to the user.At field of telecommunications, the application of TTS is special TTS server of configuration substantially at present, specifies TTS that voice are sent to the user by signaling and brings in the once business of finishing.
Fig. 1 is the system configuration schematic diagram according to the realization TTS audio frequency transcoding of correlation technique.As shown in Figure 1, the workflow of this system comprises the steps:
Step 101: terminal is initiated call, activates the business of APP.APP initiates operation flow to media server;
Step 102:APP passes through the SIP signaling to media server request TTS business;
Step 103: media server passes through the SIP signaling to TTS server requests TTS resource, and finishes business function by MRCP agreement control TTS server;
Step 104:TTS server sends medium to terminal
More than be at present typical networking and operation flow.The TTS server uses as the external device of media server.APP just initiates to media server in requested service, media server is judged type of service, when type of service is the TTS application, media server is initiated request to the TTS server again, the application resource, and the behavior of control TTS server, the TTS server sends to medium the terminal in a distant place automatically after receiving signaling.
Above flow process can be finished a basic TTS business.But some problems have appearred in the Application Expansion along with business.Such as, the audio capability collection of TTS server causes service fail with the unmatched problem of media server capability set.Because APP is in media server agreement SDP, media server does not also know whether type of service is TTS, so can consult audio frequency parameter with terminal according to the limit of power of oneself.When APP issues INFO when instruction to media server, media server just can identify the TTS type of service, this moment media server by terminal SDP information to TTS server application resource.If the audio capability scope of TTS server does not satisfy the result that media server negotiates with terminal, cause service fail exactly.Such as: it is the G726 form that media server negotiates code/decode type with terminal, but the TTS server is only supported the audio format of G711.。
At satisfying under the situation of business demand of media server at the audio capability collection of TTS server in the above-mentioned prior art, the problem of terminal access media business packet data failure does not also have effective solution at present.
Summary of the invention
Main purpose of the present invention is to provide a kind of audio frequency code-transferring method, Apparatus and system of realizing from Text To Speech TTS, to solve in the prior art under the situation of business demand that audio capability collection at the TTS server can't satisfy media server the problem of terminal access media business packet data failure.
To achieve these goals, according to an aspect of the present invention, provide a kind of audio frequency code-transferring method of realizing from Text To Speech TTS.
Method according to realization TTS audio frequency transcoding of the present invention comprises: media server receives the access request from application server APP, and the code/decode type collection of definite media server support; Media server receives the TTS service request of APP application, and satisfies the media business packet of this type of service to the application of TTS server according to the TTS type of service; Media server is held consultation according to code/decode type collection and TTS server, obtaining the audio coding decoding type after the negotiation, and is sent to terminal according to the audio coding decoding type after with media business packet transcoding.
Further, media server is held consultation according to code/decode type collection and TTS server, to obtain the audio coding decoding type after the negotiation, and be sent to terminal according to the audio coding decoding type after with media business packet transcoding and comprise: media control unit MSCU sends session initiation protocol SIP signaling to the TTS server, with the audio coding decoding type of negotiation and designated media server and TTS server coupling, the type of coding collection comprises the audio coding decoding type; Voice center crosspoint MRU receives the media business packet that the TTS server returns, and the media business packet carried out transcoding according to the audio coding decoding type of consulting, and the media business packet behind the transcoding is sent and is saved to media store transmission of audio unit MSTU; The media business packet of MSCU control MSTU after with transcoding is sent to terminal.
Further, before the media business packet that voice center crosspoint MRU reception TTS server returns, method also comprises: MSCU and TTS server establish a communications link; TTS server identification text, and be the media business packet with text-converted.
Further, before the media business packet that voice center crosspoint MRU reception TTS server returns, method also comprises: MSCU issues the transcoding order to MRU; The port type that MRU and TTS server are connected is appointed as the audio coding decoding type after the negotiation.
Further, the media business packet of MSCU control MSTU after with transcoding is sent to terminal and comprises: MSCU issues the order of opening the NAT passage to MSTU; The media business packet of MSTU after with transcoding carries out being sent to terminal behind the NAT.
Further, before the access request of media server reception from application server APP, method also comprises: terminal sends the request of multimedia service data bag to APP; APP sends the signaling of access request according to the request of multimedia service data bag to media server, and with the outer port address of MSTU as with the address of terminal interaction.
To achieve these goals, according to another aspect of the present invention, provide a kind of audio frequency trans-coding system of realizing from Text To Speech TTS.
System according to realization TTS audio frequency transcoding of the present invention comprises: terminal; The TTS server; Media server, be used to receive access request from application server APP, with the code/decode type collection of determining that media server is supported, and the TTS service request of reception APP application, to satisfy the media business packet of this type of service to the application of TTS server according to the TTS type of service, hold consultation according to code/decode type collection and TTS server then, obtaining the audio coding decoding type after the negotiation, and be sent to terminal after with media business packet transcoding according to the audio coding decoding type.
Further, media server comprises: media control unit MSCU, be used to send session initiation protocol SIP signaling to the TTS server, with the audio coding decoding type of negotiation and designated media server and TTS server coupling, the type of coding collection comprises the audio coding decoding type; Voice center crosspoint MRU, be used to receive the media business packet that the TTS server returns, and the media business packet carried out transcoding according to the audio coding decoding type of consulting, and the media business packet behind the transcoding is sent and is saved to media store transmission of audio unit MSTU; Wherein, the media business packet of MSCU control MSTU after with transcoding is sent to terminal.
Further, terminal sends the request of multimedia service data bag to APP; APP sends the signaling of access request according to the request of multimedia service data bag to media server, and with the outer port address of MSTU as with the address of terminal interaction.
To achieve these goals, according to another aspect of the present invention, provide a kind of audio frequency transcoding device of realizing from Text To Speech TTS.
Device according to realization TTS audio frequency transcoding of the present invention comprises: first processing module, be used to receive access request from application server APP, and the code/decode type collection of definite media server support; Second processing module is used to receive the TTS service request of APP application, and satisfies the media business packet of this type of service to the application of TTS server according to the TTS type of service; The 3rd processing module is used for holding consultation according to code/decode type collection and TTS server, obtaining the audio coding decoding type after the negotiation, and is sent to terminal according to the audio coding decoding type after with media business packet transcoding.
By the present invention, adopt the access request of media server reception from application server APP, and the code/decode type collection of definite media server support; Media server receives the TTS service request of APP application, and satisfies the media business packet of this type of service to the application of TTS server according to the TTS type of service; Media server is held consultation according to code/decode type collection and TTS server, to obtain the audio coding decoding type after the negotiation, and be sent to terminal after with media business packet transcoding according to the audio coding decoding type, solved in the prior art under the situation of business demand that audio capability collection at the TTS server can't satisfy media server, the problem of terminal access media business packet data failure, and then reached the effect that improves terminal access media business packet data success rate.
Embodiment
In order to make technical problem to be solved by this invention, technical scheme and beneficial effect clearer, clear,, the present invention is further elaborated below in conjunction with drawings and Examples.Should be appreciated that specific embodiment described herein only in order to explanation the present invention, and be not used in qualification the present invention.
The invention provides a kind of system of the TTS of realization audio frequency transcoding.Fig. 2 is that as shown in Figure 2, this system comprises: terminal according to the system configuration schematic diagram of the realization TTS audio frequency transcoding of the embodiment of the invention; The TTS server; Media server, be used to receive access request from application server APP, with the code/decode type collection of determining that media server is supported, and the TTS service request of reception APP application, to satisfy the media business packet of this type of service to the application of TTS server according to the TTS type of service, hold consultation according to code/decode type collection and TTS server then, obtaining the audio coding decoding type after the negotiation, and be sent to terminal after with media business packet transcoding according to the audio coding decoding type.
The foregoing description is implemented in when the TTS server application resource by media server, Session Description Protocol SDP not with the audio coding decoding of terminal as the capability set of consulting, the code/decode format that media server is all supported is as capability set, after in media server, consulting successfully, the TTS server sends to media server inside with medium, media server is by behind the transcoding then, the form that needs according to terminal sends, thereby solved under the situation of business demand that audio capability collection at the TTS server can't satisfy media server, the problem of terminal access media business packet data failure, and then reached the effect that improves terminal access media business packet data success rate.
Fig. 3 is the detailed structure schematic diagram according to media server in embodiment illustrated in fig. 2.As shown in Figure 3, media server in the above embodiments of the present application can comprise: media control unit MSCU, be used to send session initiation protocol SIP signaling to the TTS server, with the audio coding decoding type of negotiation and designated media server and TTS server coupling, the type of coding collection comprises the audio coding decoding type; Voice center crosspoint MRU, be used to receive the media business packet that the TTS server returns, and the media business packet carried out transcoding according to the audio coding decoding type of consulting, and the media business packet behind the transcoding is sent and is saved to media store transmission of audio unit MSTU; Wherein, the media business packet of MSCU control MSTU after with transcoding is sent to terminal.
Preferably, in the foregoing description, terminal can send the request of multimedia service data bag to APP; APP sends the signaling of access request according to the request of multimedia service data bag to media server, and with the outer port address of MSTU as with the address of terminal interaction.
Concrete, as shown in Figure 2, the detailed operation flow process of this system comprises the steps:
Step S10, terminal is to APP request multimedia service data bag, APP sends the INVITE signaling to media server and carries out media negotiation, media server is by the selected code/decode type of the capability set of self, and with the outer port address of MSTU as with the address of terminal interaction.
Step S20, the APP server sends the INFO request to media server, and the content among the INFO is application TTS business, simultaneously, transfers after the media server identification services type to TTS server application media business packet data;
Step S30, media server and TTS server are held consultation, and control TTS server to carry out text-converted be voice.As shown in Figure 3, this step S30 specifically can comprise the steps:
Step S301, media control unit MSCU consult code/decode type to TTS server initiation session initiation protocol SIP signaling.The audio coding decoding capability set that consult this moment in the INVITE signaling is what media server had, is all code/decode types that MRU supports, and requires the TTS server that the medium bag is sent to media server, and received by MRU.
Step S302, MSCU issues the order of opening the NAT passage to MSTU, and indication will send to terminal (user side) from the data that MRU receives.
Step S303, MSCU issues the transcoding order to MRU.Specify MRU to receive the medium bag that sends over from TTS, and MRU is appointed as the result that negotiations obtains from step S301 with the port audio coding decoding type that TTS connects, and the medium code/decode type exported of MRU is set to the terminal actual needs and receives code/decode type.
Step S304, MSCU sets up the TCP/IP link with TTS.And send instruction to the TTS server by the MRCP agreement, indication TTS server identification text, and the media business packet after will change sends to MRU and holds.
Step S305, TTS server send to the media business packet receiving port of MRU.
Step S306, MRU will carry out transcoding from the medium that the TTS termination is received, and the audio frequency media behind the transcoding is sent to media store transmission of audio unit MSTU receiving port;
After step S307, MSTU receive the audio pack of MRU, directly audio pack is carried out sending to terminal behind the NAT.
By above several steps, terminal just can receive the audio stream that is by text-converted.
At last, media server reports the INFO execution result to APP, and APP sends the BYE signaling to media server simultaneously, discharges resource.Discharge resource at media server to the TTS server requests, the success back is to the APP return results, this moment end of conversation.
Fig. 4 is the method flow diagram according to the realization TTS audio frequency transcoding of the embodiment of the invention.As shown in Figure 4, the method for this realization TTS audio frequency transcoding comprises the steps:
Step S41, media server receives the access request from application server APP, and the code/decode type collection of definite media server support;
Step S43, media server receives the TTS service request of APP application, and satisfies the media business packet of this type of service to the application of TTS server according to the TTS type of service;
Step S45, media server is held consultation according to code/decode type collection and TTS server, obtaining the audio coding decoding type after the negotiation, and is sent to terminal according to the audio coding decoding type after with media business packet transcoding.
In the foregoing description, media server is by to TTS server application resource the time, the SDP among this embodiment not with the audio coding decoding of terminal as the capability set of consulting, the code/decode formats that media server is all supported are as capability set.After waiting to consult successfully, the TTS server sends to media server inside with medium, media server is by behind the transcoding then, the form that needs according to terminal sends, thereby solved under the situation of business demand that audio capability collection at the TTS server can't satisfy media server, the problem of terminal access media business packet data failure, and then reached the effect that improves terminal access media business packet data success rate.
In the foregoing description, step S45 media services are held consultation according to code/decode type collection and TTS server, to obtain the audio coding decoding type after the negotiation, and can comprise according to the step that the audio coding decoding type is sent to terminal after with media business packet transcoding: media control unit MSCU sends session initiation protocol SIP signaling to the TTS server, with the audio coding decoding type of negotiation and designated media server and TTS server coupling, the type of coding collection comprises the audio coding decoding type; Voice center crosspoint MRU receives the media business packet that the TTS server returns, and the media business packet carried out transcoding according to the audio coding decoding type of consulting, and the media business packet behind the transcoding is sent and is saved to media store transmission of audio unit MSTU; The media business packet of MSCU control MSTU after with transcoding is sent to terminal.
Preferably, in the foregoing description, before the media business packet that voice center crosspoint MRU reception TTS server returns, method also comprises: MSCU and TTS server establish a communications link; TTS server identification text, and be the media business packet with text-converted.
Preferably, in the foregoing description, before the media business packet that voice center crosspoint MRU reception TTS server returns, method also comprises: MSCU issues the transcoding order to MRU; The port type that MRU and TTS server are connected is appointed as the audio coding decoding type after the negotiation.
Among above-mentioned each embodiment of the present invention, the step that the media business packet of MSCU control MSTU after with transcoding is sent to terminal can comprise: MSCU issues the order of opening the NAT passage to MSTU; The media business packet of MSTU after with transcoding carries out being sent to terminal behind the NAT.
Preferably, before the access request of media server reception from application server APP, method also comprises: terminal sends the request of multimedia service data bag to APP; APP sends the signaling of access request according to the request of multimedia service data bag to media server, and with the outer port address of MSTU as with the address of terminal interaction.
The present invention also provides a kind of device of the TTS of realization audio frequency transcoding.Fig. 5 is the apparatus structure schematic diagram according to the realization TTS audio frequency transcoding of the embodiment of the invention, and as shown in Figure 5, the device of this realization TTS audio frequency transcoding comprises: first processing module 101, second processing module 103 and the 3rd processing module 105.
Wherein, first processing module 101 is used to receive the access request from application server APP, and the code/decode type collection of definite media server support; Second processing module 103 is used to receive the TTS service request of APP application, and satisfies the media business packet of this type of service to the application of TTS server according to the TTS type of service; The 3rd processing module 105 is used for holding consultation according to code/decode type collection and TTS server, obtaining the audio coding decoding type after the negotiation, and is sent to terminal according to the audio coding decoding type after with media business packet transcoding.
This device can be a kind of media server, this media server to TTS server application resource the time, SDP not with the audio coding decoding of terminal as the capability set of consulting, the code/decode formats that media server is all supported are as capability set.After waiting to consult successfully, the TTS server sends to media server inside with medium, and media server is by behind the transcoding then, and the form that needs according to terminal sends.
Need to prove, the embodiment of the invention can be carried out in the computer system such as a set of computer-executable instructions in the step shown in the flow chart of accompanying drawing, and, though there is shown logical order in flow process, but in some cases, can carry out step shown or that describe with the order that is different from herein.
From above embodiment described, as can be seen, the present invention had realized following technique effect: the effect that improves terminal access media business packet data success rate.
Obviously, those skilled in the art should be understood that, above-mentioned each module of the present invention or each step can realize with the general calculation device, they can concentrate on the single calculation element, perhaps be distributed on the network that a plurality of calculation element forms, alternatively, they can be realized with the executable program code of calculation element, thereby, they can be stored in the storage device and carry out by calculation element, perhaps they are made into a plurality of integrated circuit modules respectively, perhaps a plurality of modules in them or step are made into the single integrated circuit module and realize.Like this, the present invention is not restricted to any specific hardware and software combination.
Above-mentioned explanation illustrates and has described a preferred embodiment of the present invention, but as previously mentioned, be to be understood that the present invention is not limited to the disclosed form of this paper, should not regard eliminating as to other embodiment, and can be used for various other combinations, modification and environment, and can in invention contemplated scope described herein, change by the technology or the knowledge of above-mentioned instruction or association area.And change that those skilled in the art carried out and variation do not break away from the spirit and scope of the present invention, then all should be in the protection range of claims of the present invention.