CN102231734A - Method, device and system for realizing audio transcoding of TTS (Text To Speech) - Google Patents

Method, device and system for realizing audio transcoding of TTS (Text To Speech) Download PDF

Info

Publication number
CN102231734A
CN102231734A CN2011101697033A CN201110169703A CN102231734A CN 102231734 A CN102231734 A CN 102231734A CN 2011101697033 A CN2011101697033 A CN 2011101697033A CN 201110169703 A CN201110169703 A CN 201110169703A CN 102231734 A CN102231734 A CN 102231734A
Authority
CN
China
Prior art keywords
server
tts
media
type
business packet
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN2011101697033A
Other languages
Chinese (zh)
Other versions
CN102231734B (en
Inventor
张闽
张伟
刘澍
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
ZTE Corp
Original Assignee
ZTE Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by ZTE Corp filed Critical ZTE Corp
Priority to CN201110169703.3A priority Critical patent/CN102231734B/en
Publication of CN102231734A publication Critical patent/CN102231734A/en
Priority to PCT/CN2012/072860 priority patent/WO2012174908A1/en
Application granted granted Critical
Publication of CN102231734B publication Critical patent/CN102231734B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M3/00Automatic or semi-automatic exchanges
    • H04M3/42Systems providing special services or facilities to subscribers
    • H04M3/42391Systems providing special services or facilities to subscribers where the subscribers are hearing-impaired persons, e.g. telephone devices for the deaf
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M2201/00Electronic components, circuits, software, systems or apparatus used in telephone systems
    • H04M2201/40Electronic components, circuits, software, systems or apparatus used in telephone systems using speech recognition

Landscapes

  • Engineering & Computer Science (AREA)
  • Signal Processing (AREA)
  • Telephonic Communication Services (AREA)

Abstract

The invention discloses a method, device and system for realizing the audio transcoding of TTS (Text To Speech), wherein the method comprises the following steps: a media server receives an access request from an application (APP) server and determines a coding and decoding type set which is supported by the media server; the media server receives a TTS service request which is applied by the APP and applies a media service data packet which meets the service type to a TTS server according to the service type of the TTS; and the media server consults with the TTS server according to the coding and decoding type set so as to obtain the consulted audio coding and decoding type, transcodes the media service data packet according to the audio coding and decoding type and then sends the transcoded media service data packet to a terminal. According to the invention, the success rate of accessing the data of the media service data packet by the terminal can be improved.

Description

Realization is from audio frequency code-transferring method, the Apparatus and system of Text To Speech TTS
Technical field
The present invention relates to the communications field, particularly, relate in particular to a kind of audio frequency code-transferring method, Apparatus and system of realizing from Text To Speech TTS.
Background technology
Media server is used for all medium relevant with audio frequency and video to be handled, and comprises that video and audio frequency RTP data flow to the mutual conversion of looking audio file.Simultaneously, also be responsible for to receive the DTMF input of user by terminal, play service the guiding voice, show dynamic guide picture.Mutual with the user that Session Initiation Protocol that it has and MSML/MOML ability make that it can finish the whole session process under the control of application server APP.
Media control unit (MSCU) is a significant element in the media server, mainly finishes with other entities and carries out capability negotiation, management, the maintenance of resource itself is provided and controls the function that complicated business is finished in other service resources unit.
Media store transmission of audio unit (MSTU-audio) is the service resources unit in the media server, finishes the voice data storage of magnanimity, comprises and realizes the audio file playing function.External network interface is arranged on the media storage unit, can be directly by the external network interface transmitting-receiving on the unit.
Media store transmission of video unit (MSTU-video) is the service resources unit in the media server, finishes the multimedia audio-video storage of magnanimity, comprises realizing the video file playing function.External network interface is arranged on the media storage unit, can be directly by the external network interface transmitting-receiving on the unit.
Now, the use broadcast of media server is very wide.Mainly can reduce audio frequency and video and play, collect the digits and function such as meeting.
From the function of Text To Speech (Text To Speech abbreviates TTS as) is that the text message that will import identifies, and is converted into voice messaging, and voice medium is sent to the user.At field of telecommunications, the application of TTS is special TTS server of configuration substantially at present, specifies TTS that voice are sent to the user by signaling and brings in the once business of finishing.
Fig. 1 is the system configuration schematic diagram according to the realization TTS audio frequency transcoding of correlation technique.As shown in Figure 1, the workflow of this system comprises the steps:
Step 101: terminal is initiated call, activates the business of APP.APP initiates operation flow to media server;
Step 102:APP passes through the SIP signaling to media server request TTS business;
Step 103: media server passes through the SIP signaling to TTS server requests TTS resource, and finishes business function by MRCP agreement control TTS server;
Step 104:TTS server sends medium to terminal
More than be at present typical networking and operation flow.The TTS server uses as the external device of media server.APP just initiates to media server in requested service, media server is judged type of service, when type of service is the TTS application, media server is initiated request to the TTS server again, the application resource, and the behavior of control TTS server, the TTS server sends to medium the terminal in a distant place automatically after receiving signaling.
Above flow process can be finished a basic TTS business.But some problems have appearred in the Application Expansion along with business.Such as, the audio capability collection of TTS server causes service fail with the unmatched problem of media server capability set.Because APP is in media server agreement SDP, media server does not also know whether type of service is TTS, so can consult audio frequency parameter with terminal according to the limit of power of oneself.When APP issues INFO when instruction to media server, media server just can identify the TTS type of service, this moment media server by terminal SDP information to TTS server application resource.If the audio capability scope of TTS server does not satisfy the result that media server negotiates with terminal, cause service fail exactly.Such as: it is the G726 form that media server negotiates code/decode type with terminal, but the TTS server is only supported the audio format of G711.。
At satisfying under the situation of business demand of media server at the audio capability collection of TTS server in the above-mentioned prior art, the problem of terminal access media business packet data failure does not also have effective solution at present.
Summary of the invention
Main purpose of the present invention is to provide a kind of audio frequency code-transferring method, Apparatus and system of realizing from Text To Speech TTS, to solve in the prior art under the situation of business demand that audio capability collection at the TTS server can't satisfy media server the problem of terminal access media business packet data failure.
To achieve these goals, according to an aspect of the present invention, provide a kind of audio frequency code-transferring method of realizing from Text To Speech TTS.
Method according to realization TTS audio frequency transcoding of the present invention comprises: media server receives the access request from application server APP, and the code/decode type collection of definite media server support; Media server receives the TTS service request of APP application, and satisfies the media business packet of this type of service to the application of TTS server according to the TTS type of service; Media server is held consultation according to code/decode type collection and TTS server, obtaining the audio coding decoding type after the negotiation, and is sent to terminal according to the audio coding decoding type after with media business packet transcoding.
Further, media server is held consultation according to code/decode type collection and TTS server, to obtain the audio coding decoding type after the negotiation, and be sent to terminal according to the audio coding decoding type after with media business packet transcoding and comprise: media control unit MSCU sends session initiation protocol SIP signaling to the TTS server, with the audio coding decoding type of negotiation and designated media server and TTS server coupling, the type of coding collection comprises the audio coding decoding type; Voice center crosspoint MRU receives the media business packet that the TTS server returns, and the media business packet carried out transcoding according to the audio coding decoding type of consulting, and the media business packet behind the transcoding is sent and is saved to media store transmission of audio unit MSTU; The media business packet of MSCU control MSTU after with transcoding is sent to terminal.
Further, before the media business packet that voice center crosspoint MRU reception TTS server returns, method also comprises: MSCU and TTS server establish a communications link; TTS server identification text, and be the media business packet with text-converted.
Further, before the media business packet that voice center crosspoint MRU reception TTS server returns, method also comprises: MSCU issues the transcoding order to MRU; The port type that MRU and TTS server are connected is appointed as the audio coding decoding type after the negotiation.
Further, the media business packet of MSCU control MSTU after with transcoding is sent to terminal and comprises: MSCU issues the order of opening the NAT passage to MSTU; The media business packet of MSTU after with transcoding carries out being sent to terminal behind the NAT.
Further, before the access request of media server reception from application server APP, method also comprises: terminal sends the request of multimedia service data bag to APP; APP sends the signaling of access request according to the request of multimedia service data bag to media server, and with the outer port address of MSTU as with the address of terminal interaction.
To achieve these goals, according to another aspect of the present invention, provide a kind of audio frequency trans-coding system of realizing from Text To Speech TTS.
System according to realization TTS audio frequency transcoding of the present invention comprises: terminal; The TTS server; Media server, be used to receive access request from application server APP, with the code/decode type collection of determining that media server is supported, and the TTS service request of reception APP application, to satisfy the media business packet of this type of service to the application of TTS server according to the TTS type of service, hold consultation according to code/decode type collection and TTS server then, obtaining the audio coding decoding type after the negotiation, and be sent to terminal after with media business packet transcoding according to the audio coding decoding type.
Further, media server comprises: media control unit MSCU, be used to send session initiation protocol SIP signaling to the TTS server, with the audio coding decoding type of negotiation and designated media server and TTS server coupling, the type of coding collection comprises the audio coding decoding type; Voice center crosspoint MRU, be used to receive the media business packet that the TTS server returns, and the media business packet carried out transcoding according to the audio coding decoding type of consulting, and the media business packet behind the transcoding is sent and is saved to media store transmission of audio unit MSTU; Wherein, the media business packet of MSCU control MSTU after with transcoding is sent to terminal.
Further, terminal sends the request of multimedia service data bag to APP; APP sends the signaling of access request according to the request of multimedia service data bag to media server, and with the outer port address of MSTU as with the address of terminal interaction.
To achieve these goals, according to another aspect of the present invention, provide a kind of audio frequency transcoding device of realizing from Text To Speech TTS.
Device according to realization TTS audio frequency transcoding of the present invention comprises: first processing module, be used to receive access request from application server APP, and the code/decode type collection of definite media server support; Second processing module is used to receive the TTS service request of APP application, and satisfies the media business packet of this type of service to the application of TTS server according to the TTS type of service; The 3rd processing module is used for holding consultation according to code/decode type collection and TTS server, obtaining the audio coding decoding type after the negotiation, and is sent to terminal according to the audio coding decoding type after with media business packet transcoding.
By the present invention, adopt the access request of media server reception from application server APP, and the code/decode type collection of definite media server support; Media server receives the TTS service request of APP application, and satisfies the media business packet of this type of service to the application of TTS server according to the TTS type of service; Media server is held consultation according to code/decode type collection and TTS server, to obtain the audio coding decoding type after the negotiation, and be sent to terminal after with media business packet transcoding according to the audio coding decoding type, solved in the prior art under the situation of business demand that audio capability collection at the TTS server can't satisfy media server, the problem of terminal access media business packet data failure, and then reached the effect that improves terminal access media business packet data success rate.
Description of drawings
Accompanying drawing described herein is used to provide further understanding of the present invention, constitutes a part of the present invention, and illustrative examples of the present invention and explanation thereof are used to explain the present invention, does not constitute improper qualification of the present invention.In the accompanying drawings:
Fig. 1 is the system configuration schematic diagram according to the realization TTS audio frequency transcoding of correlation technique;
Fig. 2 is the system configuration schematic diagram according to the realization TTS audio frequency transcoding of the embodiment of the invention;
Fig. 3 is the detailed structure schematic diagram according to media server in embodiment illustrated in fig. 2;
Fig. 4 is the method flow diagram according to the realization TTS audio frequency transcoding of the embodiment of the invention; And
Fig. 5 is the apparatus structure schematic diagram according to the realization TTS audio frequency transcoding of the embodiment of the invention.
Embodiment
In order to make technical problem to be solved by this invention, technical scheme and beneficial effect clearer, clear,, the present invention is further elaborated below in conjunction with drawings and Examples.Should be appreciated that specific embodiment described herein only in order to explanation the present invention, and be not used in qualification the present invention.
The invention provides a kind of system of the TTS of realization audio frequency transcoding.Fig. 2 is that as shown in Figure 2, this system comprises: terminal according to the system configuration schematic diagram of the realization TTS audio frequency transcoding of the embodiment of the invention; The TTS server; Media server, be used to receive access request from application server APP, with the code/decode type collection of determining that media server is supported, and the TTS service request of reception APP application, to satisfy the media business packet of this type of service to the application of TTS server according to the TTS type of service, hold consultation according to code/decode type collection and TTS server then, obtaining the audio coding decoding type after the negotiation, and be sent to terminal after with media business packet transcoding according to the audio coding decoding type.
The foregoing description is implemented in when the TTS server application resource by media server, Session Description Protocol SDP not with the audio coding decoding of terminal as the capability set of consulting, the code/decode format that media server is all supported is as capability set, after in media server, consulting successfully, the TTS server sends to media server inside with medium, media server is by behind the transcoding then, the form that needs according to terminal sends, thereby solved under the situation of business demand that audio capability collection at the TTS server can't satisfy media server, the problem of terminal access media business packet data failure, and then reached the effect that improves terminal access media business packet data success rate.
Fig. 3 is the detailed structure schematic diagram according to media server in embodiment illustrated in fig. 2.As shown in Figure 3, media server in the above embodiments of the present application can comprise: media control unit MSCU, be used to send session initiation protocol SIP signaling to the TTS server, with the audio coding decoding type of negotiation and designated media server and TTS server coupling, the type of coding collection comprises the audio coding decoding type; Voice center crosspoint MRU, be used to receive the media business packet that the TTS server returns, and the media business packet carried out transcoding according to the audio coding decoding type of consulting, and the media business packet behind the transcoding is sent and is saved to media store transmission of audio unit MSTU; Wherein, the media business packet of MSCU control MSTU after with transcoding is sent to terminal.
Preferably, in the foregoing description, terminal can send the request of multimedia service data bag to APP; APP sends the signaling of access request according to the request of multimedia service data bag to media server, and with the outer port address of MSTU as with the address of terminal interaction.
Concrete, as shown in Figure 2, the detailed operation flow process of this system comprises the steps:
Step S10, terminal is to APP request multimedia service data bag, APP sends the INVITE signaling to media server and carries out media negotiation, media server is by the selected code/decode type of the capability set of self, and with the outer port address of MSTU as with the address of terminal interaction.
Step S20, the APP server sends the INFO request to media server, and the content among the INFO is application TTS business, simultaneously, transfers after the media server identification services type to TTS server application media business packet data;
Step S30, media server and TTS server are held consultation, and control TTS server to carry out text-converted be voice.As shown in Figure 3, this step S30 specifically can comprise the steps:
Step S301, media control unit MSCU consult code/decode type to TTS server initiation session initiation protocol SIP signaling.The audio coding decoding capability set that consult this moment in the INVITE signaling is what media server had, is all code/decode types that MRU supports, and requires the TTS server that the medium bag is sent to media server, and received by MRU.
Step S302, MSCU issues the order of opening the NAT passage to MSTU, and indication will send to terminal (user side) from the data that MRU receives.
Step S303, MSCU issues the transcoding order to MRU.Specify MRU to receive the medium bag that sends over from TTS, and MRU is appointed as the result that negotiations obtains from step S301 with the port audio coding decoding type that TTS connects, and the medium code/decode type exported of MRU is set to the terminal actual needs and receives code/decode type.
Step S304, MSCU sets up the TCP/IP link with TTS.And send instruction to the TTS server by the MRCP agreement, indication TTS server identification text, and the media business packet after will change sends to MRU and holds.
Step S305, TTS server send to the media business packet receiving port of MRU.
Step S306, MRU will carry out transcoding from the medium that the TTS termination is received, and the audio frequency media behind the transcoding is sent to media store transmission of audio unit MSTU receiving port;
After step S307, MSTU receive the audio pack of MRU, directly audio pack is carried out sending to terminal behind the NAT.
By above several steps, terminal just can receive the audio stream that is by text-converted.
At last, media server reports the INFO execution result to APP, and APP sends the BYE signaling to media server simultaneously, discharges resource.Discharge resource at media server to the TTS server requests, the success back is to the APP return results, this moment end of conversation.
Fig. 4 is the method flow diagram according to the realization TTS audio frequency transcoding of the embodiment of the invention.As shown in Figure 4, the method for this realization TTS audio frequency transcoding comprises the steps:
Step S41, media server receives the access request from application server APP, and the code/decode type collection of definite media server support;
Step S43, media server receives the TTS service request of APP application, and satisfies the media business packet of this type of service to the application of TTS server according to the TTS type of service;
Step S45, media server is held consultation according to code/decode type collection and TTS server, obtaining the audio coding decoding type after the negotiation, and is sent to terminal according to the audio coding decoding type after with media business packet transcoding.
In the foregoing description, media server is by to TTS server application resource the time, the SDP among this embodiment not with the audio coding decoding of terminal as the capability set of consulting, the code/decode formats that media server is all supported are as capability set.After waiting to consult successfully, the TTS server sends to media server inside with medium, media server is by behind the transcoding then, the form that needs according to terminal sends, thereby solved under the situation of business demand that audio capability collection at the TTS server can't satisfy media server, the problem of terminal access media business packet data failure, and then reached the effect that improves terminal access media business packet data success rate.
In the foregoing description, step S45 media services are held consultation according to code/decode type collection and TTS server, to obtain the audio coding decoding type after the negotiation, and can comprise according to the step that the audio coding decoding type is sent to terminal after with media business packet transcoding: media control unit MSCU sends session initiation protocol SIP signaling to the TTS server, with the audio coding decoding type of negotiation and designated media server and TTS server coupling, the type of coding collection comprises the audio coding decoding type; Voice center crosspoint MRU receives the media business packet that the TTS server returns, and the media business packet carried out transcoding according to the audio coding decoding type of consulting, and the media business packet behind the transcoding is sent and is saved to media store transmission of audio unit MSTU; The media business packet of MSCU control MSTU after with transcoding is sent to terminal.
Preferably, in the foregoing description, before the media business packet that voice center crosspoint MRU reception TTS server returns, method also comprises: MSCU and TTS server establish a communications link; TTS server identification text, and be the media business packet with text-converted.
Preferably, in the foregoing description, before the media business packet that voice center crosspoint MRU reception TTS server returns, method also comprises: MSCU issues the transcoding order to MRU; The port type that MRU and TTS server are connected is appointed as the audio coding decoding type after the negotiation.
Among above-mentioned each embodiment of the present invention, the step that the media business packet of MSCU control MSTU after with transcoding is sent to terminal can comprise: MSCU issues the order of opening the NAT passage to MSTU; The media business packet of MSTU after with transcoding carries out being sent to terminal behind the NAT.
Preferably, before the access request of media server reception from application server APP, method also comprises: terminal sends the request of multimedia service data bag to APP; APP sends the signaling of access request according to the request of multimedia service data bag to media server, and with the outer port address of MSTU as with the address of terminal interaction.
The present invention also provides a kind of device of the TTS of realization audio frequency transcoding.Fig. 5 is the apparatus structure schematic diagram according to the realization TTS audio frequency transcoding of the embodiment of the invention, and as shown in Figure 5, the device of this realization TTS audio frequency transcoding comprises: first processing module 101, second processing module 103 and the 3rd processing module 105.
Wherein, first processing module 101 is used to receive the access request from application server APP, and the code/decode type collection of definite media server support; Second processing module 103 is used to receive the TTS service request of APP application, and satisfies the media business packet of this type of service to the application of TTS server according to the TTS type of service; The 3rd processing module 105 is used for holding consultation according to code/decode type collection and TTS server, obtaining the audio coding decoding type after the negotiation, and is sent to terminal according to the audio coding decoding type after with media business packet transcoding.
This device can be a kind of media server, this media server to TTS server application resource the time, SDP not with the audio coding decoding of terminal as the capability set of consulting, the code/decode formats that media server is all supported are as capability set.After waiting to consult successfully, the TTS server sends to media server inside with medium, and media server is by behind the transcoding then, and the form that needs according to terminal sends.
Need to prove, the embodiment of the invention can be carried out in the computer system such as a set of computer-executable instructions in the step shown in the flow chart of accompanying drawing, and, though there is shown logical order in flow process, but in some cases, can carry out step shown or that describe with the order that is different from herein.
From above embodiment described, as can be seen, the present invention had realized following technique effect: the effect that improves terminal access media business packet data success rate.
Obviously, those skilled in the art should be understood that, above-mentioned each module of the present invention or each step can realize with the general calculation device, they can concentrate on the single calculation element, perhaps be distributed on the network that a plurality of calculation element forms, alternatively, they can be realized with the executable program code of calculation element, thereby, they can be stored in the storage device and carry out by calculation element, perhaps they are made into a plurality of integrated circuit modules respectively, perhaps a plurality of modules in them or step are made into the single integrated circuit module and realize.Like this, the present invention is not restricted to any specific hardware and software combination.
Above-mentioned explanation illustrates and has described a preferred embodiment of the present invention, but as previously mentioned, be to be understood that the present invention is not limited to the disclosed form of this paper, should not regard eliminating as to other embodiment, and can be used for various other combinations, modification and environment, and can in invention contemplated scope described herein, change by the technology or the knowledge of above-mentioned instruction or association area.And change that those skilled in the art carried out and variation do not break away from the spirit and scope of the present invention, then all should be in the protection range of claims of the present invention.

Claims (10)

1. a realization is characterized in that from the audio frequency code-transferring method of Text To Speech TTS, comprising:
Media server receives the access request from application server APP, and determines the code/decode type collection that described media server is supported;
Described media server receive described APP application from Text To Speech TTS service request, and satisfy the media business packet of this type of service to the application of TTS server according to described TTS type of service;
Described media server is held consultation according to described code/decode type collection and described TTS server, obtaining the audio coding decoding type after the negotiation, and is sent to terminal according to described audio coding decoding type after with described media business packet transcoding.
2. method according to claim 1, it is characterized in that, described media server is held consultation according to described code/decode type collection and described TTS server, obtaining the audio coding decoding type after the negotiation, and be sent to terminal according to described audio coding decoding type after with described media business packet transcoding and comprise:
Media control unit MSCU sends session initiation protocol SIP signaling to described TTS server, and to consult and to specify the described audio coding decoding type of described media server and described TTS server coupling, described type of coding collection comprises described audio coding decoding type;
Voice center crosspoint MRU receives the described media business packet that described TTS server returns, and described media business packet carried out transcoding according to the described audio coding decoding type of consulting, and the described media business packet behind the transcoding is sent and is saved to media store transmission of audio unit MSTU;
Described MSCU controls the described media business packet of described MSTU after with transcoding and is sent to described terminal.
3. method according to claim 2 is characterized in that, before voice center crosspoint MRU received the described media business packet that described TTS server returns, described method also comprised:
Described MSCU and described TTS server establish a communications link;
Described TTS server identification text, and be the media business packet with described text-converted.
4. method according to claim 2 is characterized in that, before voice center crosspoint MRU received the described media business packet that described TTS server returns, described method also comprised:
Described MSCU issues the transcoding order to described MRU;
The port type that described MRU and described TTS server are connected is appointed as the described audio coding decoding type after the negotiation.
5. according to each described method among the claim 2-4, it is characterized in that described MSCU controls the described media business packet of described MSTU after with transcoding and is sent to described terminal and comprises:
Described MSCU issues the order of opening the NAT passage to described MSTU;
The described media business packet of described MSTU after with transcoding carries out being sent to described terminal behind the NAT.
6. method according to claim 5 is characterized in that, before the access request of media server reception from application server APP, described method also comprises:
Described terminal sends the request of multimedia service data bag to described APP;
Described APP sends the signaling of described access request according to the request of described multimedia service data bag to described media server, and with the outer port address of described MSTU as with the address of terminal interaction.
7. a realization is characterized in that from the audio frequency trans-coding system of Text To Speech TTS, comprising:
Terminal;
The TTS server;
Media server, be used to receive access request from application server APP, with the code/decode type collection of determining that described media server is supported, and receive the TTS service request of described APP application, to satisfy the media business packet of this type of service to the application of TTS server according to described TTS type of service, hold consultation according to described code/decode type collection and described TTS server then, obtaining the audio coding decoding type after the negotiation, and be sent to terminal after with described media business packet transcoding according to described audio coding decoding type.
8. system according to claim 7 is characterized in that, described media server comprises:
Media control unit MSCU, be used to send session initiation protocol SIP signaling to described TTS server, to consult and to specify the described audio coding decoding type of described media server and described TTS server coupling, described type of coding collection comprises described audio coding decoding type;
Voice center crosspoint MRU, be used to receive the described media business packet that described TTS server returns, and described media business packet carried out transcoding according to the described audio coding decoding type of consulting, and the described media business packet behind the transcoding is sent and is saved to media store transmission of audio unit MSTU;
Wherein, described MSCU controls the described media business packet of described MSTU after with transcoding and is sent to described terminal.
9. system according to claim 8 is characterized in that, described terminal sends the request of multimedia service data bag to described APP; Described APP sends the signaling of described access request according to the request of described multimedia service data bag to described media server, and with the outer port address of described MSTU as with the address of terminal interaction.
10. a realization is characterized in that from the audio frequency transcoding device of Text To Speech TTS, comprising:
First processing module is used to receive the access request from application server APP, and determines the code/decode type collection that described media server is supported;
Second processing module is used to receive the TTS service request of described APP application, and satisfies the media business packet of this type of service to the application of TTS server according to described TTS type of service;
The 3rd processing module is used for holding consultation according to described code/decode type collection and described TTS server, obtaining the audio coding decoding type after the negotiation, and is sent to terminal according to described audio coding decoding type after with described media business packet transcoding.
CN201110169703.3A 2011-06-22 2011-06-22 Realize audio code-transferring method, the apparatus and system from Text To Speech TTS Active CN102231734B (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
CN201110169703.3A CN102231734B (en) 2011-06-22 2011-06-22 Realize audio code-transferring method, the apparatus and system from Text To Speech TTS
PCT/CN2012/072860 WO2012174908A1 (en) 2011-06-22 2012-03-22 Method, device and system for realizing audio transcoding of text to speech

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201110169703.3A CN102231734B (en) 2011-06-22 2011-06-22 Realize audio code-transferring method, the apparatus and system from Text To Speech TTS

Publications (2)

Publication Number Publication Date
CN102231734A true CN102231734A (en) 2011-11-02
CN102231734B CN102231734B (en) 2017-10-03

Family

ID=44844267

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201110169703.3A Active CN102231734B (en) 2011-06-22 2011-06-22 Realize audio code-transferring method, the apparatus and system from Text To Speech TTS

Country Status (2)

Country Link
CN (1) CN102231734B (en)
WO (1) WO2012174908A1 (en)

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2012174908A1 (en) * 2011-06-22 2012-12-27 中兴通讯股份有限公司 Method, device and system for realizing audio transcoding of text to speech
CN103151041A (en) * 2013-01-28 2013-06-12 中兴通讯股份有限公司 Method and system for achieving automatic speech recognition business and media server
CN105306420A (en) * 2014-06-27 2016-02-03 中兴通讯股份有限公司 Method, device and server for realizing loop play of text to speech service
CN105635158A (en) * 2016-01-07 2016-06-01 福建星网智慧科技股份有限公司 Speech call automatic warning method based on SIP (Session Initiation Protocol)
CN107181723A (en) * 2016-03-11 2017-09-19 中兴通讯股份有限公司 A kind of media coding/decoding negotiation method and terminal device

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20070133518A1 (en) * 2005-12-13 2007-06-14 International Business Machines Corporation Distributed off-line voice services
CN101163023A (en) * 2006-10-09 2008-04-16 中兴通讯股份有限公司 Media server resource allocation processing method
CN101437047A (en) * 2008-12-09 2009-05-20 中兴通讯股份有限公司 Method, system and media server for playback/ sound-recording for user terminal
US20100017509A1 (en) * 2006-12-08 2010-01-21 Tomas Frankkila Handling announcement media in a communication network environment

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN100544463C (en) * 2007-06-29 2009-09-23 中兴通讯股份有限公司 A kind of system and method that speech synthesis application united development platform is provided
CN102231734B (en) * 2011-06-22 2017-10-03 南京中兴新软件有限责任公司 Realize audio code-transferring method, the apparatus and system from Text To Speech TTS

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20070133518A1 (en) * 2005-12-13 2007-06-14 International Business Machines Corporation Distributed off-line voice services
CN101163023A (en) * 2006-10-09 2008-04-16 中兴通讯股份有限公司 Media server resource allocation processing method
US20100017509A1 (en) * 2006-12-08 2010-01-21 Tomas Frankkila Handling announcement media in a communication network environment
CN101437047A (en) * 2008-12-09 2009-05-20 中兴通讯股份有限公司 Method, system and media server for playback/ sound-recording for user terminal

Cited By (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2012174908A1 (en) * 2011-06-22 2012-12-27 中兴通讯股份有限公司 Method, device and system for realizing audio transcoding of text to speech
CN103151041A (en) * 2013-01-28 2013-06-12 中兴通讯股份有限公司 Method and system for achieving automatic speech recognition business and media server
WO2013189430A2 (en) * 2013-01-28 2013-12-27 中兴通讯股份有限公司 Method, system, and media server for implementing automatic speech recognition service
WO2013189430A3 (en) * 2013-01-28 2014-02-20 中兴通讯股份有限公司 Method, system, and media server for implementing automatic speech recognition service
CN103151041B (en) * 2013-01-28 2016-02-10 中兴通讯股份有限公司 A kind of implementation method of automatic speech recognition business, system and media server
CN105306420A (en) * 2014-06-27 2016-02-03 中兴通讯股份有限公司 Method, device and server for realizing loop play of text to speech service
CN105635158A (en) * 2016-01-07 2016-06-01 福建星网智慧科技股份有限公司 Speech call automatic warning method based on SIP (Session Initiation Protocol)
CN107181723A (en) * 2016-03-11 2017-09-19 中兴通讯股份有限公司 A kind of media coding/decoding negotiation method and terminal device

Also Published As

Publication number Publication date
WO2012174908A1 (en) 2012-12-27
CN102231734B (en) 2017-10-03

Similar Documents

Publication Publication Date Title
CN101924772B (en) Communication system and method supporting cross-network and cross-terminal realization of multimedia session merging
CN113746808B (en) Converged communication method, gateway, electronic equipment and storage medium for online conference
EP2779579B1 (en) Method and apparatuses for realizing voip call in cloud computing environment
CN101682642B (en) Improved codec negotiation
US20230353603A1 (en) Call processing system and call processing method
US20230353673A1 (en) Call processing method, call processing apparatus, and related device
CN106921843A (en) Data transmission method and device
CN102231734A (en) Method, device and system for realizing audio transcoding of TTS (Text To Speech)
US20150116450A1 (en) Video Data Transmission Method and Apparatus, and Communications Device
CN103151041B (en) A kind of implementation method of automatic speech recognition business, system and media server
US20120331510A1 (en) Method, server and system for providing real-time video service in telecommunication network
CN102843336A (en) Method and system for accessing IMS (IP Multimedia Subsystem) multimedia conference
US9071690B2 (en) Call transfer processing in SIP mode
JP2007328405A (en) Terminal connection program and device
CN103795958A (en) Multimedia call negotiation method, system and video interworking gateway, multimedia terminal
CN102055961A (en) Method for monitoring visible terminal of called party and video monitoring system
EP2034664A1 (en) Method and device of congrolling media resource, method and system of establishing calling
CN102223386A (en) Method, device and system for remotely accessing home network
CN106791992A (en) Signal source method for pushing and system
CN113726968B (en) Terminal communication method, device, server and storage medium
CN101594623B (en) Method and equipment for monitoring call made via voice over Internet protocol
CN105306420B (en) Realize the method, apparatus played from Text To Speech cycle of business operations and server
US9143726B2 (en) Video media server for realizing video intercommunication gateway function and video intercommunication method
CN101803357A (en) Method, apparatus and system for multimedia communication
KR20120075594A (en) Method and apparatus for device capability information based incompatible media contents transformation

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
TA01 Transfer of patent application right
TA01 Transfer of patent application right

Effective date of registration: 20170628

Address after: Yuhuatai District of Nanjing City, Jiangsu province 210012 Bauhinia Road No. 68

Applicant after: Nanjing Zhongxing New Software Co., Ltd.

Address before: 518057 Nanshan District Guangdong high tech Industrial Park, South Road, science and technology, ZTE building, Ministry of Justice

Applicant before: ZTE Corporation

GR01 Patent grant
GR01 Patent grant
TR01 Transfer of patent right
TR01 Transfer of patent right

Effective date of registration: 20191125

Address after: 518057 Nanshan District science and technology, Guangdong Province, South Road, No. 55, No.

Patentee after: ZTE Communications Co., Ltd.

Address before: Yuhuatai District of Nanjing City, Jiangsu province 210012 Bauhinia Road No. 68

Patentee before: Nanjing Zhongxing New Software Co., Ltd.