CN102231734B - Realize audio code-transferring method, the apparatus and system from Text To Speech TTS - Google Patents

Realize audio code-transferring method, the apparatus and system from Text To Speech TTS Download PDF

Info

Publication number
CN102231734B
CN102231734B CN201110169703.3A CN201110169703A CN102231734B CN 102231734 B CN102231734 B CN 102231734B CN 201110169703 A CN201110169703 A CN 201110169703A CN 102231734 B CN102231734 B CN 102231734B
Authority
CN
China
Prior art keywords
media
tts
service data
data bag
type
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201110169703.3A
Other languages
Chinese (zh)
Other versions
CN102231734A (en
Inventor
张闽
张伟
刘澍
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
ZTE Corp
Original Assignee
Nanjing ZTE New Software Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Nanjing ZTE New Software Co Ltd filed Critical Nanjing ZTE New Software Co Ltd
Priority to CN201110169703.3A priority Critical patent/CN102231734B/en
Publication of CN102231734A publication Critical patent/CN102231734A/en
Priority to PCT/CN2012/072860 priority patent/WO2012174908A1/en
Application granted granted Critical
Publication of CN102231734B publication Critical patent/CN102231734B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M3/00Automatic or semi-automatic exchanges
    • H04M3/42Systems providing special services or facilities to subscribers
    • H04M3/42391Systems providing special services or facilities to subscribers where the subscribers are hearing-impaired persons, e.g. telephone devices for the deaf
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M2201/00Electronic components, circuits, software, systems or apparatus used in telephone systems
    • H04M2201/40Electronic components, circuits, software, systems or apparatus used in telephone systems using speech recognition

Abstract

The invention discloses a kind of method realized from Text To Speech TTS audio transcoding, apparatus and system.Wherein, this method includes:Media server receives the access request from application server APP, and determines the code/decode type collection that media server is supported;Media server receives the TTS service requests of APP applications, and the media service data bag of the type of service is met according to TTS types of service to TTS engine application;Media server is held consultation according to code/decode type collection and TTS engine, to obtain the audio coding decoding type after consulting, and will be sent according to audio coding decoding type after media service data bag transcoding to terminal.By means of the invention it is possible to improve the success rate of terminal access media service data bag data.

Description

Realize audio code-transferring method, the apparatus and system from Text To Speech TTS
Technical field
The present invention relates to the communications field, specifically, more particularly to a kind of audio transcoding realized from Text To Speech TTS Method, apparatus and system.
Background technology
Media server is used for all media handlings related to audio frequency and video, including video and audio RTP data are flowed to and regarded The mutual conversion of audio file.Meanwhile, also be responsible for receive user by the DTMF inputs of terminal, the guiding voice of play service, Show dynamic guide picture.Session Initiation Protocol and MSML/MOML abilities that it has enable it to the control in application server APP System is lower being interacted with user of completing whole conversation procedure.
Media control unit (MSCU) is a significant element in media server, main to complete to carry out with other entities There is provided resource management in itself, the function of safeguarding and control other service resources units to complete complicated business for capability negotiation.
Media storage transmission audio unit (MSTU-audio) is the service resources unit in media server, completes magnanimity Voice data storage, including realize audio frequency document display function.There is external network interface in media storage unit, can directly pass through External network interface transmitting-receiving on unit.
Media storage transmission video unit (MSTU-video) is the service resources unit in media server, completes magnanimity Multimedia audio-video data storage, including realize video file playing function.There is external network interface in media storage unit, can be with Directly received and dispatched by the external network interface on unit.
Now, the use that media server is broadcast is very wide.Audio and video playing can be mainly summarized as, is collected the digits and the work(such as meeting Energy.
Function from Text To Speech (Text To Speech, referred to as TTS) is to identify the text message of input Come, be converted into voice messaging, voice medium is sent to user.At present in field of telecommunications, TTS application is substantially configuration one Special TTS engine, specifies TTS that voice is sent into user terminal to complete a business by signaling.
Fig. 1 is the system structure diagram for realizing TTS audio transcodings according to correlation technique.As shown in figure 1, the system Workflow comprises the following steps:
Step 101:Terminal initiates call, activates APP business.APP initiates operation flow to media server;
Step 102:APP asks TTS business by SIP signalings to media server;
Step 103:Media server asks TTS resources by SIP signalings to TTS engine, and passes through MRCP agreement controls TTS engine finishing service function processed;
Step 104:TTS engine sends media to terminal
It is current typical networking and operation flow above.TTS engine makes as the external device of media server With.APP is simply initiated when requested service to media server, and media server judges type of service, works as type of service When being applied for TTS, media server initiates to ask again to TTS engine, applies for resource, and controls the behavior of TTS engine, TTS engine automatic terminal that media are sent to a distant place after signaling is received.
Above flow can complete a basic TTS business.But as the extension of the application of business occurs in that some are asked Topic.Such as, the audio capability collection of TTS engine causes service fail with the unmatched problem of media server capability set.Because APP is when same media server agreement SDP, and media server is not aware that whether type of service is TTS, so can be according to The limit of power of oneself consults audio frequency parameter with terminal.When APP issues INFO instructions to media server, media server is Can recognize that TTS types of service, now media server by terminal SDP information to TTS engine application resource.If TTS The audio capability scope of server is unsatisfactory for the result that media server is negotiated with terminal, exactly causes service fail.Such as: Media server negotiates code/decode type for G726 forms with terminal, but TTS engine only supports G711 audio lattice Formula..
The business demand of media server can not be met for above-mentioned in the audio capability collection of TTS engine in the prior art In the case of, the problem of terminal access media service data bag data fails, there is presently no effective solution.
The content of the invention
It is a primary object of the present invention to provide it is a kind of realize the audio code-transferring method from Text To Speech TTS, device and System, the feelings of the business demand of media server can not be met with solution in the audio capability collection of TTS engine in the prior art Under condition, the problem of terminal access media service data bag data fails.
To achieve these goals, there is provided a kind of sound realized from Text To Speech TTS according to an aspect of the present invention Frequency code-transferring method.
Realize that the method for TTS audio transcodings includes according to the present invention:Media server, which is received, comes from application server APP Access request, and determine media server support code/decode type collection;Media server receives the TTS business of APP applications Request, and the media service data bag of the type of service is met according to TTS types of service to TTS engine application;Media services Device is held consultation according to code/decode type collection and TTS engine, to obtain the audio coding decoding type after consulting, and according to audio Code/decode type will be sent to terminal after media service data bag transcoding.
Further, media server is held consultation according to code/decode type collection and TTS engine, to obtain after negotiation Audio coding decoding type, and will send to terminal and include after media service data bag transcoding according to audio coding decoding type:Media Control unit MSCU sends session initiation protocol SIP signalings to TTS engine, is taken with consulting simultaneously designated media server with TTS The audio coding decoding type of business device matching, type of coding collection includes audio coding decoding type;Speech centre crosspoint MRU is received The media service data bag that TTS engine is returned, and media service data bag is carried out according to the audio coding decoding type of negotiation Transcoding, and the media service data bag after transcoding is sent and preserved to media storage transmission audio unit MSTU;MSCU is controlled MSTU sends the media service data bag after transcoding to terminal.
Further, before the media service data bag that heart crosspoint MRU receptions TTS engine is returned in voice, Method also includes:MSCU sets up with TTS engine and communicated to connect;TTS engine recognizes text, and converts text to media sector Business packet.
Further, before the media service data bag that heart crosspoint MRU receptions TTS engine is returned in voice, Method also includes:MSCU issues transcoding order to MRU;The port type that MRU and TTS engine are connected is appointed as after consulting Audio coding decoding type.
Further, MSCU, which controls MSTU to send the media service data bag after transcoding to terminal, includes:MSCU to MSTU issues the order for opening NAT passages;MSTU is sent to terminal after the media service data bag after transcoding is carried out into NAT.
Further, before media server receives the access request from application server APP, method also includes: Terminal sends multimedia service data bag to APP and asked;APP asks to send to media server according to multimedia service data bag The signaling of access request, and port address outside MSTU is used as to the address with terminal interaction.
To achieve these goals, realized according to another aspect of the present invention there is provided one kind from Text To Speech TTS Audio trans-coding system.
Realize that the system of TTS audio transcodings includes according to the present invention:Terminal;TTS engine;Media server, is used for The access request from application server APP is received, to determine the code/decode type collection of media server support, and APP is received The TTS service requests of application, to meet the media business number of the type of service to TTS engine application according to TTS types of service According to bag, then held consultation according to code/decode type collection and TTS engine, to obtain the audio coding decoding type after consulting, and It will be sent according to audio coding decoding type after media service data bag transcoding to terminal.
Further, media server includes:Media control unit MSCU, for sending session initiation protocol SIP signalings To TTS engine, with the audio coding decoding type consulted and designated media server is matched with TTS engine, type of coding collection Including audio coding decoding type;Speech centre crosspoint MRU, the media service data bag for receiving TTS engine return, And media service data bag is subjected to transcoding according to the audio coding decoding type of negotiation, and by the media service data bag after transcoding Send and preserve to media storage transmission audio unit MSTU;Wherein, MSCU controls MSTU by the media service data after transcoding Bag is sent to terminal.
Further, terminal sends multimedia service data bag to APP and asked;APP please according to multimedia service data bag The signaling that access request is sent to media server is sought, and port address outside MSTU is used as to the address with terminal interaction.
To achieve these goals, realized according to another aspect of the present invention there is provided one kind from Text To Speech TTS Audio transcoding device.
Realize that the device of TTS audio transcodings includes according to the present invention:First processing module, self-application clothes are carried out for receiving Business device APP access request, and determine the code/decode type collection that media server is supported;Second processing module, for receiving APP The TTS service requests of application, and the media business number of the type of service is met according to TTS types of service to TTS engine application According to bag;3rd processing module, for being held consultation according to code/decode type collection and TTS engine, to obtain the audio after consulting Code/decode type, and will be sent according to audio coding decoding type after media service data bag transcoding to terminal.
By the present invention, the access request from application server APP is received using media server, and determine that media take The code/decode type collection that business device is supported;Media server receive APP application TTS service requests, and according to TTS types of service to TTS engine application meets the media service data bag of the type of service;Media server takes according to code/decode type collection and TTS Business device is held consultation, to obtain the audio coding decoding type after consulting, and according to audio coding decoding type by media service data Sent after bag transcoding to terminal, media server can not be met in the audio capability collection of TTS engine in the prior art by solving Business demand in the case of, the problem of terminal access media service data bag data fails, so reached raising terminal visit Ask the effect of media service data bag data success rate.
Brief description of the drawings
Accompanying drawing described herein is used for providing a further understanding of the present invention, constitutes the part of the present invention, this hair Bright schematic description and description is used to explain the present invention, does not constitute inappropriate limitation of the present invention.In the accompanying drawings:
Fig. 1 is the system structure diagram for realizing TTS audio transcodings according to correlation technique;
Fig. 2 is the system structure diagram for realizing TTS audio transcodings according to embodiments of the present invention;
Fig. 3 is the detailed construction schematic diagram according to media server in embodiment illustrated in fig. 2;
Fig. 4 is the method flow diagram for realizing TTS audio transcodings according to embodiments of the present invention;And
Fig. 5 is the apparatus structure schematic diagram for realizing TTS audio transcodings according to embodiments of the present invention.
Embodiment
In order that technical problems, technical solutions and advantages to be solved are clearer, clear, tie below Drawings and examples are closed, the present invention will be described in further detail.It should be appreciated that specific embodiment described herein is only To explain the present invention, it is not intended to limit the present invention.
The invention provides a kind of system for realizing TTS audio transcodings.Fig. 2 is according to embodiments of the present invention to realize TTS The system structure diagram of audio transcoding, as shown in Fig. 2 the system includes:Terminal;TTS engine;Media server, is used for The access request from application server APP is received, to determine the code/decode type collection of media server support, and APP is received The TTS service requests of application, to meet the media business number of the type of service to TTS engine application according to TTS types of service According to bag, then held consultation according to code/decode type collection and TTS engine, to obtain the audio coding decoding type after consulting, and It will be sent according to audio coding decoding type after media service data bag transcoding to terminal.
Above-described embodiment realizes that when to TTS engine application resource Session Description Protocol SDP is not by media server Using the audio coding decoding of terminal as the capability set of negotiation, and the code/decode format that media server has all been supported is used as ability Collection, after consulting successfully in the media server, media are sent to inside media server by TTS engine, then media services After device is by transcoding, the form needed according to terminal is sent, so that the audio capability collection solved in TTS engine can not In the case of the business demand for meeting media server, the problem of terminal access media service data bag data fails, Jin Erda The effect for improving terminal access media service data bag data success rate is arrived.
Fig. 3 is the detailed construction schematic diagram according to media server in embodiment illustrated in fig. 2.As shown in figure 3, in the application Stating the media server in embodiment can include:Media control unit MSCU, for sending session initiation protocol SIP signalings extremely TTS engine, with the audio coding decoding type consulted and designated media server is matched with TTS engine, type of coding Ji Bao Include audio coding decoding type;Speech centre crosspoint MRU, the media service data bag for receiving TTS engine return, and Media service data bag is subjected to transcoding according to the audio coding decoding type of negotiation, and the media service data bag after transcoding is sent out Send and preserve to media storage transmission audio unit MSTU;Wherein, MSCU controls MSTU by the media service data bag after transcoding Send to terminal.
Preferably, in above-described embodiment, terminal can send multimedia service data bag to APP and ask;APP is according to many matchmakers Body business data packet ask to media server send access request signaling, and using port address outside MSTU as with terminal interaction Address.
Specifically, as shown in Fig. 2 the detailed operation flow of the system comprises the following steps:
Step S10, terminal asks multimedia service data bag to APP, and APP sends INVITE signalings to media server and entered Row media negotiation, media server by the capability set of itself select code/decode type, and using port address outside MSTU as with end Hold the address of interaction.
It is application TTS business that step S20, APP server sends the content in INFO requests, INFO to media server, Meanwhile, transfer after media server identification type of service to TTS engine application media service data bag data;
Step S30, media server is held consultation with TTS engine, and controls TTS engine progress text to be converted to language Sound.As shown in figure 3, step S30 specifically may include steps of:
Step S301, media control unit MSCU initiate session initiation protocol SIP signalings to TTS engine, consult to compile solution Code type.What the audio coding decoding capability set now consulted in INVITE signalings was possessed by media server, as MRU branch All code/decode types held, and require that media bag is sent to media server by TTS engine, and received by MRU.
Step S302, MSCU issue the order for opening NAT passages to MSTU, indicate that the data that will be received from MRU are sent To terminal (user terminal).
Step S303, MSCU issue transcoding order to MRU.MRU is specified to receive the media bag sended over from TTS, and will The result that MRU is appointed as consulting to obtain from step S301 with the portable audio code/decode type that TTS is connected, and MRU is exported Media code/decode type be set to terminal and be actually needed receive code/decode type.
Step S304, MSCU set up TCP/IP links with TTS.And instruction is sent to TTS engine by MRCP agreements, refer to Show that TTS engine recognizes text, and the media service data bag after conversion is sent to MRU end.
Media service data bag is sent to MRU receiving port by step S305, TTS engine.
Step S306, MRU will terminate the media progress transcoding received from TTS, and the audio frequency media after transcoding is sent to Media storage transmits audio unit MSTU receiving ports;
Step S307, MSTU are received after MRU audio pack, directly will audio pack carry out NAT after be sent to terminal.
Several steps more than, terminal just can receive the audio stream be converted to by text.
Finally, media server reports INFO implementing results to APP, while APP sends BYE signalings to media server, Discharge resource.In media server to TTS engine request release resource, the backward APP returning results of success, now call is tied Beam.
Fig. 4 is the method flow diagram for realizing TTS audio transcodings according to embodiments of the present invention.As shown in figure 4, the realization The method of TTS audio transcodings comprises the following steps:
Step S41, media server receives the access request from application server APP, and determines media server branch The code/decode type collection held;
Step S43, media server receives the TTS service requests of APP applications, and according to TTS types of service to TTS service Device application meets the media service data bag of the type of service;
Step S45, media server is held consultation according to code/decode type collection and TTS engine, to obtain after negotiation Audio coding decoding type, and will be sent according to audio coding decoding type after media service data bag transcoding to terminal.
In above-described embodiment, media server is by the way that when to TTS engine application resource, the SDP in the embodiment is not Using the audio coding decoding of terminal as the capability set of negotiation, and the code/decode format that media server has all been supported is used as ability Collection.After consulting successfully, media are sent to inside media server by TTS engine, after then media server is by transcoding, The form needed according to terminal is sent, so that the audio capability collection solved in TTS engine can not meet media services In the case of the business demand of device, the problem of terminal access media service data bag data fails, and then reached raising terminal Access the effect of media service data bag data success rate.
In above-described embodiment, step S45 media services are held consultation according to code/decode type collection and TTS engine, to obtain The audio coding decoding type after consulting is taken, and will be sent according to audio coding decoding type after media service data bag transcoding to terminal The step of can include:Media control unit MSCU sends session initiation protocol SIP signalings to TTS engine, to consult and refer to Determine the audio coding decoding type that media server is matched with TTS engine, type of coding collection includes audio coding decoding type;Voice Center crosspoint MRU receives the media service data bag that TTS engine is returned, and by media service data bag according to negotiation Audio coding decoding type carries out transcoding, and the media service data bag after transcoding is sent and preserved to media storage transmission audio Unit MSTU;MSCU controls MSTU sends the media service data bag after transcoding to terminal.
Preferably, in above-described embodiment, heart crosspoint MRU receives the media business that TTS engine is returned in voice Before packet, method also includes:MSCU sets up with TTS engine and communicated to connect;TTS engine recognizes text, and by text Be converted to media service data bag.
Preferably, in above-described embodiment, heart crosspoint MRU receives the media business that TTS engine is returned in voice Before packet, method also includes:MSCU issues transcoding order to MRU;The MRU port types connected with TTS engine are referred to It is set to the audio coding decoding type after consulting.
In each above-mentioned embodiment of the present invention, MSCU controls MSTU sends the media service data bag after transcoding to terminal The step of can include:MSCU issues the order for opening NAT passages to MSTU;MSTU enters the media service data bag after transcoding Sent after row NAT to terminal.
Preferably, before media server receives the access request from application server APP, method also includes:Eventually Hold to APP and send the request of multimedia service data bag;APP is asked to send to media server and visited according to multimedia service data bag The signaling of request is asked, and port address outside MSTU is used as to the address with terminal interaction.
Present invention also offers a kind of device for realizing TTS audio transcodings.Fig. 5 is realization according to embodiments of the present invention The apparatus structure schematic diagram of TTS audio transcodings, as shown in figure 5, this realizes that the device of TTS audio transcodings includes:First processing mould Block 101, the processing module 105 of Second processing module 103 and the 3rd.
Wherein, first processing module 101, for receiving the access request from application server APP, and determine that media take The code/decode type collection that business device is supported;Second processing module 103, the TTS service requests for receiving APP applications, and according to TTS Type of service meets the media service data bag of the type of service to TTS engine application;3rd processing module 105, for root Hold consultation, solved with obtaining the audio coding decoding type after consulting, and being compiled according to audio according to code/decode type collection and TTS engine Code type will be sent to terminal after media service data bag transcoding.
The device can be a kind of media server, and the media server is when to TTS engine application resource, and SDP is not Using the audio coding decoding of terminal as the capability set of negotiation, and the code/decode format that media server has all been supported is used as ability Collection.After consulting successfully, media are sent to inside media server by TTS engine, after then media server is by transcoding, The form needed according to terminal is sent.
It should be noted that the embodiment of the present invention can be in such as one group computer the step of the flow of accompanying drawing is illustrated Performed in the computer system of executable instruction, and, although logical order is shown in flow charts, but in some situations Under, can be with the step shown or described by being performed different from order herein.
In embodiment description more than, it can be seen that the present invention realizes following technique effect:Improve terminal access matchmaker The effect of body business datum bag data success rate.
Obviously, those skilled in the art should be understood that above-mentioned each module of the invention or each step can be with general Computing device realize that they can be concentrated on single computing device, or be distributed in multiple computing devices and constituted Network on, alternatively, the program code that they can be can perform with computing device be realized, it is thus possible to they are stored Performed in the storage device by computing device, either they are fabricated to respectively multiple integrated circuit modules or by they In multiple modules or step single integrated circuit module is fabricated to realize.So, the present invention is not restricted to any specific Hardware and software is combined.
A preferred embodiment of the present invention has shown and described in described above, but as previously described, it should be understood that the present invention Be not limited to form disclosed herein, be not to be taken as the exclusion to other embodiment, and available for various other combinations, Modification and environment, and above-mentioned teaching or the technology or knowledge of association area can be passed through in invention contemplated scope described herein It is modified., then all should be in this hair and the change and change that those skilled in the art are carried out do not depart from the spirit and scope of the present invention In the protection domain of bright appended claims.

Claims (10)

1. a kind of audio code-transferring method realized from Text To Speech TTS, it is characterised in that including:
Media server receives the access request from application server APP, and determines the volume solution that the media server is supported Code type collection;
The media server receive the APP applications from Text To Speech TTS service requests, and according to TTS types of service The media service data bag of the type of service is met to TTS engine application;
The media server is held consultation according to the code/decode type collection and the TTS engine, to obtain after negotiation Audio coding decoding type, and will be sent according to the audio coding decoding type after the media service data bag transcoding to terminal.
2. according to the method described in claim 1, it is characterised in that the media server according to the code/decode type collection with The TTS engine is held consultation, to obtain the audio coding decoding type after consulting, and will according to the audio coding decoding type Being sent after the media service data bag transcoding to terminal includes:
Media control unit MSCU sends session initiation protocol SIP signalings to the TTS engine, to consult and specify the matchmaker The audio coding decoding type that body server is matched with the TTS engine, the code/decode type collection includes the audio Code/decode type;
Speech centre crosspoint MRU receives the media service data bag that the TTS engine is returned, and by the media Business data packet carries out transcoding according to the audio coding decoding type of negotiation, and by the media service data bag after transcoding Send and preserve to media storage transmission audio unit MSTU;
The MSCU controls the MSTU to send the media service data bag after transcoding to the terminal.
3. method according to claim 2, it is characterised in that heart crosspoint MRU receives the TTS service in voice Before the media service data bag that device is returned, methods described also includes:
The MSCU sets up with the TTS engine and communicated to connect;
The TTS engine recognizes text, and the text is converted into media service data bag.
4. method according to claim 2, it is characterised in that heart crosspoint MRU receives the TTS service in voice Before the media service data bag that device is returned, methods described also includes:
The MSCU issues transcoding order to the MRU;
The port type that the MRU and the TTS engine are connected is appointed as the audio coding decoding type after consulting.
5. the method according to any one of claim 2-4, it is characterised in that the MSCU controls the MSTU by transcoding The media service data bag afterwards, which is sent to the terminal, to be included:
The MSCU issues the order for opening NAT passages to the MSTU;
The MSTU is sent to the terminal after the media service data bag after transcoding is carried out into NAT.
6. method according to claim 5, it is characterised in that received in media server from application server APP's Before access request, methods described also includes:
The terminal sends multimedia service data bag to the APP and asked;
The APP asks to send the letter of the access request to the media server according to the multimedia service data bag Order, and it regard port address outside the MSTU as the address with terminal interaction.
7. a kind of audio trans-coding system realized from Text To Speech TTS, it is characterised in that including:
Terminal;
TTS engine;
Media server, for receiving the access request from application server APP, to determine the media server support Code/decode type collection, and receive the TTS service requests of APP application, with according to TTS types of service to TTS engine application The media service data bag of the type of service is met, is then assisted according to the code/decode type collection and the TTS engine Business, to obtain the audio coding decoding type after consulting, and according to the audio coding decoding type by the media service data bag Sent after transcoding to terminal.
8. system according to claim 7, it is characterised in that the media server includes:
Media control unit MSCU, for sending session initiation protocol SIP signalings to the TTS engine, to consult and specify The audio coding decoding type that the media server is matched with the TTS engine, the code/decode type collection includes institute State audio coding decoding type;
Speech centre crosspoint MRU, for receiving the media service data bag that the TTS engine is returned, and by institute State media service data bag and carry out transcoding according to the audio coding decoding type of negotiation, and by the media business after transcoding Packet sends and preserved to media storage transmission audio unit MSTU;
Wherein, the MSCU controls the MSTU to send the media service data bag after transcoding to the terminal.
9. system according to claim 8, it is characterised in that the terminal sends multimedia service data to the APP Bag request;The APP asks to send the access request to the media server according to the multimedia service data bag Signaling, and it regard port address outside the MSTU as the address with terminal interaction.
10. a kind of audio transcoding device realized from Text To Speech TTS, it is characterised in that including:
First processing module, for receiving the access request from application server APP, and determines the volume that media server is supported Coding type collection;
Second processing module, the TTS service requests for receiving APP application, and according to TTS types of service to TTS service Device application meets the media service data bag of the type of service;
3rd processing module, for being held consultation according to the code/decode type collection and the TTS engine, to obtain after negotiation Audio coding decoding type, and will be sent according to the audio coding decoding type after the media service data bag transcoding to end End.
CN201110169703.3A 2011-06-22 2011-06-22 Realize audio code-transferring method, the apparatus and system from Text To Speech TTS Active CN102231734B (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
CN201110169703.3A CN102231734B (en) 2011-06-22 2011-06-22 Realize audio code-transferring method, the apparatus and system from Text To Speech TTS
PCT/CN2012/072860 WO2012174908A1 (en) 2011-06-22 2012-03-22 Method, device and system for realizing audio transcoding of text to speech

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201110169703.3A CN102231734B (en) 2011-06-22 2011-06-22 Realize audio code-transferring method, the apparatus and system from Text To Speech TTS

Publications (2)

Publication Number Publication Date
CN102231734A CN102231734A (en) 2011-11-02
CN102231734B true CN102231734B (en) 2017-10-03

Family

ID=44844267

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201110169703.3A Active CN102231734B (en) 2011-06-22 2011-06-22 Realize audio code-transferring method, the apparatus and system from Text To Speech TTS

Country Status (2)

Country Link
CN (1) CN102231734B (en)
WO (1) WO2012174908A1 (en)

Families Citing this family (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102231734B (en) * 2011-06-22 2017-10-03 南京中兴新软件有限责任公司 Realize audio code-transferring method, the apparatus and system from Text To Speech TTS
CN103151041B (en) * 2013-01-28 2016-02-10 中兴通讯股份有限公司 A kind of implementation method of automatic speech recognition business, system and media server
CN105306420B (en) * 2014-06-27 2019-08-30 中兴通讯股份有限公司 Realize the method, apparatus played from Text To Speech cycle of business operations and server
CN105635158A (en) * 2016-01-07 2016-06-01 福建星网智慧科技股份有限公司 Speech call automatic warning method based on SIP (Session Initiation Protocol)
CN107181723A (en) * 2016-03-11 2017-09-19 中兴通讯股份有限公司 A kind of media coding/decoding negotiation method and terminal device

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101163023A (en) * 2006-10-09 2008-04-16 中兴通讯股份有限公司 Media server resource allocation processing method
CN101437047A (en) * 2008-12-09 2009-05-20 中兴通讯股份有限公司 Method, system and media server for playback/ sound-recording for user terminal

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8451823B2 (en) * 2005-12-13 2013-05-28 Nuance Communications, Inc. Distributed off-line voice services
CN101601269B (en) * 2006-12-08 2015-11-25 艾利森电话股份有限公司 The method switched between user media and announcement media, system and announcement server
CN100544463C (en) * 2007-06-29 2009-09-23 中兴通讯股份有限公司 A kind of system and method that speech synthesis application united development platform is provided
CN102231734B (en) * 2011-06-22 2017-10-03 南京中兴新软件有限责任公司 Realize audio code-transferring method, the apparatus and system from Text To Speech TTS

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101163023A (en) * 2006-10-09 2008-04-16 中兴通讯股份有限公司 Media server resource allocation processing method
CN101437047A (en) * 2008-12-09 2009-05-20 中兴通讯股份有限公司 Method, system and media server for playback/ sound-recording for user terminal

Also Published As

Publication number Publication date
CN102231734A (en) 2011-11-02
WO2012174908A1 (en) 2012-12-27

Similar Documents

Publication Publication Date Title
US8300772B2 (en) Method and apparatus for emergency call processing
CN106850399B (en) Communication method based on WebRTC technology instant message
CN102231734B (en) Realize audio code-transferring method, the apparatus and system from Text To Speech TTS
WO2021057642A1 (en) Call processing method and device
WO2016150213A1 (en) Data processing method in webpage-based real-time communication media and device utilizing same
US9894128B2 (en) Selective transcoding
US20130091291A1 (en) Method and apparatus for improving voice or video transmission quality in cloud computing mode
CN103647764B (en) A method for implementing LTE system voice business and a single-chip terminal
CN101656715B (en) Method, system and device for media bypass
US20110224969A1 (en) Method, a Media Server, Computer Program and Computer Program Product For Combining a Speech Related to a Voice Over IP Voice Communication Session Between User Equipments, in Combination With Web Based Applications
CN107566671A (en) Network voice communication method and its system, storage medium, electronic equipment
CN104320403B (en) Communication means and device
CN105227418A (en) Data channel establishing method and communication equipment
US20230269280A1 (en) Communication method, communication apparatus, and communication system
CN107124417A (en) MMTel application servers, conversational system and method based on Heterogeneous Computing
CN103684970B (en) The transmission method of media data flow and thin terminal
CN103151041B (en) A kind of implementation method of automatic speech recognition business, system and media server
CN101902455A (en) Open multimedia conference service system and implementing method thereof
CN103702063A (en) Method for realizing dynamic media negotiation in video conference system
CN106488260A (en) Resource request method and device, terminal control method and device
CN105306420B (en) Realize the method, apparatus played from Text To Speech cycle of business operations and server
CN103634697B (en) Net the implementation method of true technology and net true equipment
CN106488261A (en) Resource request method and device
US9143726B2 (en) Video media server for realizing video intercommunication gateway function and video intercommunication method
KR20060038296A (en) Apparatus and method for multiplexing the packet in mobile communication network

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
TA01 Transfer of patent application right

Effective date of registration: 20170628

Address after: Yuhuatai District of Nanjing City, Jiangsu province 210012 Bauhinia Road No. 68

Applicant after: Nanjing Zhongxing New Software Co., Ltd.

Address before: 518057 Nanshan District Guangdong high tech Industrial Park, South Road, science and technology, ZTE building, Ministry of Justice

Applicant before: ZTE Corporation

TA01 Transfer of patent application right
GR01 Patent grant
GR01 Patent grant
TR01 Transfer of patent right

Effective date of registration: 20191125

Address after: 518057 Nanshan District science and technology, Guangdong Province, South Road, No. 55, No.

Patentee after: ZTE Communications Co., Ltd.

Address before: Yuhuatai District of Nanjing City, Jiangsu province 210012 Bauhinia Road No. 68

Patentee before: Nanjing Zhongxing New Software Co., Ltd.

TR01 Transfer of patent right