CN101010931A

CN101010931A - Audio chunking

Info

Publication number: CN101010931A
Application number: CN 200580021511
Authority: CN
Inventors: Sonny R. Bettis; Jon S. Plotky; Ian M. Moraes; Philip L. Lowman; James H. Spencer
Original assignee: Glenayre Electronics Inc
Current assignee: Glenayre Electronics Inc
Priority date: 2004-06-30
Filing date: 2005-06-30
Publication date: 2007-08-01
Also published as: CA2571122A1

Abstract

A voice mail system that delivers voice messages in chunks, thereby optimizing the delivery of the information. When a subscriber attempts to listen to his or her voice mail messages, the header information for the first voice mail message is downloaded and played back to the subscriber. While the subscriber listens to the header information, the next two blocks of data of the voice message are downloaded. Upon completion of the playback of the header information, the first two blocks of the voice mail message have been delivered and the first block is immediately available for playback. As the subscriber listens to the voice mail message, subsequent blocks of the first voice mail message, and optionally header information and blocks of subsequent voice mail messages, are simultaneously downloaded. Thus, the subscriber is able to listen to the voice mail messages seamlessly, without a delay in retrieval, regardless of the order in which the subscriber listens to the messages.

Description

Audio chunking

Cross-reference to related applications

This application claims priority to United States provisional patent application serial number 60/584,058, entitled "AUDIO CHUNKING" and filed on June 30, 2004, which is incorporated herein by reference.

Technical field

Not applicable.

Background

Alexander Graham Bell described his successful telephone experiment in a notebook entry dated March 10, 1876. Speaking through the instrument to his assistant, Thomas A. Watson, in the next room, Bell uttered the famous words, "Mr. Watson, come here, I want to see you." Ever since, engineers, marketers and consumers have pursued faster transmission of more information over telecommunications and/or computer networks. In a short time, we have moved from delivering data with 300 baud modems to mature T1 carriers delivered to the home, and to cable modems and DSL lines that deliver data to consumers at rates of millions of bits per second.

Although the technological advances in data transfer speed are astonishing, they are still challenged by the imagination of users. As bandwidth and data rates increase, users continue to place demands on applications that challenge the capabilities of the current state of technology. Applications that require downloading large amounts of data, audio files, video files and pictures can easily challenge the bandwidth and data-handling capacity of home and office network solutions. As data rates increase, the quality of audio, video or other data can also increase, which in turn requires more data to be downloaded and once again challenges the throughput of the network.

As a result, users have become somewhat accustomed (especially in the realm of personal computer applications) to waiting at least some period of time for a data, audio, video or picture file to download before the file can be used. More particularly, for downloaded audio files, users have become accustomed to waiting several seconds while the audio stream is downloaded or while the bulk of the file is loaded into a buffer.

For a voice mail system, such a delay is unacceptable. Thus, there is a need in the art for a technique that minimizes or reduces the delay a user experiences when downloading an audio file, especially when voice mail messages are delivered through a telecommunications system.

Summary of the invention

The present invention addresses the needs listed above, and other needs in the art, by providing a technique for downloading a file block by block while keeping enough data at the target destination to ensure uninterrupted playback of, or access to, the data. In general, when download of a file is requested, two portions of the file are sent to the requesting target. While the first portion of the file is being played back or otherwise used, a third portion of the file is downloaded. This operation continues until the entire file has been downloaded or until the user requests that the download stop. The user can thus access the data in an uninterrupted manner with minimal delay.

In one embodiment, the present invention is incorporated into a voice mail system to help subscribers access voice messages. When a subscriber wants to retrieve a voice mail message, the metadata of the file and the first block, or the first two blocks, are downloaded to the requesting target. Once the download is complete (which takes place in a very short time), playback begins. While playback of a first portion of the voice mail message is active, the next portion of the message is downloaded to the subscriber. The subscriber thus has a continuous supply of the audio with minimal delay. Advantageously, the present invention provides continuous playback of audio and/or video files without requiring significant buffering at the target destination and without incurring a noticeable delay before reception of the audio and/or video begins.

Brief description of the drawings

FIG. 1 is a system diagram illustrating the components and connectivity of an exemplary next-generation communications platform into which the present invention can be incorporated.

FIG. 2 is a flow diagram illustrating the operation of an exemplary embodiment of the present invention.

FIG. 3 is a timing diagram illustrating another embodiment of the present invention.

Detailed description

The present invention is directed to providing audio messages using a chunking technique, that is, delivering the audio in pieces. The invention involves splitting the audio message into several segments or blocks. Initially, when a user requests download of the audio message, two blocks are downloaded immediately. Once the first two blocks have been delivered, playback of the first block to the user begins. While the first block is playing, a third block is downloaded. Once playback of the first block is finished, playback of the second block begins; by then the third block has been fully downloaded, and download of the next block is initiated.
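
The block-by-block sequence described above can be sketched as follows. This is a minimal illustration only, not the disclosed implementation: the function names `split_into_blocks` and `lookahead_schedule`, the zero-based block numbering, and the use of a simple queue to stand in for the download are all assumptions made for the example.

```python
from collections import deque


def split_into_blocks(audio, block_size):
    """Split a message's audio bytes into fixed-size blocks (the last may be short)."""
    return [audio[i:i + block_size] for i in range(0, len(audio), block_size)]


def lookahead_schedule(num_blocks, initial=2):
    """Return (block_played, block_fetched_meanwhile) pairs, zero-based.

    `initial` blocks are fetched before playback starts. Each time a block
    starts playing, the next block not yet fetched is requested.
    """
    pending = deque(range(num_blocks))   # blocks still at the message store
    buffered = deque()                   # blocks already delivered
    for _ in range(min(initial, num_blocks)):
        buffered.append(pending.popleft())

    schedule = []
    while buffered:
        playing = buffered.popleft()
        fetched = pending.popleft() if pending else None
        if fetched is not None:
            buffered.append(fetched)
        schedule.append((playing, fetched))
    return schedule
```

For a five-block message, the schedule pairs playback of block 0 with the fetch of block 2, playback of block 1 with the fetch of block 3, and so on, so that a delivered block is always waiting when the current one finishes.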

Advantageously, aspects and embodiments of the present invention provide a seamless audio interface that appears to the user to have no significant playback delay. The audio can be delivered using the TCP/IP protocol, which provides ordering and retransmission, or using some other protocol that provides similar or equivalent functionality with some level of guaranteed receipt and packet sequencing. The block size is selected to minimize the delivery time of the initial download while providing a level of assurance that the user receives continuous audio.

United States patent application serial number 11/___, filed on _ _ _, describes a distributed IP architecture for a telecommunications voice mail system. The contents of that application are incorporated herein by reference.

FIG. 1 is a system diagram illustrating the components and connectivity of an exemplary next-generation communications platform into which the present invention can be incorporated. The illustrated system includes a distributed IP-based architecture for telecommunications equipment that can provide telecommunications services such as voice mail, call forwarding or other telecommunications features. In the illustrated embodiment, the next-generation communications platform 100 has a distributed IP architecture and is connected to the public switched telephone network (PSTN) or a mobile switching center (MSC) 110. The communications platform 100 is illustrated as including a signaling gateway function (SGF) 120, one or more media servers (MS) 130, one or more system management units (SMU) 140, one or more application servers (AS) 150, and one or more central data and message stores (CDMS), or next generation message stores (NGMS), 160. It should be appreciated that the distribution of functionality illustrated in the figure is not the only acceptable arrangement, and that aspects of the present invention can be incorporated into a system that includes fewer or more components and a different arrangement of functionality among the components.

In the illustrated distributed system, a problem arises in connection with the download and playback of voice mail messages. Unlike a system in which a subscriber calls into a system dedicated to providing voice messages, the illustrated system must deliver the messages to the media server over an IP network. This can cause significant delay in retrieving messages and can result in dead space between messages or between portions of a message. The present invention provides seamless delivery of voice messages through audio chunking.

In general, the SGF 120 serves as the Signaling System 7 (SS7) interface to the PSTN, MSC or other telecommunications network 110. The media server 130 terminates IP and/or circuit-switched traffic from the telecommunications network through a multi-interface design and is responsible for trunking and call control. The application server 150 generates dynamic VoiceXML pages for the various applications, which are rendered by the media server 130, and provides an external interface through a web application server configuration. The SMU 140 is a management portal that enables service providers to provision and maintain subscriber accounts and to manage network elements from a centralized web interface. The CDMS 160 stores voice messages and subscriber records, and manages application-specific functions, including notification. Each of these subsystems is described in more detail below.

Each component of the next-generation communications platform can be upgraded independently and can be interconnected independently over an IP network. The components can therefore be geographically distributed yet still operate as a single communications platform, as long as they can communicate with one another over the IP network. This is a significant advantage of the present invention that is not available in prior-art telecommunications systems.

The MS 130 terminates IP traffic from the SGF 120 and circuit-switched traffic from the PSTN 110. The MS 130 is responsible for call setup and control within the platform architecture. The MS 130 processes user input in the form of voice, DTMF or other signaling schemes (much as a web client collects a user's keyboard and mouse-click input). The MS 130 then presents content back to the user in voice form (similar in principle to a PC client rendering graphics and text to a user). This client/server approach is important to the platform architecture because it enables rapid creation of new applications and rapid use of content available on the World Wide Web. The client/server architecture is also an enabler of the system's ability to be geographically distributed.

Voice messages left for a subscriber are stored in the CDMS 160 and can be retrieved by the subscriber at a later time. When the subscriber retrieves voice messages, the audio messages can be delivered from the CDMS 160 to a media server 130 through one or more application servers 150. Advantageously, delivery of the audio messages can be interleaved, so that playback of multiple voice messages to multiple users can be coordinated.

FIG. 2 is a flow diagram illustrating the operation of an exemplary embodiment of the present invention. Although this embodiment is described within a voice mail retrieval environment, it should be appreciated that the various aspects of the invention can be employed in a variety of environments. In the described embodiment, it is assumed that the distributed voice mail system has received several voice messages for a particular subscriber. At step 210, the MS 130 receives an incoming call from a subscriber requesting to review voice mail messages. At this point, the MS 130 must retrieve the voice mail messages from the CDMS 160, or next generation message store (NGMS). At step 215, the MS 130 requests retrieval of the subscriber's voice mail; in the illustrated embodiment, this is shown as sending a request to the AS 150. At step 220, the AS 150 retrieves header information or metadata from the NGMS 160 and, at step 225, provides this information to the MS 130.

The metadata is a relatively small block and can be delivered quite quickly. In an exemplary embodiment, the metadata includes header information. As a non-limiting example, the header information can include the time the message was received, the length of the message, the identity of the sender, the priority of the message, the class or type of the message, and so on. The MS 130 cooperates with the AS 150 to convert the metadata into a VXML page and begins rendering it to the caller (230A). At the same time, the AS 150 operates to retrieve the blocks of the voice message associated with that metadata (230B). In an exemplary embodiment, two 16-kilobyte blocks of the voice data are retrieved while the metadata VXML is played to the caller. At step 235, once the AS 150 has retrieved the two blocks from the CDMS 160, they are delivered to the MS 130 for playback.

Once the metadata has finished playing, the caller can immediately begin playback of the voice message, starting with the first block (240A). Meanwhile, at step 240B, the AS 150 continues by retrieving the next block of the voice message from the CDMS 160 and, at step 245, delivers that block to the MS 130. After the first block has been rendered to the caller, the MS 130 begins rendering the second block (250A), while the AS 150 retrieves the next block from the CDMS 160 at step 250B. The system thus operates to stay at least one block ahead of playback of the message at all times: while the caller listens to the first block, the AS 150 requests the third block and provides it to the MS 130. It should be appreciated that the invention can also be practiced in a "just in time" manner. That is, rather than ensuring that delivery of the voice message stays at least one block ahead of playback, the next block of the voice message can be delivered just in time for playback, before the previous block finishes.
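
The difference between staying one block ahead and delivering "just in time" can be illustrated with a small timing model. The sketch below is hypothetical and rests on several assumptions that are not in the description: the function name `total_stall`, the per-block fetch times, the rule that each fetch is triggered when a block starts playing, and the simplification that fetches proceed independently of one another. Playback of the metadata is not modelled; the initial blocks are simply assumed to arrive before playback starts.

```python
def total_stall(num_blocks, play_s, fetch_times, ahead):
    """Return the seconds of silence heard during playback in this toy model.

    fetch_times[i] is a hypothetical time, in seconds, to fetch block i.
    ahead=2 approximates "at least one block ahead of playback";
    ahead=1 approximates "just in time" delivery.
    """
    arrival = [0.0] * num_blocks
    clock = 0.0
    # The initial blocks are fetched back to back before playback starts.
    for i in range(min(ahead, num_blocks)):
        clock += fetch_times[i]
        arrival[i] = clock

    stall = 0.0
    for i in range(num_blocks):
        if arrival[i] > clock:            # block not delivered yet: caller waits
            stall += arrival[i] - clock
            clock = arrival[i]
        nxt = i + ahead                   # starting block i triggers the next fetch
        if nxt < num_blocks:
            arrival[nxt] = clock + fetch_times[nxt]
        clock += play_s                   # block i plays to completion
    return stall
```

With 2-second blocks and one slow fetch (3 seconds for the fourth block), the one-block-ahead setting absorbs the spike, while the just-in-time setting produces a 1-second gap in this model. The trade-off is that the look-ahead setting holds one more block at the media server.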

An advantage of the present invention is that, if a user only wants to browse his or her messages, the amount of data downloaded is minimized. For example, if a user requests download of a voice message, two blocks of the voice message are downloaded and playback begins. If the user decides to delete or skip the message, the user can instruct the MS 130 by a voice command or a DTMF command. While the system is processing the user's action, the first two blocks of the next message can be downloaded. For example, if the user, while listening to a message, chooses to skip the remainder of the message and move on to the next message, the metadata of the next message is retrieved from the NGMS 160 (unless previously retrieved), converted into a VXML page (unless previously converted) and then provided to the MS 130 for rendering. Likewise, while the metadata VXML is being rendered, the AS 150 retrieves the first two blocks of the next voice message. Thus, only the content that the user is about to need is downloaded, rather than an entire message or series of messages.
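
The saving can be made concrete with a rough count of blocks fetched while a caller browses a mailbox. The sketch below is an assumption-laden illustration: the names `blocks_fetched` and `browse_cost` are hypothetical, and the count assumes that two blocks are fetched during the header and that one further block is fetched each time a block starts playing, with every fetch that has started running to completion.

```python
def blocks_fetched(total_blocks, blocks_started, initial=2):
    """Blocks fetched for one message if the caller started `blocks_started`
    blocks before skipping, deleting or finishing the message."""
    return min(total_blocks, initial + blocks_started)


def browse_cost(mailbox):
    """mailbox is a list of (total_blocks, blocks_started) pairs, one per message.

    Returns (blocks actually fetched, blocks in the whole mailbox).
    """
    fetched = sum(blocks_fetched(total, started) for total, started in mailbox)
    total = sum(total for total, _ in mailbox)
    return fetched, total
```

For example, a caller who skips a 10-block message during its first block, listens to all of a 4-block message, and skips a 25-block message during its header would cause 9 of the 39 blocks to be fetched under these assumptions.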

FIG. 3 is a timing diagram illustrating another embodiment of the present invention. In this figure, the CDMS is omitted for simplicity. At step 310, the MS 130 receives a request from a subscriber, through the telecommunications network 110, to retrieve his or her voice mail messages. At step 315, the MS 130 requests the subscriber's voice mail messages from the AS 150. At step 325, the AS 150 delivers the metadata of the first voice message to the MS 130 and, at step 330, the MS 130 begins rendering the metadata to the subscriber. The AS 150 then retrieves the metadata of the second voice mail message at step 335 and the first two blocks of the first voice mail message at step 340. At step 345, playback of the metadata of the first message is complete and the MS 130 begins playback of the first block of the first message. If the subscriber chooses to skip the remainder of the message (350), the MS 130 has already received the metadata of the second voice mail message and can therefore immediately begin playback of that metadata (355). During playback of the metadata of the second voice mail message, the AS 150 retrieves the metadata of the third voice mail message, if there is one (360), and the first two blocks of the second voice mail message (365). Once playback of the metadata of the second voice mail message is complete, the MS 130 has already received the blocks of the second message and immediately begins playback of its first block (370).
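
The overlap of playback and retrieval in FIG. 3 can be sketched with Python's `asyncio`. This is an illustrative model rather than the disclosed implementation: the coroutine names, the item labels such as `"msg1/block1"`, and the sleep durations standing in for network transfer and playback time are all assumptions. For simplicity the first block of the first message is played to completion before the skip at step 350 is handled, whereas in the description the skip can arrive during that block.

```python
import asyncio


async def fetch(cache, item):
    """Stand-in for the AS retrieving an item and delivering it to the MS."""
    await asyncio.sleep(0)            # hypothetical network transfer
    cache.add(item)


async def play(cache, log, item):
    """Stand-in for the MS rendering an item, which must already be delivered."""
    assert item in cache, f"{item} was not delivered before playback"
    log.append(item)
    await asyncio.sleep(0.01)         # hypothetical playback time


async def fig3_session():
    cache, log = set(), []
    await fetch(cache, "meta1")                      # step 325
    await asyncio.gather(                            # steps 330 to 340
        play(cache, log, "meta1"),
        fetch(cache, "meta2"),
        fetch(cache, "msg1/block1"),
        fetch(cache, "msg1/block2"),
    )
    await play(cache, log, "msg1/block1")            # step 345
    # Step 350: the subscriber skips. meta2 is already in the cache,
    # so its playback can start without a new round trip.
    await asyncio.gather(                            # steps 355 to 365
        play(cache, log, "meta2"),
        fetch(cache, "meta3"),
        fetch(cache, "msg2/block1"),
        fetch(cache, "msg2/block2"),
    )
    await play(cache, log, "msg2/block1")            # step 370
    return log, cache
```

Running `asyncio.run(fig3_session())` returns the order in which items were played and the set of items delivered. The assertion inside `play` would fail if any item were requested for playback before it had been fetched, which is the property the timing diagram is meant to guarantee.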

In another embodiment, an intelligent download of audio blocks can be performed. For example, if a user has multiple audio files to download, such as a series of voice mail messages or a selection of songs from an MP3 download site, the intelligent download can incorporate aspects of the present invention. In this embodiment, the metadata of the first audio file is downloaded and, while it is rendered to the user, the next blocks are retrieved. If the block size is selected so that playback time exceeds the average download time, the entire audio file will eventually be downloaded while playback is still in progress. Aspects of the present invention build on this characteristic. In one embodiment, once the first file has been completely downloaded, the invention can operate to begin downloading a second file. The user can thus transition to the next file at any time without delay.

In another embodiment, which is best suited to, but not limited to, a voice mail environment, the block size is selected so that playback time exceeds the average download time. During the download of a first file, a block count is maintained. Once enough blocks have been downloaded to ensure that the remaining playback time exceeds the download time by at least a factor of 2, downloading of blocks of the second file begins. In a high-speed delivery network, this aspect of the invention can be applied in a cascading fashion so that portions of multiple files are downloaded simultaneously. Whether the user listens to the entire files sequentially, skips or deletes messages before listening to them completely, skips messages outright, or replays messages, the user experiences uninterrupted playback.
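
One possible reading of the factor-of-2 rule is sketched below. The description does not define precisely which playback time and which download time are compared, so this interpretation is an assumption: the remaining playback time of the first file is compared with the time still needed to download the rest of that file. The function name and parameters are hypothetical.

```python
def ready_for_next_file(total_blocks, downloaded, played,
                        play_s, avg_download_s, factor=2.0):
    """Return True when blocks of the second file may start downloading.

    total_blocks   -- number of blocks in the first file
    downloaded     -- block count maintained during the download
    played         -- blocks of the first file already played
    play_s         -- playback time of one block, in seconds
    avg_download_s -- average download time of one block, in seconds
    """
    remaining_playback = (total_blocks - played) * play_s
    remaining_download = (total_blocks - downloaded) * avg_download_s
    return remaining_playback >= factor * remaining_download
```

With 2-second blocks that take 1.5 seconds each to download, a 20-block file does not meet the condition when only the first two blocks have arrived, but does meet it once 10 blocks have arrived and 4 have been played.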

In intelligent download applications, the download strategy can change in response to user activity. For example, if the intelligent download is able to download multiple messages and the user jumps to another message, the set of messages being downloaded can be adjusted. Suppose, for example, that a user is listening to the playback of a first audio file, file 1. While the user listens, the remainder of the first audio file can be downloaded together with portions of the next N audio files. If the user then chooses to play message X, then once download of file X has been initiated and its playback has begun, download of the N audio files that follow it is initiated.
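
The adjustment of the download plan when the user jumps to message X can be sketched as a small planning function. The names `plan_downloads` and `replan_on_jump`, the zero-based message indices, and the `dropped` list are assumptions for illustration. The description does not say whether downloads already in progress are cancelled or allowed to finish, so `dropped` only reports which messages fall outside the new plan.

```python
def plan_downloads(current, n_ahead, total_messages):
    """Indices of the message being played plus the next `n_ahead` messages."""
    last = min(total_messages, current + n_ahead + 1)
    return list(range(current, last))


def replan_on_jump(old_plan, target, n_ahead, total_messages):
    """Return (new_plan, dropped) after the user jumps to message `target`."""
    new_plan = plan_downloads(target, n_ahead, total_messages)
    dropped = [m for m in old_plan if m not in new_plan]
    return new_plan, dropped
```

Because the target message is listed first in the new plan, its blocks are requested before those of the N messages that follow it, matching the order given in the paragraph above.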

In an exemplary, non-limiting embodiment, the size of an audio block is between 1 and 5 seconds. The present invention is applicable to, but not limited to, downloading audio, video and data. The invention can be used with a variety of file types, in a variety of formats, using a variety of delivery mechanisms and protocols.

The present invention has been described through detailed descriptions of embodiments that are provided by way of example and are not intended to limit the scope of the invention. The invention can be implemented as a method that runs in a variety of system environments or within an overall system comprising a variety of components. The described embodiments include different features, not all of which are required in all embodiments of the invention. Some embodiments of the invention utilize only some of the features or possible combinations of the features. Those skilled in the art can devise variations of the described embodiments, as well as embodiments comprising different combinations of the features noted in the described embodiments.
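
How a block's size in bytes relates to its playing time depends on the audio encoding, which the description does not specify. As a rough illustration only, the sketch below assumes G.711-style telephony audio at 8,000 bytes per second (8 kHz sampling, 8 bits per sample) and assumes that 16 kilobytes means 16 × 1024 bytes; the constant and function names are hypothetical.

```python
# Assumption: G.711-style audio, 8 kHz x 8-bit samples = 64 kbit/s.
G711_BYTES_PER_SECOND = 8000


def block_duration_s(block_bytes, bytes_per_second=G711_BYTES_PER_SECOND):
    """Playing time, in seconds, of a block of the given size."""
    return block_bytes / bytes_per_second


def block_bytes_for(seconds, bytes_per_second=G711_BYTES_PER_SECOND):
    """Block size, in bytes, that holds the given playing time."""
    return int(seconds * bytes_per_second)
```

Under these assumptions, the 16-kilobyte block mentioned in the description of FIG. 2 holds about 2 seconds of audio, and the 1 to 5 second range corresponds to blocks of 8,000 to 40,000 bytes. A different codec would change these figures proportionally.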

Claims

1. A method for seamlessly delivering voice messages to a subscriber of a voice mail system, the method comprising the steps of:

receiving a request to retrieve voice messages for a subscriber;

retrieving a first portion of a first voice message;

playing back the first portion of the first voice message;

while playing back the first portion of the first voice message, retrieving a next portion of the first voice message or a first portion of a second voice message; and

playing back the next portion; and

while playing back the next portion, retrieving a further next portion.

2. The method according to claim 1, wherein the first portion of the first voice message is metadata associated with the first voice message, and playing back the first portion of the first voice message further comprises converting the metadata into VXML and rendering the VXML to the subscriber.

3. The method according to claim 2, wherein the step of retrieving the next portion of the first voice message further comprises retrieving the first two blocks of the first voice message.

4. The method according to claim 3, wherein the step of retrieving the further next portion comprises retrieving a third block of the first voice message.

5. The method according to claim 1, further comprising the step of receiving a skip indicator while playing back the first portion of the first voice message.

6. The method according to claim 5, wherein, in response to receiving the skip indicator, playback of the first portion of the second voice message begins.

7. A method for seamlessly delivering voice messages to a subscriber of a voice mail system, the method comprising the steps of:

receiving a request to retrieve voice messages for a subscriber;

retrieving a first portion of a first voice message;

playing back the first portion of the first voice message;

while playing back the first portion of the first voice message, retrieving a next portion of the first voice message; and

playing back the next portion; and

while playing back the next portion, retrieving a further next portion of the first voice message.

8. The method according to claim 7, wherein the first portion of the first voice message is metadata associated with the first voice message, and playing back the first portion of the first voice message further comprises converting the metadata into VXML and rendering the VXML to the subscriber.

9. The method according to claim 8, wherein the step of retrieving a next portion of the first voice message comprises retrieving a first block and a second block of the first voice message.

10. The method according to claim 9, wherein the step of playing back the next portion further comprises playing back the first block of the first voice message.

11. The method according to claim 10, further comprising the steps of receiving an indicator to play back a different voice message and, in response to receiving the indicator, retrieving a first portion of the different voice message.

12. A distributed telecommunications system that provides a seamless telecommunications system from a plurality of geographically dispersed components, the distributed telecommunications system comprising:

a signaling gateway, the signaling gateway comprising:

a signaling interface to a telephone network; and

an interface to an IP network;

a media server, the media server:

comprising a circuit-switched interface for receiving and initiating telephone traffic over the telephone network;

comprising an interface to the IP network;

being operable to provide communication services over the circuit-switched interface; and

being able to receive command and response input over the telephone network;

an application server, the application server comprising an interface to the IP network and being operable, over the IP network, to:

receive and process the command and response input received by the media server; and

provide telecommunications service applications to the media server, in response to the media server invoking a communication service and in accordance with the received command and response input;

at least one central data and message store, the at least one central data and message store being operable to:

receive and store responses from the application server; and

provide configuration data to the application server, the configuration data being used in providing the telecommunications service applications;

wherein, in response to the media server receiving a command to play back voice messages for a subscriber:

the media server requests the application server to retrieve the voice messages from the central data and message store;

the application server retrieves a first portion of a first voice message and provides the first portion to the media server;

while the media server plays back the first portion of the first voice message, the application server retrieves a first block and a second block of the voice message and provides the first block and the second block of the voice message to the media server; and

while the media server plays back the first block of the voice message, the application server retrieves a next block of the first voice message and provides the next block of the first voice message to the media server.

13. The distributed telecommunications system according to claim 12, wherein, while the media server plays back the first portion of the first voice message, the application server is further operable to retrieve a first portion of a next voice message and to provide the first portion of the next voice message to the media server.

14. The distributed telecommunications system according to claim 13, wherein, in response to the media server receiving an indicator to play back the next voice message after playback of the first portion of the first voice message has begun, the media server is operable to stop playback of the currently active portion of the first voice message and to begin playback of the first portion of the next voice message.

15. The distributed telecommunications system according to claim 14, wherein, in response to the media server beginning playback of the first portion of the next voice message, the application server retrieves a first block and a second block of the next voice message.

16. The distributed telecommunications system according to claim 15, wherein, while the media server plays back the first block of the next voice message, the application server retrieves a third block of the next voice message and provides the third block of the next voice message to the media server.

17. A method for seamlessly delivering a plurality of voice messages to a subscriber of a voice mail system, the method comprising the steps of:

receiving a request to retrieve voice messages for a subscriber;

retrieving a first portion of the available voice mail messages;

beginning playback of the first portion of the voice mail messages;

while playing back the first portion of the voice mail messages, retrieving a next portion of the voice mail messages; and

upon completing playback of the first portion of a first voice mail message, if no further request has been received, beginning playback of a second portion of the first voice mail message; and

upon receiving a request to play back a next voice mail message, interrupting playback of the first portion of the first voice mail message and beginning playback of a first portion of the next voice mail message.

18. The method according to claim 17, wherein the first portion of the available voice mail messages comprises header information for one or more of the voice mail messages.

19. The method according to claim 18, wherein the step of beginning playback of the first portion of the voice mail messages comprises the steps of:

converting the header information of a first voice mail message into VXML; and

rendering the VXML to a subscriber.

20. The method according to claim 19, wherein the step of retrieving a next portion of the voice mail messages while playing back the first portion of the voice mail messages comprises retrieving at least a first block and a second block of the first voice mail message.