CN103281683A

CN103281683A - Method and device sending voice message

Info

Publication number: CN103281683A
Application number: CN2013102295436A
Authority: CN
Inventors: 阮良; 周兆春
Original assignee: Netease Hangzhou Network Co Ltd
Current assignee: Netease Hangzhou Network Co Ltd
Priority date: 2013-06-08
Filing date: 2013-06-08
Publication date: 2013-09-04
Anticipated expiration: 2033-06-08
Also published as: CN103281683B

Abstract

The invention provides a method and device sending a voice message. For instance, the method comprises responding to a corresponding trigger event, obtaining voice data by recording voices, carrying out voice recognition on the voice data to obtain a recognition text, matching the recognition text and contact person text information in a contact list, and sending the voice data to a receiving terminal determined by the contact person text information if the contact person text information matched with the recognition text is obtained. A user only needs an input operation to send the voice message to the corresponding receiving terminal by using the method. Compared with the prior art, the method simplifies the input operation of the user, and improves user experience. In addition, the invention provides a device sending the voice message.

Description

A kind of method and device that sends speech message

Technical field

Embodiments of the present invention relate to the speech message field, and more specifically, embodiments of the present invention relate to a kind of method and device that sends speech message.

Background technology

This part is intended to provide background or context for the embodiments of the present invention of stating in claims.Description herein can comprise the concept that can probe into, but the concept of having expected or having probed into not necessarily.Therefore, unless point out at this, otherwise for the application's specification and claims, be not prior art in the content of describing in this part, and not because be included in just admit it is prior art in this part.

In recent years, along with developing rapidly of mobile Internet, the mobile terminal is used and is emerged in large numbers like the mushrooms after rain, except the ground literal image information, also has mobile application characteristic functions such as speech message.Wherein speech message is a spotlight, is immediate communication is well replenished, and is subjected to liking of users deeply.When the user sent speech message, the user is selective reception end from address list earlier, opened the session window with receiving terminal again, clicked the recorded speech function, treated that voice recording finishes the back and click the transmission button, and the speech message of recording the most at last sends to receiving terminal.

Summary of the invention

But because prior art needs the user to carry out repeatedly different input operations and just can finish, complicated operating process has reduced user's experience.

The input operation complexity is very bothersome problem when therefore in the prior art, sending speech message.

For this reason, be starved of a kind of method of improved transmission speech message, to simplify user's input operation, improve the user and experience.

In the present context, the embodiments of the present invention expectation provides a kind of method and device that sends speech message.

In the first aspect of embodiment of the present invention, a kind of method that sends speech message is provided, for example, this method can be applied to client, can comprise: in response to corresponding trigger event, obtain speech data by recorded speech; Described speech data is carried out speech recognition, obtain the identification text; Contact person's text message in described identification text and the address list is mated; If obtain and contact person's text message of identifying text matches, described speech data be sent to the determined receiving terminal of this contact person's text message.

Alternatively, wherein said trigger event can comprise: the pre-set button control that is presented on touch-screen is pressed; Perhaps, two or more contacts are pressed also at touch-screen simultaneously and are stopped paddling behind one section track of paddling.

Alternatively, wherein in response to corresponding trigger event, obtaining speech data by recorded speech can comprise: during described pressing, and recorded speech; When detecting described when cancellation of pressing, stop to record, obtain the speech data of recording.

Alternatively, wherein described speech data being carried out speech recognition can comprise the pre-set interval of speech data is identified; After the contact person's text message in described identification text and the address list is mated, can also comprise: if obtain and contact person's text message of identifying text matches, recomputate pre-set interval, return the step that the pre-set interval of described speech data is identified.

Alternatively, if wherein obtain the contact person's text message with the identification text matches, described speech data is sent to the determined receiving terminal of this contact person's text message can be comprised: if obtain and contact person's text message of identifying text matches, send the prompting that whether sends voice; If receive according to the positive acknowledgement of described prompting feedback, described speech data be sent to the determined receiving terminal of this contact person's text message.

Alternatively, if wherein obtain the contact person's text message with the identification text matches, described speech data is sent to the determined receiving terminal of this contact person's text message can be comprised: if obtain and contact person's text message of identifying text matches, upload described speech data to media server; If upload success, the receiving media server is according to the download address of the speech data feedback of uploading; Send the new voice message notice to push server, described new voice message notice comprises the determined receiving terminal information of contact person's text message and the described download address of coupling, so that push server is notified the receiving terminal that is sent to described receiving terminal information correspondence with described new voice message, so that this receiving terminal is downloaded speech data according to described download address from media server.

Alternatively, wherein with described push server, media server between communication specifically be connected the safe lane of setting up by front end and communicate by letter.

Alternatively, wherein said upload successfully after, can also the receiving media server according to the voice identifier ID that is used for the unique identification speech data of the speech data feedback of uploading, so that comprise described voice ID in the described new voice message notice that sends to push server; And can also comprise: notify if receive the new voice message that comprises voice ID and download address from described push server, and do not download to the speech data of these voice ID sign, send the updating message that comprises these voice ID and do not receive voice messaging to described push server, do not receive voice so that described push server is updated to the voice status of these voice ID correspondence according to this updating message, and receive described push server according to the voice status of these voice ID correspondence prompting message for the download speech data of not receiving the voice respective feedback; If receive the new voice message notice that comprises voice ID and download address from described push server, and download to the speech data of these voice ID sign and do not listen to, send the updating message that comprises these voice ID and do not listen to information to described push server, do not listen to so that described push server is updated to the voice status of these voice ID correspondence according to this updating message, and receive described push server according to the voice status of these voice ID correspondence for not listening to the prompting message of listening to speech data of respective feedback; If receive the new voice message notice that comprises voice ID and download address from described push server, and download to the speech data of these voice ID sign and listened to, send the updating message that comprises these voice ID and listened to information to described push server, listen to so that described push server is updated to the voice status of these voice ID correspondence according to this updating message, and make described push server send the prompting message of having listened to speech data to the transmitting terminal that sends this new voice message notice.

Alternatively, wherein said upload successfully after, can also the receiving media server according to the voice ID of the logos sound data of the speech data feedback of uploading, so that the described new voice message that sends to push server also comprises described voice ID in notifying, also make described push server from described new voice message notice, extract voice ID, preserve the corresponding relation of the transmitting terminal of this new voice message notice of voice ID and transmission; And can also comprise: if receive new voice message notice from described push server, and download to the speech data of the voice ID sign that this voice messaging notice comprises and listened to, judge also whether the reply voice event is triggered; If obtain the reply voice data by recorded speech; Upload described reply voice data to media server; If upload success, the receiving media server is according to the download address of the reply voice data feedback of uploading; Send a reply the voice messaging notice to push server, described reply voice information notice comprises the download address of described voice ID and media server feedback, so that described push server inquires these voice ID corresponding sending terminal, the download address that reply voice information notice is comprised is sent to these voice ID corresponding sending terminal, so that described transmitting terminal is downloaded described reply voice data from this download address of media server.

In the second aspect of embodiment of the present invention, a kind of device that sends speech message is provided, and for example, this device can be disposed at client device, can comprise: obtain voice unit: can dispose in response to corresponding trigger event, obtain speech data by recorded speech; Recognition unit: can dispose for described speech data is carried out speech recognition, obtain the identification text; Matching unit: can dispose for the contact person's text message with described identification text and address list and mate; Transmitting element: if can dispose for obtaining and contact person's text message of identifying text matches, described speech data is sent to the determined receiving terminal of this contact person's text message.

Alternatively, the wherein said voice unit that obtains: configuration is for the trigger event that is pressed in response to the pre-set button control that is presented on touch-screen; Perhaps, in response to two or more contacts simultaneously touch-screen press and one section track of paddling after stop the trigger event of paddling.

Alternatively, the wherein said voice unit that obtains: can dispose for during described pressing recorded speech; When detecting described when cancellation of pressing, stop to record, obtain the speech data of recording.

Alternatively, wherein said recognition unit: can dispose for the pre-set interval to speech data and identify; Described matching unit: if can also dispose for not obtaining and contact person's text message of identifying text matches, recomputate pre-set interval, trigger recognition unit again and carry out.

Alternatively, wherein said transmitting element: if can dispose for obtaining and contact person's text message of identifying text matches, send the prompting that whether sends voice; If receive according to the positive acknowledgement of described prompting feedback, described speech data be sent to the determined receiving terminal of this contact person's text message.

Alternatively, wherein said transmitting element: if can dispose for obtaining and contact person's text message of identifying text matches, upload described speech data to media server; If upload success, the receiving media server is according to the download address of the speech data feedback of uploading; Send the new voice message notice to push server, described new voice message notice comprises the determined receiving terminal information of contact person's text message and the described download address of coupling, so that push server is sent to the determined receiving terminal of described receiving terminal information with described download address, so that this receiving terminal is downloaded speech data according to described download address from media server.

Alternatively, wherein said transmitting element: can dispose for connecting the safe lane of setting up by front end and communicate by letter with described push server, media server.

Alternatively, wherein said transmitting element: can also dispose for the voice identifier ID that be used for unique identification speech data of receiving media server according to the speech data feedback of uploading, so that the described new voice message that sends to push server comprises described voice ID in notifying; And can also comprise: receiving element: if can dispose for receive the new voice message notice from described push server, and do not download to the speech data that this new voice message is notified the voice ID sign that comprises, send the updating message that comprises these voice ID and do not receive voice messaging to described push server, so that described push server is updated to the voice status of these voice ID correspondence according to this updating message the prompting message of the download speech data of not receiving the voice respective feedback; If receive the new voice message notice from described push server, and download to the speech data of the voice ID sign that new voice message notice comprises and do not listen to, send the updating message that comprises these voice ID and do not listen to information to described push server, do not listen to so that described push server is updated to the voice status of these voice ID correspondence according to this updating message, and receive described push server according to the voice status of these voice ID correspondence for not listening to the prompting message of listening to speech data of respective feedback; If receive the new voice message notice from described push server, and download to the speech data of the voice ID sign that this voice messaging notice comprises and listened to, send the updating message that comprises these voice ID and listened to information to described push server, listen to so that described push server is updated to the voice status of these voice ID correspondence according to this updating message, and make described push server send the prompting message of having listened to speech data to the transmitting terminal that sends this new voice message notice.

Alternatively, wherein said transmitting element: can also dispose for the voice ID of receiving media server according to the logos sound data of the speech data feedback of uploading, so that the described new voice message that sends to push server also comprises described voice ID in notifying, also make described push server from described new voice message notice, extract voice ID, preserve the corresponding relation of the transmitting terminal of this new voice message notice of voice ID and transmission; And can also comprise: receiving element: if can dispose for receive the new voice message notice from described push server, and download to the speech data of the voice ID sign that this voice messaging notice comprises and listened to, judge also whether the reply voice event is triggered, if, obtain the reply voice data by recorded speech, upload described reply voice data to media server, if upload success, the receiving media server is according to the download address of the reply voice data feedback of uploading, send a reply the voice messaging notice to push server, described reply voice information notice comprises the download address of described voice ID and media server feedback, so that described push server inquires these voice ID corresponding sending terminal, the download address that reply voice information notice is comprised is sent to these voice ID corresponding sending terminal, so that described transmitting terminal is downloaded described reply voice data from this download address of media server.

In the third aspect of embodiment of the present invention, a kind of method that sends speech message is provided, for example, this method can be applied to the push server end, can comprise: receive the new voice message notice from transmitting terminal, described new voice message notice comprises receiving terminal information and download address, wherein, described receiving terminal information and download address are obtained by forwarding step by described transmitting terminal, wherein, described forwarding step comprises: in response to corresponding trigger event, obtain speech data by recorded speech, described speech data is identified, obtain the identification text, contact person's text message in described identification text and the address list is mated, if obtain and contact person's text message of identifying text matches, upload described speech data to media server, if upload success, the receiving media server is determined receiving terminal information according to the download address of the speech data feedback of uploading according to contact person's text message of coupling; Described download address is sent to the receiving terminal of receiving terminal information correspondence, so that this receiving terminal is downloaded speech data according to described new voice message notice from the described download address of media server.

In the fourth aspect of embodiment of the present invention, a kind of device that sends speech message is provided, for example, this device can be disposed at push server end equipment, can comprise: the reception notification unit: can dispose for receive the new voice message notice from transmitting terminal, described new voice message notice comprises receiving terminal information and download address, wherein, described receiving terminal information and download address are obtained by forwarding step by described transmitting terminal, wherein, described forwarding step comprises: in response to corresponding trigger event, obtain speech data by recorded speech, described speech data is identified, obtain the identification text, contact person's text message in described identification text and the address list is mated, if obtain and contact person's text message of identifying text matches, upload described speech data to media server, if upload success, the receiving media server is determined receiving terminal information according to the download address of the speech data feedback of uploading according to contact person's text message of coupling; Retransmission unit: can dispose for the receiving terminal that described download address is sent to receiving terminal information correspondence, so that this receiving terminal is downloaded speech data according to described new voice message notice from the described download address of media server.

Embodiment of the present invention the 5th aspect in, a kind of method that sends speech message is provided, for example, this method can be applied to the media server end, can comprise: the speech data that receiving end/sending end is uploaded, wherein, described speech data is uploaded by forwarding step by transmitting terminal, and wherein, described forwarding step comprises: in response to corresponding trigger event, obtain speech data by recorded speech, described speech data is identified, obtained the identification text, the contact person's text message in described identification text and the address list is mated, if obtain and contact person's text message of identifying text matches, upload described speech data to media server; If described the reception successfully, feed back download address according to the speech data that receives to transmitting terminal, so that described transmitting terminal sends the new voice message notice to push server, described new voice message notice comprises receiving terminal information and the download address of determining according to contact person's text message of coupling.

Embodiment of the present invention the 6th aspect in, a kind of device that sends speech message is provided, for example, this device can be disposed at media server end equipment, can comprise: receive uploading unit: can dispose the speech data of uploading for receiving end/sending end, wherein, described speech data is uploaded by forwarding step by transmitting terminal, wherein, described forwarding step comprises: in response to corresponding trigger event, obtain speech data by recorded speech, described speech data is identified, obtained the identification text, the contact person's text message in described identification text and the address list is mated, if obtain and contact person's text message of identifying text matches, upload described speech data to media server; Feedback unit: receive successfully if can dispose for described, feed back download address according to the speech data that receives to transmitting terminal, so that described transmitting terminal sends the new voice message notice to push server, described new voice message notice comprises receiving terminal information and the download address of determining according to contact person's text message of coupling.

Method and apparatus according to the transmission speech message of embodiment of the present invention, when carrying out a trigger action, the user (as presses specific button, or two fingers or three refer to press behind the touch-screen paddling) time, namely in response to corresponding trigger event, automatically perform the recorded speech function, obtain speech data, speech data is carried out speech recognition, obtain the identification text, the step that contact person's text message in the identification textview field address list is mated, in the speech data of recording, comprise contact person's text message in the address list (as name, during telephone number etc.) voice, can obtain and contact person's text message of identifying text matches, and then can automatically speech data be sent to the determined receiving terminal of this contact person's text message, it is the receiving terminal of speech message, therefore, the user only needs an input operation can send speech message to corresponding receiving terminal, compared with prior art, simplify user's input operation, improved user's experience.

Summary of the invention

The inventor finds, because prior art need manually be selected the speech message receiving terminal by the user, needs the user to send in this one of four states in selective reception end, voice recording, affirmation again and switches, and therefore causes user's input operation complexity.

The inventor finds again, because people's speech habits, the relevant information that in speech message, has comprised receiving terminal usually, and can access the identification text by speech data being carried out speech recognition, if the contact person's text message that will identify in text and the address list mates, then can determine the concurrent sending voice message of receiving terminal automatically, need not the manual selective reception end of user, also just need not the user and in states such as selective reception end, voice recording, switch, can simplify user's input operation.

After having introduced basic principle of the present invention, following mask body is introduced various non-limiting execution mode of the present invention.

The application scenarios overview

At first with reference to figure 2, as shown in Figure 2, the adaptable scene of embodiment of the present invention can be for comprising the voice message transmission system of transmitting terminal 201 and receiving terminal 202, and wherein transmitting terminal 201 and receiving terminal 202 can be intelligent terminals such as smart mobile phone or IPAD.

One of illustrative methods

Below in conjunction with the application scenarios of Fig. 2, be described with reference to Figure 3 one of method according to the transmission speech message of exemplary embodiment of the invention.It should be noted that above-mentioned application scenarios only is to illustrate for the ease of understanding spirit of the present invention and principle, embodiments of the present invention are unrestricted in this regard.On the contrary, embodiments of the present invention any scene that can be applied to be suitable for.

Referring to Fig. 3, one of a kind of method flow schematic diagram that sends speech message that provides for the embodiment of the invention, the method that this embodiment provides can be applied to client, and as shown in the figure, this method can comprise:

S310, in response to corresponding trigger event, obtain speech data by recorded speech;

For example, this trigger event can comprise:

The pre-set button control that is presented on touch-screen is pressed; Perhaps, two or more contacts are pressed also at touch-screen simultaneously and are stopped paddling behind one section track of paddling, for example, the user can use two refer to or three refer to touch-screen press and one section track of paddling after stop paddling, wherein the track of paddling can be redefined for tracks such as arc, straight line according to demand.

According to this trigger event, can be during pressing, recorded speech when detecting described when cancellation of pressing, stops to record, and then obtains the speech data recorded.

Certainly, be not limited to above-mentioned trigger event, for example, the mechanical key that can comprise input equipment is pressed and waits other trigger events, and embodiments of the present invention are unrestricted in this regard, can implement needs according to reality corresponding trigger event is set.

And, also be not limited to the implementation of this a kind of recorded speech of recorded speech during pressing, for example, recorded speech in can the default duration after corresponding trigger event takes place, the voice of recording can adopt AAC(Advanced Audio Coding, Advanced Audio Coding) speech coding.Embodiments of the present invention are unrestricted in this regard.

S320, described speech data is carried out speech recognition, obtain the identification text;

Wherein, speech data is carried out speech recognition, for example, can preserve the pronunciation dictionary that obtains by following steps in advance, comprise: the speech data that presets text is carried out preliminary treatment (can comprise the voice signal sampling, the anti aliasing bandpass filtering, remove individual pronunciation difference, the noise effect that equipment and environment cause etc.), extraction preset text speech data feature (for example, extract the parameters,acoustic of reflection substantive characteristics in the speech data, as average energy, on average stride zero rate, formant etc.), the acoustic model of text is preset in foundation (for example can be by the more same repeatedly repetition voice that preset text, from the raw tone sample, remove redundant information, keep critical data, again according to certain rule to data cluster in addition, formation pattern storehouse), setting up the language model preset text (for example, can adopt and comprise regular language, context-free grammar is at interior various language models), obtain pronunciation dictionary by the mapping of setting up between acoustic model modeling and the language model.

When speech data is identified, can carry out preliminary treatment to speech data as above-mentioned method, extract feature, with feature mating in pronunciation dictionary of extracting (as, according to certain rule, as distance measure, word-building rule, syntax rule, semantic rules etc., calculate the similarity between input feature vector and the stock's pattern, as matching distance, likelihood probability etc.), with similarity the highest and surpass threshold value preset text as the identification text of speech data.

And, can identify the full text of speech data, obtain speech data identification text in full, also can the pre-set interval of speech data be identified according to people's speech habits, obtain the identification text of pre-set interval.

Wherein, described pre-set interval can be set to the receiving terminal relevant information the higher interval of possibility occurs in speech data.After mating, if obtain and contact person's text message of identifying text matches, can also recomputate pre-set interval, return the step that the pre-set interval of speech data is identified, to improve the success rate of automatic transmission speech message.For example, described pre-set interval can be one section interval that speech data begins when initial, recomputating pre-set interval can comprise the steps: to extract in the speech data data between per two adjacent speech pause positions and be each interval to be selected, the length of an interval degree to be selected that extracts and contact name or the possible length of telephone number are compared, and the interval to be selected near the possible length of contact name or telephone number is set to pre-set interval.Certainly, other methods that recomputates pre-set interval can also be arranged, the present invention is unrestricted in this regard.

S330, the contact person's text message in described identification text and the address list is mated;

For example, can from the local address list of the portable terminal of using this method embodiment, extract contact person's text message, as relevant informations such as name, phones.

When mating, if certain the contact person's text message in identification text and the address list mates fully, can determine that this contact person's text message is and contact person's text message of identifying text matches;

If not and contact person's text message of mating fully of identification text, can also be when mating, calculate the matching degree of each contact person's text message, with contact person's text message that wherein matching degree is the highest as with contact person's text message of this identification text matches.

If S340 obtains the contact person's text message with the identification text matches, described speech data is sent to the determined receiving terminal of this contact person's text message.

In order to send speech message receiving terminal extremely accurately, before sending, can also send whether confirm to send speech message to the prompting of this receiving terminal to the user, if receive that the user according to the positive acknowledgement of this prompting feedback, is sent to the determined receiving terminal of this contact person's text message with described speech data.

If receive that the user according to the Negative Acknowledgement of this prompting feedback, then can abandon the transmission of this time speech message.

Wherein, speech data is sent to the determined receiving terminal of this contact person's text message, can sends by following transit server, for example:

If obtain and contact person's text message of identifying text matches, upload described speech data to media server;

If upload success, the receiving media server is according to the download address of the speech data feedback of uploading;

Send the new voice message notice to push server, described new voice message notice comprises the determined receiving terminal information of contact person's text message and the described download address of coupling, so that described push server is sent to the receiving terminal of described receiving terminal information correspondence with described download address, so that this receiving terminal is downloaded described speech data according to described download address from media server.

Wherein, use the transmitting terminal of this method embodiment and receiving terminal and can be connected safe lane such as HTTPS (the Hypertext Transfer Protocol over Secure Socket Layer that sets up by front end with communication between described push server and the media server, HTML (Hypertext Markup Language) is in security socket layer) communication, to ensure information security.For example, when transmitting terminal sends the new voice message notice to push server, can send the new voice message notice to push server by the safe lane HTTPS between transmitting terminal and the server.Receiving terminal also can receive the download address of speech data by HTTPS, downloads speech message by HTTPS one by one from media server.

And, transmitting terminal is uploaded the speech data success to media server after, can also the receiving media server according to the voice identifier ID that is used for the unique identification speech data of the speech data feedback of uploading, when sending the new voice message notice to push server, can comprise voice ID in the described new voice message notice, so that described push server is preserved the corresponding relation of voice ID and voice status, wherein when the receiving terminal of described receiving terminal information correspondence does not receive the speech data of these voice ID sign, described voice status is not for receiving voice, when the receiving terminal of described receiving terminal information correspondence receives the speech data of these voice ID sign and when not listening to, described voice status is not for listening to, when the receiving terminal of described receiving terminal information correspondence had been listened to the speech data of these voice ID sign, described voice status was for listening to.

For example, the described method of present embodiment can also comprise:

If receive the new voice message notice that comprises voice ID and download address from described push server, and do not download to the speech data of these voice ID sign, send the updating message that comprises these voice ID and do not receive voice messaging to described push server, do not receive voice so that described push server is updated to the voice status of these voice ID correspondence according to this updating message, and receive described push server according to the voice status of these voice ID correspondence prompting message for the download speech data of not receiving the voice respective feedback;

If receive the new voice message notice that comprises voice ID and download address from described push server, and download to the speech data of these voice ID sign and do not listen to, send the updating message that comprises these voice ID and do not listen to information to described push server, do not listen to so that described push server is updated to the voice status of these voice ID correspondence according to this updating message, and receive described push server according to the voice status of these voice ID correspondence for not listening to the prompting message of listening to speech data of respective feedback;

If receive the new voice message notice that comprises voice ID and download address from described push server, and download to the speech data of these voice ID sign and listened to, send the updating message that comprises these voice ID and listened to information to described push server, listen to so that described push server is updated to the voice status of these voice ID correspondence according to this updating message, and make described push server send the prompting message of having listened to speech data to the transmitting terminal that sends this new voice message notice.

For example, the form of above-mentioned updating message for example can be id1-status1, wherein many updating message can be separated as id1-status1#id2-status2... with #, id represents voice ID, status represents voice status information, and for example the status value is 0 o'clock, and expression does not receive that voice, value are at 1 o'clock, expression has been received and has not been listened to, value is 3 o'clock, and expression is listened to.

As seen, by preserving the state of speech data correspondence in push server, be conducive to the state that transmitting terminal user and receiving terminal user in time understand speech message, improve the user and experience.

And after transmitting terminal sent the new voice message notice to push server, if make receiving terminal uppick voice, described receiving terminal can also be replied, and specific implementation for example can comprise:

Wherein said upload successfully after, also the receiving media server is according to the voice ID of the logos sound data of the speech data feedback of uploading, so that the described new voice message that sends to push server also comprises described voice ID in notifying, also make described push server from described new voice message notice, extract voice ID, preserve the corresponding relation of the transmitting terminal of this new voice message notice of voice ID and transmission;

And also comprise:

If receive new voice message notice from described push server, and download to the speech data of the voice ID sign that this voice messaging notice comprises and listened to, judge also whether the reply voice event is triggered;

If obtain the reply voice data by recorded speech;

Upload described reply voice data to media server;

If upload success, the receiving media server is according to the download address of the reply voice data feedback of uploading;

Send a reply the voice messaging notice to push server, described reply voice information notice comprises the download address of described voice ID and media server feedback, so that described push server inquires these voice ID corresponding sending terminal, the download address that reply voice information notice is comprised is sent to these voice ID corresponding sending terminal, so that described transmitting terminal is downloaded described reply voice data from this download address of media server.

Use the method that the above embodiment of the present invention provides, by speech data is carried out speech recognition, contact person's text message in identification text and the address list is mated, make sending speech message can be reduced to after the user carries out trigger action, as long as comprise the receiving terminal relevant information in the speech data of recording, just can send speech message automatically to receiving terminal, simplify user's operation, improve user's experience.

One of exemplary means

After one of method of having introduced exemplary embodiment of the invention, next, be introduced with reference to one of device of 4 pairs of transmission speech messages corresponding with one of above-mentioned illustrative methods of figure.

Referring to Fig. 4, one of a kind of apparatus structure schematic diagram that sends speech message that provides for the embodiment of the invention, the device that this embodiment provides can be disposed at client device, and as shown in the figure, this device can comprise:

Obtain voice unit 410: can dispose in response to corresponding trigger event, obtain speech data by recorded speech;

According to two kinds of trigger event possible implementations, for example, obtain voice unit 410: can dispose the trigger event that is pressed in response to the pre-set button control that is presented on touch-screen; Perhaps, in response to two or more contacts simultaneously touch-screen press and one section track of paddling after stop the trigger event of paddling.

According to above-mentioned trigger event, the described voice unit 410 that obtains: can dispose for during described pressing recorded speech; When detecting described when cancellation of pressing, stop to record, obtain the speech data of recording.

Recognition unit 420: can dispose for described speech data is carried out speech recognition, obtain the identification text;

Wherein, this recognition unit 420: can dispose for speech data is carried out speech recognition, can identify the full text of speech data, obtain speech data identification text in full; Also can be according to people's speech habits, the higher identified region of possibility appears in default receiving terminal relevant information, be pre-set interval in one section zone of the beginning of speech data for example, the pre-set interval of speech data is identified, obtain the identification text of speech data pre-set interval.

Matching unit 430: can dispose for the contact person's text message with described identification text and address list and mate;

When the pre-set interval of 420 pairs of speech datas of recognition unit is carried out speech recognition, described matching unit 430: if can also dispose for not obtaining and contact person's text message of identifying text matches, recomputate pre-set interval, trigger recognition unit again and carry out.

Transmitting element 440: if can dispose for obtaining and contact person's text message of identifying text matches, described speech data is sent to the determined receiving terminal of this contact person's text message.

In order to send speech message receiving terminal extremely accurately, before sending, can also send whether confirm to send speech message to the prompting of this receiving terminal to the user, for example, wherein said transmitting element 440: if can dispose for obtaining and contact person's text message of identifying text matches, send the prompting that whether sends voice; If receive according to the positive acknowledgement of described prompting feedback, described speech data be sent to the determined receiving terminal of this contact person's text message.

Wherein, speech data is sent to the determined receiving terminal of this contact person's text message, can sends by transit server, for example:

Wherein said transmitting element 440: if can dispose for obtaining and contact person's text message of identifying text matches, upload described speech data to media server; If upload success, the receiving media server is according to the download address of the speech data feedback of uploading; Send the new voice message notice to push server, described new voice message notice comprises the determined receiving terminal information of contact person's text message and the described download address of coupling, so that push server is sent to the receiving terminal of described receiving terminal information correspondence with described download address, so that this receiving terminal is downloaded speech data according to described download address from media server.

In order to ensure information security wherein said transmitting element 440: can dispose for connecting the safe lane of setting up by front end and communicate by letter with described push server, media server.

In order to make transmitting terminal user and receiving terminal user in time understand the state of speech message, improving the user experiences, the transmitting element 440 of apparatus of the present invention embodiment: can also dispose for the voice identifier ID that be used for unique identification speech data of receiving media server according to the speech data feedback of uploading, so that the described new voice message that sends to push server comprises described voice ID in notifying;

And can also comprise:

Receiving element 450: if can dispose for receive the new voice message notice from described push server, and do not download to the speech data that this new voice message is notified the voice ID sign that comprises, send the updating message that comprises these voice ID and do not receive voice messaging to described push server, so that described push server is updated to the voice status of these voice ID correspondence according to this updating message the prompting message of the download speech data of not receiving the voice respective feedback; If receive the new voice message notice from described push server, and download to the speech data of the voice ID sign that new voice message notice comprises and do not listen to, send the updating message that comprises these voice ID and do not listen to information to described push server, do not listen to so that described push server is updated to the voice status of these voice ID correspondence according to this updating message, and receive described push server according to the voice status of these voice ID correspondence for not listening to the prompting message of listening to speech data of respective feedback; If receive the new voice message notice from described push server, and download to the speech data of the voice ID sign that this voice messaging notice comprises and listened to, send the updating message that comprises these voice ID and listened to information to described push server, listen to so that described push server is updated to the voice status of these voice ID correspondence according to this updating message, and make described push server send the prompting message of having listened to speech data to the transmitting terminal that sends this new voice message notice.

Consider that the receiving terminal that has disposed the device that the embodiment of the invention provides is after receive speech message, the demand that also has reply voice, the described transmitting element 440 of the device that the embodiment of the invention provides: can also dispose for the voice ID of receiving media server according to the logos sound data of the speech data feedback of uploading, so that the described new voice message that sends to push server also comprises described voice ID in notifying, also make described push server from described new voice message notice, extract voice ID, preserve the corresponding relation of the transmitting terminal of this new voice message notice of voice ID and transmission;

And can also comprise:

Receiving element 450: if can dispose for receive the new voice message notice from described push server, and download to the speech data of the voice ID sign that this voice messaging notice comprises and listened to, judge also whether the reply voice event is triggered, if, obtain the reply voice data by recorded speech, upload described reply voice data to media server, if upload success, the receiving media server is according to the download address of the reply voice data feedback of uploading, send a reply the voice messaging notice to push server, described reply voice information notice comprises the download address of described voice ID and media server feedback, so that described push server inquires these voice ID corresponding sending terminal, the download address that reply voice information notice is comprised is sent to these voice ID corresponding sending terminal, so that described transmitting terminal is downloaded described reply voice data from this download address of media server.

Use the device that the above embodiment of the present invention provides, can carry out speech recognition to speech data by recognition unit, the contact person's text message that to be identified in text and the address list by matching unit mates, make sending speech message can be reduced to after the user carries out trigger action, as long as comprise the receiving terminal relevant information in the speech data of recording, just can send speech message automatically to receiving terminal by transmitting element, simplify user's operation, improve user's experience.

Two of illustrative methods

Describe according to two of the method for the transmission speech message of exemplary embodiment of the invention below with reference to Fig. 5.

Referring to Fig. 5, two of a kind of method flow schematic diagram that sends speech message that provides for the embodiment of the invention, the method that this embodiment provides can be applied to the push server end, and as shown in the figure, this method can comprise:

S510, receive the new voice message notice from transmitting terminal, described new voice message notice comprises receiving terminal information and download address, wherein, described receiving terminal information and download address are obtained by forwarding step by described transmitting terminal, wherein, described forwarding step comprises: in response to corresponding trigger event, obtain speech data by recorded speech, described speech data is identified, obtain the identification text, contact person's text message in described identification text and the address list is mated, if obtain and contact person's text message of identifying text matches, upload described speech data to media server, if upload success, the receiving media server is determined receiving terminal information according to the download address of the speech data feedback of uploading according to contact person's text message of coupling;

S520, described download address is sent to the receiving terminal of receiving terminal information correspondence, so that this receiving terminal is downloaded speech data according to described new voice message notice from the described download address of media server.

Use the method for the above embodiment of the present invention, make push server to receive the new voice message notice from transmitting terminal, wherein, described new voice message notice comprises receiving terminal information and download address, wherein, after can being identified the speech data of recording by described transmitting terminal, obtain to determine receiving terminal information with contact person's text message of identification text matches from address list, after uploading described speech data, media server obtains download address by described transmitting terminal, thereby push server can be sent to described download address the receiving terminal of receiving terminal information correspondence, make this receiving terminal download speech data according to described new voice message notice from the described download address of media server, make sending speech message can be reduced to after the transmitting terminal user carries out trigger action, as long as comprise the receiving terminal relevant information in the speech data of recording, just can send speech message automatically to receiving terminal by push server, simplify user's operation, improved user's experience.

Two of exemplary means

The method of having introduced exemplary embodiment of the invention two after, next, be introduced with reference to two of the device of 6 pairs of transmission speech messages corresponding with two of above-mentioned illustrative methods of figure.

Referring to Fig. 6, two of a kind of apparatus structure schematic diagram that sends speech message that provides for the embodiment of the invention, the device that this embodiment provides can be disposed at push server end equipment, and as shown in the figure, this device can comprise:

Reception notification unit 610: can dispose for receive the new voice message notice from transmitting terminal, described new voice message notice comprises receiving terminal information and download address, wherein, described receiving terminal information and download address are obtained by forwarding step by described transmitting terminal, wherein, described forwarding step comprises: in response to corresponding trigger event, obtain speech data by recorded speech, described speech data is identified, obtain the identification text, contact person's text message in described identification text and the address list is mated, if obtain and contact person's text message of identifying text matches, upload described speech data to media server, if upload success, the receiving media server is determined receiving terminal information according to the download address of the speech data feedback of uploading according to contact person's text message of coupling;

Retransmission unit 620: can dispose for the receiving terminal that described download address is sent to receiving terminal information correspondence, so that this receiving terminal is downloaded speech data according to described new voice message notice from the described download address of media server.

Three of illustrative methods

Describe according to three of the method for the transmission speech message of exemplary embodiment of the invention below with reference to Fig. 7.

Referring to Fig. 7, three of a kind of method flow schematic diagram that sends speech message that provides for the embodiment of the invention, the method that this embodiment provides can be applied to the media server end, and as shown in the figure, this method can comprise:

The speech data that S710, receiving end/sending end are uploaded, wherein, described speech data is uploaded by forwarding step by transmitting terminal, and wherein, described forwarding step comprises: in response to corresponding trigger event, obtain speech data by recorded speech, described speech data is identified, obtained the identification text, the contact person's text message in described identification text and the address list is mated, if obtain and contact person's text message of identifying text matches, upload described speech data to media server;

Receive successfully if S720 is described, feed back download address according to the speech data that receives to transmitting terminal, so that described transmitting terminal sends the new voice message notice to push server, described new voice message notice comprises receiving terminal information and the download address of determining according to contact person's text message of coupling.

Use the method that the above embodiment of the present invention provides, make media server to receive the speech data of uploading from transmitting terminal, wherein, after described speech data is identified the speech data of recording by described transmitting terminal, obtain to determine to upload after the receiving terminal information with contact person's text message of identification text matches from address list, thereby media server can feed back to receiving terminal with the speech data of described download address according to the request of receiving terminal, make sending speech message can be reduced to after the transmitting terminal user carries out trigger action, as long as comprise the receiving terminal relevant information in the speech data of recording, just can send speech message automatically to receiving terminal by push server, simplify user's operation, improved user's experience.

Three of exemplary means

The method of having introduced exemplary embodiment of the invention three after, next, be introduced with reference to three of the device of 8 pairs of transmission speech messages corresponding with three of above-mentioned illustrative methods of figure.

Referring to Fig. 8, three of a kind of apparatus structure schematic diagram that sends speech message that provides for the embodiment of the invention, the device that this embodiment provides can be disposed at media server end equipment, and as shown in the figure, this device can comprise:

Receive uploading unit 810: can dispose the speech data of uploading for receiving end/sending end, wherein, described speech data is uploaded by forwarding step by transmitting terminal, wherein, described forwarding step comprises: in response to corresponding trigger event, obtain speech data by recorded speech, described speech data is identified, obtain the identification text, contact person's text message in described identification text and the address list is mated, if obtain and contact person's text message of identifying text matches, upload described speech data to media server;

Feedback unit 820: receive successfully if can dispose for described, feed back download address according to the speech data that receives to transmitting terminal, so that described transmitting terminal sends the new voice message notice to push server, described new voice message notice comprises receiving terminal information and the download address of determining according to contact person's text message of coupling.

Although should be noted that and mention the some unit that send the device of speech message in above-detailed, this division only is not enforceable.In fact, according to the embodiment of the present invention, the feature of above-described two or more unit and function can be specialized in a unit.Otherwise the feature of an above-described unit and function can further be divided into by a plurality of unit to be specialized.

In addition, although described the operation of the inventive method in the accompanying drawings with particular order,, this is not that requirement or hint must be carried out these operations according to this particular order, or the operation shown in must carrying out all could realize the result of expectation.On the contrary, the step of describing in the flow chart can change execution sequence.Additionally or alternatively, can omit some step, a plurality of steps be merged into a step carry out, and/or a step is decomposed into a plurality of steps carries out.

The verb of mentioning in the application documents " comprises ", those elements or the element the step or the existence of putting down in writing of step do not got rid of in " comprising " and paradigmatic use thereof in application documents.The existence that article " " before the element or " one " do not get rid of a plurality of this elements.

Though described spirit of the present invention and principle with reference to some embodiments, but should be appreciated that, the present invention is not limited to disclosed embodiment, division to each side does not mean that the feature in these aspects can not make up to be benefited yet, and this division only is the convenience in order to explain.The present invention is intended to contain interior included various modifications and the equivalent arrangements of spirit and scope of claims.The scope of claims meets the most wide in range explanation, thereby comprises all such modifications and equivalent structure and function.

Description of drawings

By reading detailed description hereinafter with reference to the accompanying drawings, above-mentioned and other purposes of exemplary embodiment of the invention, the feature and advantage easy to understand that will become.In the accompanying drawings, show some execution modes of the present invention in exemplary and nonrestrictive mode, wherein:

Fig. 1 schematically shows the block diagram of the exemplary computer system 100 that is suitable for realizing embodiment of the present invention;

Fig. 2 schematically shows the application scenarios according to the embodiment of the invention;

Fig. 3 schematically shows one of method flow schematic diagram according to the embodiment of the invention;

Fig. 4 schematically shows one of apparatus structure schematic diagram according to the embodiment of the invention;

Fig. 5 schematically shows according to two of the method flow schematic diagram of the embodiment of the invention;

Fig. 6 schematically shows according to two of the apparatus structure schematic diagram of the embodiment of the invention;

Fig. 7 schematically shows according to three of the method flow schematic diagram of the embodiment of the invention;

Fig. 8 schematically shows according to three of the apparatus structure schematic diagram of the embodiment of the invention;

In the accompanying drawings, identical or corresponding label is represented identical or corresponding part.

Embodiment

Below with reference to some illustrative embodiments principle of the present invention and spirit are described.Should be appreciated that providing these execution modes only is for those skilled in the art can being understood better and then realize the present invention, and be not to limit the scope of the invention by any way.On the contrary, it is in order to make the disclosure thorough more and complete that these execution modes are provided, and the scope of the present disclosure intactly can be conveyed to those skilled in the art.

Fig. 1 shows the block diagram of the exemplary computer system 100 that is suitable for realizing embodiment of the present invention.As shown in Figure 1, computing system 100 can comprise: CPU (CPU) 101, random-access memory (ram) 102, read-only memory (ROM) 103, system bus 104, hard disk controller 105, keyboard controller 106, serial interface controller 107, parallel interface controller 108, display controller 109, hard disk 110, keyboard 111, serial external equipment 112, parallel external equipment 113 and display 114.In these equipment, with system bus 104 coupling CPU101, RAM102, ROM103, hard disk controller 105, keyboard controller 106, serialization controller 107, parallel controller 108 and display controller 109 arranged.Hard disk 110 and hard disk controller 105 couplings, keyboard 111 and keyboard controller 106 couplings, serial external equipment 112 and serial interface controller 107 couplings, parallel external equipment 113 and parallel interface controller 108 couplings, and display 114 and display controller 109 couplings.Should be appreciated that the described structured flowchart of Fig. 1 only is the purpose for example, rather than limitation of the scope of the invention.In some cases, can increase or reduce some equipment as the case may be.

Art technology technical staff knows that embodiments of the present invention can be implemented as a kind of system, method or computer program.Therefore, the disclosure can specific implementation be following form, that is: hardware, software (comprising firmware, resident software, microcode etc.) completely completely, the perhaps form of hardware and software combination, this paper is commonly referred to as " circuit ", " module " or " system ".In addition, in certain embodiments, the present invention can also be embodied as the form of the computer program in one or more computer-readable mediums, comprises computer-readable program code in this computer-readable medium.

Can adopt the combination in any of one or more computer-readable media.Computer-readable medium can be computer-readable signal media or computer-readable recording medium.Computer-readable recording medium for example can be, but be not limited to electricity, magnetic, light, electromagnetism, infrared ray or semi-conductive system, device or device, perhaps any above combination.The example more specifically of computer-readable recording medium (non exhaustive example) for example can comprise: have the electrical connection, portable computer diskette, hard disk, random-access memory (ram), read-only memory (ROM), erasable type programmable read only memory (EPROM or flash memory), optical fiber, Portable, compact disk read-only memory (CD-ROM), light storage device, magnetic memory device of one or more leads or the combination of above-mentioned any appropriate.In presents, computer-readable recording medium can be any comprising or stored program tangible medium, and this program can be used by instruction execution system, device or device or be used in combination with it.

Computer-readable signal media can be included in the base band or as the data-signal that a carrier wave part is propagated, wherein carry computer-readable program code.The data-signal of this propagation can adopt various ways, includes but not limited to the combination of electromagnetic signal, light signal or above-mentioned any appropriate.Computer-readable signal media can also be any computer-readable medium beyond the computer-readable recording medium, and this computer-readable medium can send, propagates or transmit the program of using or being used in combination with it for by instruction execution system, device or device.

The program code that comprises on the computer-readable medium can be with the transmission of any suitable medium, includes but not limited to wireless, electric wire, optical cable, RF etc., the perhaps combination of above-mentioned any appropriate.

Can make up to write for carrying out the computer program code that the present invention operates with one or more programming languages or its, described programming language comprises object-oriented programming language-such as Java, Smalltalk, C++, also comprises conventional process type programming language-such as " C " language or similar programming language.Program code can fully be carried out at subscriber computer, partly carries out at subscriber computer, carry out or carry out at remote computer or server fully at remote computer on subscriber computer top as an independently software kit execution, part.In relating to the situation of remote computer, remote computer can be connected to subscriber computer by the network (comprising Local Area Network or wide area network (WAN)) of any kind, perhaps, can be connected to outer computer (for example utilizing the ISP to come to connect by the internet).

With reference to the flow chart of the method for the embodiment of the invention and the block diagram of equipment (or system) embodiments of the present invention are described below.The combination that should be appreciated that each square frame in each square frame of flow chart and/or block diagram and flow chart and/or the block diagram can be realized by computer program instructions.These computer program instructions can offer the processor of all-purpose computer, special-purpose computer or other programmable data processing unit, thereby produce a kind of machine, these computer program instructions are carried out by computer or other programmable data processing unit, have produced the device of the function/operation of stipulating in the square frame in realization flow figure and/or the block diagram.

Also can be stored in these computer program instructions and can make in computer or the computer-readable medium of other programmable data processing unit with ad hoc fashion work, like this, the instruction that is stored in the computer-readable medium just produces a product that comprises the command device of the function/operation of stipulating in the square frame in realization flow figure and/or the block diagram.

Also can be loaded into computer program instructions on computer, other programmable data processing unit or the miscellaneous equipment, make and carry out the sequence of operations step at computer, other programmable data processing unit or miscellaneous equipment, producing computer implemented process, thus the process that makes the function/operation of in the instruction that computer or other programmable device are carried out can provide square frame in realization flow figure and/or the block diagram, stipulating.

According to the embodiment of the present invention, a kind of method and apparatus that sends speech message has been proposed.

In this article, it will be appreciated that any number of elements in the accompanying drawing is all unrestricted for example, and any name only be used for to distinguish all, and do not have any limitation.

Below with reference to some representative embodiments of the present invention, explained in detail principle of the present invention and spirit.

Claims

1. method that sends speech message, wherein this method is applied to client, comprising:

In response to corresponding trigger event, obtain speech data by recorded speech;

Described speech data is carried out speech recognition, obtain the identification text;

Contact person's text message in described identification text and the address list is mated;

If obtain and contact person's text message of identifying text matches, described speech data be sent to the determined receiving terminal of this contact person's text message.

2. method according to claim 1, wherein said trigger event comprises:

The pre-set button control that is presented on touch-screen is pressed;

Perhaps,

Two or more contacts are pressed also at touch-screen simultaneously and are stopped paddling behind one section track of paddling.

3. method according to claim 2 wherein in response to corresponding trigger event, obtains speech data by recorded speech and comprises:

During described pressing, recorded speech;

When detecting described when cancellation of pressing, stop to record, obtain the speech data of recording.

4. method according to claim 1 is wherein carried out speech recognition to described speech data and is comprised the pre-set interval of speech data is identified;

After the contact person's text message in described identification text and the address list is mated, also comprise: if obtain and contact person's text message of identifying text matches, recomputate pre-set interval, return the step that the pre-set interval of described speech data is identified.

5. method according to claim 1 if wherein obtain contact person's text message with the identification text matches, is sent to the determined receiving terminal of this contact person's text message with described speech data and comprises:

If obtain and contact person's text message of identifying text matches, send the prompting that whether sends voice;

If receive according to the positive acknowledgement of described prompting feedback, described speech data be sent to the determined receiving terminal of this contact person's text message.

6. method according to claim 1 if wherein obtain contact person's text message with the identification text matches, is sent to the determined receiving terminal of this contact person's text message with described speech data and comprises:

Send the new voice message notice to push server, described new voice message notice comprises the determined receiving terminal information of contact person's text message and the described download address of coupling, so that push server is sent to the receiving terminal of described receiving terminal information correspondence with described download address, so that this receiving terminal is downloaded speech data according to described download address from media server.

7. method according to claim 6, wherein with described push server, media server between communication specifically be connected the safe lane of setting up by front end and communicate by letter.

8. method according to claim 6, wherein said upload successfully after, also the receiving media server is according to the voice identifier ID that is used for the unique identification speech data of the speech data feedback of uploading, so that the described new voice message that sends to push server comprises described voice ID in notifying;

And also comprise:

9. method according to claim 6, wherein said upload successfully after, also the receiving media server is according to the voice ID of the logos sound data of the speech data feedback of uploading, so that the described new voice message that sends to push server also comprises described voice ID in notifying, also make described push server from described new voice message notice, extract voice ID, preserve the corresponding relation of the transmitting terminal of this new voice message notice of voice ID and transmission;

And also comprise:

If obtain the reply voice data by recorded speech;

Upload described reply voice data to media server;

10. device that sends speech message, wherein this device is disposed at client device, comprising:

Obtain voice unit: configuration is used in response to corresponding trigger event, obtains speech data by recorded speech;

Recognition unit: configuration is used for described speech data is carried out speech recognition, obtains the identification text;

Matching unit: configuration is used for contact person's text message of described identification text and address list is mated;

Transmitting element: if configuration is used for obtaining and contact person's text message of identifying text matches, described speech data is sent to the determined receiving terminal of this contact person's text message.

11. device according to claim 10, the wherein said voice unit that obtains: configuration is for the trigger event that is pressed in response to the pre-set button control that is presented on touch-screen; Perhaps, in response to two or more contacts simultaneously touch-screen press and one section track of paddling after stop the trigger event of paddling.

12. device according to claim 11, the wherein said voice unit that obtains: configuration is used for during described pressing, recorded speech; When detecting described when cancellation of pressing, stop to record, obtain the speech data of recording.

13. device according to claim 10, wherein said recognition unit: configuration is used for the pre-set interval of speech data is identified;

Described matching unit: if also configuration is used for not obtaining and contact person's text message of identifying text matches, recomputate pre-set interval, trigger recognition unit again and carry out.

14. device according to claim 10, wherein said transmitting element: if configuration is used for obtaining and contact person's text message of identifying text matches, send the prompting that whether sends voice; If receive according to the positive acknowledgement of described prompting feedback, described speech data be sent to the determined receiving terminal of this contact person's text message.

15. device according to claim 10, wherein said transmitting element: if configuration is used for obtaining and contact person's text message of identifying text matches, upload described speech data to media server; If upload success, the receiving media server is according to the download address of the speech data feedback of uploading; Send the new voice message notice to push server, described new voice message notice comprises the determined receiving terminal information of contact person's text message and the described download address of coupling, so that push server is sent to the receiving terminal of described receiving terminal information correspondence with described download address, so that this receiving terminal is downloaded speech data according to described download address from media server.

16. device according to claim 15, wherein said transmitting element: configuration is used for connecting the safe lane of setting up by front end and communicates by letter with described push server, media server.

17. device according to claim 15, wherein said transmitting element: also configuration is for the voice identifier ID that be used for unique identification speech data of receiving media server according to the speech data feedback of uploading, so that the described new voice message that sends to push server comprises described voice ID in notifying;

And also comprise:

Receiving element: if configuration is used for receiving the new voice message notice from described push server, and do not download to the speech data that this new voice message is notified the voice ID sign that comprises, send the updating message that comprises these voice ID and do not receive voice messaging to described push server, so that described push server is updated to the voice status of these voice ID correspondence according to this updating message the prompting message of the download speech data of not receiving the voice respective feedback; If receive the new voice message notice from described push server, and download to the speech data of the voice ID sign that new voice message notice comprises and do not listen to, send the updating message that comprises these voice ID and do not listen to information to described push server, do not listen to so that described push server is updated to the voice status of these voice ID correspondence according to this updating message, and receive described push server according to the voice status of these voice ID correspondence for not listening to the prompting message of listening to speech data of respective feedback; If receive the new voice message notice from described push server, and download to the speech data of the voice ID sign that this voice messaging notice comprises and listened to, send the updating message that comprises these voice ID and listened to information to described push server, listen to so that described push server is updated to the voice status of these voice ID correspondence according to this updating message, and make described push server send the prompting message of having listened to speech data to the transmitting terminal that sends this new voice message notice.

18. device according to claim 15, wherein said transmitting element: also configuration is used for the receiving media server according to the voice ID of the logos sound data of the speech data feedback of uploading, so that the described new voice message that sends to push server also comprises described voice ID in notifying, also make described push server from described new voice message notice, extract voice ID, preserve the corresponding relation of the transmitting terminal of this new voice message notice of voice ID and transmission;

And also comprise:

Receiving element: if configuration is used for receiving the new voice message notice from described push server, and download to the speech data of the voice ID sign that this voice messaging notice comprises and listened to, judge also whether the reply voice event is triggered, if, obtain the reply voice data by recorded speech, upload described reply voice data to media server, if upload success, the receiving media server is according to the download address of the reply voice data feedback of uploading, send a reply the voice messaging notice to push server, described reply voice information notice comprises the download address of described voice ID and media server feedback, so that described push server inquires these voice ID corresponding sending terminal, the download address that reply voice information notice is comprised is sent to these voice ID corresponding sending terminal, so that described transmitting terminal is downloaded described reply voice data from this download address of media server.

19. a method that sends speech message, wherein this method is applied to the push server end, comprising:

Receive the new voice message notice from transmitting terminal, described new voice message notice comprises receiving terminal information and download address, wherein, described receiving terminal information and download address are obtained by forwarding step by described transmitting terminal, wherein, described forwarding step comprises: in response to corresponding trigger event, obtain speech data by recorded speech, described speech data is identified, obtain the identification text, contact person's text message in described identification text and the address list is mated, if obtain and contact person's text message of identifying text matches, upload described speech data to media server, if upload success, the receiving media server is determined receiving terminal information according to the download address of the speech data feedback of uploading according to contact person's text message of coupling;

Described download address is sent to the receiving terminal of receiving terminal information correspondence, so that this receiving terminal is downloaded speech data according to described new voice message notice from the described download address of media server.

20. a device that sends speech message, wherein this device is disposed at push server end equipment, comprising:

The reception notification unit: configuration is used for receiving the new voice message notice from transmitting terminal, described new voice message notice comprises receiving terminal information and download address, wherein, described receiving terminal information and download address are obtained by forwarding step by described transmitting terminal, wherein, described forwarding step comprises: in response to corresponding trigger event, obtain speech data by recorded speech, described speech data is identified, obtain the identification text, contact person's text message in described identification text and the address list is mated, if obtain and contact person's text message of identifying text matches, upload described speech data to media server, if upload success, the receiving media server is determined receiving terminal information according to the download address of the speech data feedback of uploading according to contact person's text message of coupling;

Retransmission unit: configuration is used for described download address is sent to the receiving terminal of receiving terminal information correspondence, so that this receiving terminal is downloaded speech data according to described new voice message notice from the described download address of media server.

21. a method that sends speech message, wherein this method is applied to the media server end, comprising:

The speech data that receiving end/sending end is uploaded, wherein, described speech data is uploaded by forwarding step by transmitting terminal, and wherein, described forwarding step comprises: in response to corresponding trigger event, obtain speech data by recorded speech, described speech data is identified, obtained the identification text, the contact person's text message in described identification text and the address list is mated, if obtain and contact person's text message of identifying text matches, upload described speech data to media server;

If described the reception successfully, feed back download address according to the speech data that receives to transmitting terminal, so that described transmitting terminal sends the new voice message notice to push server, described new voice message notice comprises receiving terminal information and the download address of determining according to contact person's text message of coupling.

22. a device that sends speech message, wherein this device is disposed at media server end equipment, comprising:

Receive uploading unit: configuration is used for the speech data that receiving end/sending end is uploaded, wherein, described speech data is uploaded by forwarding step by transmitting terminal, wherein, described forwarding step comprises: in response to corresponding trigger event, obtain speech data by recorded speech, described speech data is identified, obtain the identification text, contact person's text message in described identification text and the address list is mated, if obtain and contact person's text message of identifying text matches, upload described speech data to media server;

Feedback unit: if configuration is used for described the reception successfully, feed back download address according to the speech data that receives to transmitting terminal, so that described transmitting terminal sends the new voice message notice to push server, described new voice message notice comprises receiving terminal information and the download address of determining according to contact person's text message of coupling.