OA17170A

OA17170A - Method and system for transferring speech information.

Info

Publication number: OA17170A
Application number: OA1201400379
Authority: OA
Inventors: Bin Zhang; Zhenan Guan; Xing LIANG; Yuewei Chen; Lejun LIU
Original assignee: Tencent Technology (Shenzhen) Company Limited
Priority date: 2012-02-21
Filing date: 2013-01-18
Publication date: 2016-04-05

Abstract

It relates to computer communication technology. A method and system for transmitting voice messages are disclosed. The method includes: voice data collected by the first 5 intercom terminal are received; whether the size of the voice data collected reaches a predefined threshold are circularly detected; and when the size of the voice data collected reaches the predefined threshold, or when the voice data collected doesn't reach the predefined size but contains a voice message terminator, the voice data collected are upload to the transit server via a predefined network, to realize 10 asynchronization between voice data collection and uploading. Thus the problem of the prolongation of the intercom data transmission time associated with existing processes is solved by such an effective intercom data transmission method. In this method, the collection and uploading of the voice data can be done asynchronously, thus the delay of intercom can be reduced and the experience of the users of intercom terminal can be 15 improved.

Description

Method and System for Transferring Speech Information

TECHNICAL FIELD

The présent disclosure relates to computer communication technology, and more particulariy, to a method and system for transmitting voice messages.

BACKGROUND

The network-based voice intercom is a network-based message transmission 10 application and it can simulate the behavior of short message service to provide a new interaction mode for network users.

However, in the existing processes, an intercom terminal of a sender (e.g., mobile terminal) records a voice message when it receives a user intercom command (e.g., an instruction triggered by touching), and then, the voice message 15 is uploaded to a server, finally an intercom terminal of a récipient can download the voice message from the server and play it. As can be seen, the intercom terminal of the récipient has to wait to download the voice message until after the intercom terminal of the sender has completed the upload of the complété voice message of one time of intercom. This takes additional time for the transmission of .20 intercom voice messages, leading to a réduction in the intercom expérience of users.

SUMMARY

The présent disclosure provides a method and system for transmitting voice 25 messages, seeking to address the problem of the prolongation of the intercom data transmission time associated with the existing processes.

An embodiment of the présent disclosure provides a method for transmitting voice messages. The method includes the following steps:

receiving voice data collected by a first intercom terminal*, detecting whether the size of the voice data collected reaches a predefined threshold;

uploading the voice data collected to a transit server via a predefined network on condition that the size of the voice data collected reaches the predefined threshold or when the voice data collected contains a voice message terminator.

Another embodiment of the présent disclosure is to provide a System for transmitting voice messages. The System includes:

a data collection unit, configured to receive voice data collected by a first intercom terminal;

a cycle détection unit, configured to detect whether the size of the voice data collected reaches a predefined threshold; and a data uploading unit, configured to upload the voice data collected to a transit server via a predefined network when the size of the voice data collected reaches the predefined threshold or when the voice data collected contains a voice message terminator.

Another embodiment of the présent disclosure further provides a computer storage medium for storing computer exécutable programs. The computer exécutable programs are used to execute a method for transmitting voice messages according to an embodiment of the présent disclosure.

The embodiments of the présent disclosure can realize asynchronization between the collection of voice data by the first intercom terminal and the uploading of the voice data collected of predefined size to the transit server, thereby solving the problem of tirne consuming during the traditional voice message transmissions, reducing the intercom delay and improving the personalization of intercom in such manner that the voice data collected by the first intercom terminal is received and circularly detected to déterminé whether the size of the voice data collected reaches the predefined threshold and then the voice data collected are uploaded to the transit server via the predefined network when the size of the voice data collected reaches the predefined threshold or when the voice data collected doesn’t reach the predefined size but contains a voice message terminator.

BRIEF DESCRIPTION OF THE DRAWINGS

The aforementioned technical solution in the embodiments of the présent disclosure can be better understood by reading the literal explanation of the disclosure with référencé to the drawings. The accompanying drawings used in description of the embodiments are introduced briefly below. Obviously, the accompanying drawings in the description are only some of the embodiments of the disclosure. Those skilled in the art can obtain other accompanying drawings based on the accompanying drawings below without any créative work.

Figure 1 is a flowchart of a method for transmitting voice messages according to embodiment 1 of the présent disclosure;

Figure 2 is a flowchart of asynchronous transmission of voice data between a first intercom terminal and a second intercom terminal according to embodiment 2 of the présent disclosure;

Figure 3 is a flowchart of a method for transmitting voice messages according to embodiment 3 of the présent disclosure;

Figure 4 is a flowchart of a method for transmitting voice messages according to embodiment 4 of the présent disclosure;

Figure 5 is a schematic diagram illustrating a graphical interface output by the voice message transmission method according to embodiment 4 of the présent disclosure;

Figure 6 is a structural diagram illustrating a System for transmitting voice messages according to embodiment 5 of the présent disclosure;

Figure 7 is a structural diagram illustrating a System for transmitting voice messages according to embodiment 6 of the présent disclosure.

DETAILED DESCRIPTION OF THE DISCLOSURE

In order to better clarify the objectives, technical solution and advantages of the présent disclosure, detailed description is given below on embodiments of the présent disclosure in conjunction with the accompanying drawings.

Although the disclosure has been described in connection with spécifie preferred embodiments, it should be understood that the disclosure as claimed should not be unduly limited to such spécifie embodiments

The implémentation of the présent disclosure will be described in detail in conjunction with spécifie embodiments as follows:

Embodiment 1:

The increase in the transmission speed of data between mobile terminais data and the réduction in the transmission cost per unit data volume provide favorable conditions for the network-based voice intercom which realizes voice intercom through using the network flow and simulating the behavior of the traditional short message service (SMS).

Figure 1 illustrâtes a flowehart of a method for transmitting voice messages transmission method according to embodiment 1 of the présent disclosure, as described below in detail:

In step S101, receive voice data collected by a first intercom terminal.

In step S102, detect circularly whether the size of the voice data collected reaches a predefined threshold.

In this embodiment of the présent disclosure, when the intercom command is received from a user, the first intercom terminal starts to collect the user’s voice data untii the end of this intercom. As a resuit, the complété voice message of the user of the first intercom terminal in one time of intercom has been obtained, which includes a number of voice data (data packages). In practical implémentation, the intercom command may be generated by such a triggering event as spécifie voice, pressing on a physical key of intercom terminal or virtual key and so on.

In step S103, upload the voice data collected to a transit server via a predefined network when the size of the voice data collected reaches the predefined threshold or when the voice data collected doesn’t reach the predefined size but contains a voice message terminator.

In this embodiment of the présent disclosure, a value is predefined as the threshold for the voice data encapsulation and the size of data to be uploaded, and whether the size of the voice data collected reaches the predefined threshold is ctrcularly detected. Once the predefined threshold is reached, or when the size of the voice data collected doesn’t reach the predefined threshold but the voice data collected contatns a voice message terminator, the data will be encapsulated according to the transmission protocol or format of the predefined network transmission and uploaded to the transit server, thereby realizing the asynchronization between collection and uploading of the voice data and reducing the intercom delay.

In practical implémentation, the predefined threshold may be either a fixed value or the value which is a fonction of intercom time. The définition of such threshold shall take the voice data sending network used by the intercom terminal, the data processing capability of the intercom terminal and the user’s demand for the real-time intercom into comprehensive considération. For instances, if the voice data sending network is fast and the intercom terminal has a high data processing capability, this threshold may be set to a smaller value, and if not, it shall be set to a larger one; if the user has a high demand for the real-time intercom and the intercom terminal has a high data processing capability, this threshold may be set to a smaller value, and if not, it shall be set to a larger one. Therefore, the fact that the threshold shall be set according to particular application environment is not intended to limit the scope of the présent disclosure.

In practical implémentation, when the size of the voice data collected doesn’t reach the predefined threshold but the user of the first intercom terminal has sent the signal to end this intercom, namely, the voice data collected contains a voice message terminator, the voice data collected of actual size will be immediately uploaded. To be exact, the predefined network may be either a wireless network, such as WiFi network or GPRS network, or a wired network. But herein it is not intended to limit the scope of the présent disclosure.

The présent embodiment of the disclosure has realized the asynchronization between the collection of voice data by the first intercom terminal and the uploading of the voice data collected of predefined size to the transit server and thereby solved the problem of time consuming with the traditional voice message transmission, reduced the intercom delay and improved the personalization of intercom, through receiving the voice data collected by the first intercom terminal, circularly detecting whether the size of the voice data collected reaches the predefined threshold and then uploading the voice data collected to the transit server via the predefined network when the size of the voice data collected reaches the predefined threshold or when the voice data collected doesn’t reach the predefined size but contains a voice message terminator.

Embodiment 2:

Figure 2 illustrâtes a flowchart of the asynchronous transmission of voice data between a first intercom terminal and a second intercom terminal in another embodiment of the présent disclosure. In the implémentation of the présent disclosure as described below, the whole intercom System includes the first intercom terminal, the transit server and the second intercom terminal:

In step 1: the first intercom terminal collects voice data.

In step 2: the first intercom terminal circularly detects whether the size of the voice data collected reaches a predefined threshold.

In step 3: the first intercom terminal uploads the voice data collected to the transit server via a predefined network once the size of the voice data collected reaches the predefined threshold or when the voice data collected doesn't reach the predefined size but contains a voice message terminator.

In this embodiment of the présent disclosure, steps 1-3 resemble Steps S10I-S103 in embodiment 1, and no more description is given here.

In step 4: the transit server sends the voice data uploaded by the first intercom terminal to the second intercom terminal.

In this embodiment of the présent disclosure, the second intercom terminal is the receiving terminal. After the transit server receives the voice data uploaded by the first intercom terminal, it asynchronously sends the received voice data to the second intercom terminal, making it possible for the second intercom terminal to receive the voice data collected in real time, without the need to wait for the arrivai of ail voice messages in one time of intercom at the transit server, and thereby reducing the time used by the second intercom terminal to receive data.

In step 5: the second intercom terminal plays ail voice data after receiving the ail voice data of the current intercom from the first intercom terminal.

In this embodiment of the présent disclosure, a value is predefined as the threshold for the voice data encapsulation and the size of data to be uploaded, and whether the size of the voice data collected reaches the predefined threshold is circularly detected. Once the predefined threshold is reached, or when the size of the voice data collected doesn’t reach the predefined threshold but it contains a voice message terminator, the data will be encapsulated according to the transmission protocol or format of the predefined network transmission, and while continuing to collect data, the voice data collected of predefined size will be uploaded to the transit server, realizing the asynchronization between data collection and uploading and reducing the intercom delay. Correspondingly, the second intercom terminal can also download the voice data in a timely manner from the transit server or timely receive the voice data transferred by the transit server, reducing the time used by the second intercom terminal for downloading/receiving data. After receiving ail voice data in the current intercom from the first intercom terminal, the second intercom terminal plays ail the voice data in this time of the current intercom, eventually realizing the network-based intercom and reducing the data transmission time in this time of intercom.

Embodiment 3:

Figure 3 illustrâtes a flowehart of a method for transmîtting voice messages according to embodiment 3 of the présent disclosure. This embodiment is described in detail as below:

In step S301, receive voice data collected by a first intercom terminal.

In step S302, store the voice data collected to a predefined upload queue.

Preferably, in this embodiment of the présent disclosure, an upload queue is predefined, for caching the voice data collected to be uploaded to the transit server.

In step S3O3, circularly detect whether the size of the voice data collected reaches the predefined threshold.

In step S304, detect whether the size of the voice data collected reaches predefined threshold value, and if yes, go to step S306, or if not, go to step S305.

In step S305, judge whether the voice data collected contains a voice message terminator, and if yes, go to step S306, or if not, go to step S304.

In step S306, upload the voice data collected to the transit server via a predefined network.

In this embodiment of the présent disclosure, the size of the upload queue in step S302 may be set to an intégral multiple of the predefined threshold, for conveniently storing the voice data collected. When the size of the voice data collected reaches the predefined threshold or when the voice data collected doesn’t reach the predefined size but contains a voice message terminator, the voice data collected will be uploaded to the transit server via the predefined network. In case the size of the voice data collected reaches the predefined threshold but the voice data can't be sent out in a timely manner, it can be cached in the upload queue, so as to avoid the loss of voice data.

Embodiment 4:

Figure 4 illustrâtes a flowchart of a method for transmitting voice messages according to embodiment 4 of the présent disclosure. This embodiment is described in detail as below:

In step S401, receive the voice data collected by the first intercom terminal.

In step S402, timely output a graphical interface to the first intercom terminal and the graphical interface contains the sound volume information corresponding to the voice data collected.

In this embodiment of the présent disclosure, while collecting the voice data of the first intercom terminal, namely, when the user is speaking, a graphical interface to the first intercom terminal is timely output. This graphical interface contains the sound volume information corresponding to the voice data collected, clearly indicating to the user the Ioudness of his voice. As an example, Figure 5 illustrâtes the schematic view of the graphical interface output by the voice message transmission method according to embodiment 4 of the présent disclosure. As shown in Figure 5, the output graphical interface includes a walkie talkie image and a Sound volume image, improving the visualization effect ofthe intercom terminal.

In step S403, store the voice data collected to a predefined upload queue.

In step S404, detect whether the size of the voice data collected reaches a predefined threshold, and if yes, go to step S406, or if not, go to step S405.

In this embodiment of the présent disclosure, the size of the upload queue in step S403 may be set to an intégral multiple of the predefined threshold, for conveniently storing the voice data collected. When the size of the voice data collected reaches the predefined threshold or when the voice data collected doesn’t reach the predefined size but contains a voice message terminator, the voice data collected will be uploaded to the transit server via the predefined network. In case the size of the voice data collected reaches the predefined threshold but the voice data can’t be sent out in a timely manner, it can be cached in the upload queue, so as to avoid the loss of voice data.

In step S405, judge whether the voice data collected contains a voice message terminator, and if yes, go to step S406, or if not, go to step S404.

ίο

In step S406, judge whether the first intercom terminal is successfully connected to predefined network. If yes, go to step S407; if, not, continue step S406.

In step S407, after the first intercom terminal is successfully connected to the predefined network, upload the voice data in the upload queue to the transit server via the predefined network.

In this embodiment of the présent disclosure, if the user collects the voice data through making a recording with the first intercom terminal not connected to the network, the voice data collected is cached in the upload queue. At the same time, the first intercom terminal continuously tries to connect to the predefined network. Once it is successfully connected to the network, the voice data collected will be uploaded to the transit server via the network. In this way, the automatic uploading of voice data without the need of manual intervention can be realized provided that the first intercom terminal is online, making the intercom terminal more intelligent.

Those skilled in the art can understand that ail or part of the steps to realize the method described in the embodiment above can be accomplished through programs that instruct relevant hardware, wherein the programs may be stored în a computer readable storage medium, and the storage medium may be a magnetic disk, an optical disk, a Read Only Memory (ROM), or a Random Access Memory (RAM).

Embodiment 5:

Figure 6 illustrâtes a structure of a System for transmitting voice messages according to embodiment 5 of the présent disclosure. For ease of présentation, only parts in relation to the présent embodiment of the disclosure are illustrated, including:

A data collection unit 51, configured to receive voice data collected by a first intercom terminal.

A cycle détection unit 52, configured to circularly detect whether the size of the voice data collected reaches a predefined threshold.

ιι

A data uploading unît 53, configured to upload the voice datacollected to a transit server via a predefined network when the size of the voice data collected reaches the predefined threshold or when the voice data collected doesn’t reach the predefined size but contains a voice message terminator.

In this embodiment of the présent disclosure, the System for transmitting voice message can be implemented using the method described in embodiment

1. Please refer to the description of embodiment 1.

Embodiment 6:

Figure 7 illustrâtes a structure of a system for transmitting voice messages according to embodiment 6 of the présent disclosure. For ease of présentation, only parts in relation to the présent embodiment of the disclosure are illustrated, including:

A data collection unit 61, configured to receive voice data collected by a first intercom terminal.

An interface output unit 62, configured to timely output a graphical interface to the first intercom terminal, wherein the graphical interface contains Sound volume information corresponding to the voice data collected.

A storage unit 63, configured to store the voice data collected to a predefined upload queue.

A cycle détection unît 64, configured to circularly detect whether the size of the voice data collected reaches a predefined threshold.

A data uploading unit 65, configured to upload the voice data collected to the transit server via the predefined network when the size of the voice data collected reaches the predefined threshold or when the voice data collected doesn’t reach the predefined size but contains a voice message terminator.

A data sending unit 66, configured to control the transit server to send the voice data collected to the second intercom terminal.

In this embodiment of the présent disclosure, if the user collects the voice data by making a recordîng with the first intercom terminal not connected to the network, the voice data collected is cached in the upload queue. At the same time, the first intercom terminal continuously tries to connect to the predefined network. Once it is successfully connected to the network, the voice data collected will be uploaded to the transit server via the network. In this way, the automatic uploading of the voice data without the need of manual intervention can be realized provided that the first intercom terminal is online, making the intercom terminal more intelligent. To this end, the data uploading unit 65 may further include a connection judgment sub-unit 651 and a data uploading sub-unit 652, wherein:

The connection judgment sub-unit 651 is configured to circularly judge whether the first intercom terminal is successfully connected to the predefined network; and

The data uploading sub-unit 652 is configured to upload voice data in the upload queue to the transit server via the predefined network once the first intercom terminal is successfully connected to the predefined network.

The présent embodiment of the disclosure has realized the asynchronization between the collection of voice data by the first intercom terminal and the uploading of the voice data collected of a predefined size to the transit server. Therefore, the problem of time consuming during the traditional voice message transmission is solved, reduced the intercom delay is reduced and the personalization of intercom is improved through receiving the voice data collected by the first intercom terminal, circularly detecting whether the size of the voice data collected reaches the predefined threshold and then uploading the voice data collected to the transit server via the predefined network when the size of the voice data collected reaches the predefined threshold or when the voice data collected doesn’t reach the predefined size but contains a voice message terminator. In case the intercom terminal is not connected to the predefined network, the voice data collected will be cached in the upload queue, and at the same time, the first intercom terminal continuously tries to connect to the predefined network. Once it is successfully connected to the network, the voice data collected will be uploaded to the transit server via the network. In this way, the automatic uploading of voice data without the need of manual intervention can be realized provided that the first intercom terminal is online, making the intercom terminal more intelligent.

Person skilled in the art can understand that ail or part of the steps to realize the method described in the embodiment above can be accomplished through programs that instruct relevant hardware, wherein the programs may be stored in a computer readable storage medium, and the storage medium may be a magnetic disk, an optical disk, a Read Only Memory (ROM), or a Random 10 Access Memory (RAM).

The preferred embodiments described above are ail exemplary in nature only and should not be construed as restrictions to the présent disclosure in any way.

Any modifications, variations, équivalent replacements and improvements 15 which are apparent to those skilled in the art without departing from the scope and spirit of the présent disclosure are intended to be within the scope of the following daims.

Claims

Claims

What is claimed is:

1. A method for transmitting voice messages, comprising: receiving voice data collected by a first intercom terminal;

5 detecting whether the size of the voice data collected reaches a predefined threshold; and uploading the voice data collected to a transit server via a predefined network when the size of the voice data collected reaches the predefined threshold, or when the voice data collected contains a voice message terminator.

10
2. The method according to claim 1, further comprising:

controlling the transît server to send the voice data collected to a second intercom terminal after uploading the voice data collected to the transit server via the predefined network.
3. The method according to claim 1, further comprising:

15 storing the voice data collected to a predefined upload queue after receiving the voice data collected by the first intercom terminal and before detecting circularly whether the size of the voice data collected reaches the predefined threshold.
4. The method according to claim 3, wherein uploading the voice data 20 collected to the transit server via the predefined network comprises:

detect circularly whether the first intercom terminal is connected to the predefined network successfully; and uploading the voice data in the upload queue to the transit server via the predefined network when the first intercom terminal is successfully connected to 25 the predefined network.
5. The method according to claim 1, wherein the first intercom terminal is a mobile terminal, and the predefined network is either a GPRS network or a WiFi network.
6. The method according to any one of claims 1-5, further comprising:

30 outputting a graphical interface to the first intercom terminal after receiving the voice data collected by the first intercom terminal; wherein the graphical interface contains Sound volume information corresponding to the voice data collected.
7. A System for transmitting voice messages, comprising:

5 a data collection unit, configured to receive voice data collected by a first intercom terminal;

a cycle détection unit, configured to detect whether the size of the voice data collected reaches a predefined threshold; and a data uploading unit, configured to upload the voice data collected to a transit 10 server via a predefined network when the size of the voice data collected reaches the predefined threshold, or when the voice data collected contains a voice message terminator.
8. The system according to claim 7, further comprising:

a data sending unit, configured to control the transit server to send the voice

15 data collected a second intercom terminal.
9. The system according to claim 7, further comprising:

a storage unit, configured to store the voice data collected to a predefined upload queue.
10. The system according to claim 9, wherein the data uploading unit 20 comprises:

a connection judgment sub-unit, configured to judging circularly whether the first intercom terminal is successfully connected to the predefined network; and a data upload sub-unit, configured to upload the voice data in the upload queue to the transit server via the predefined network when the first intercom 25 terminal is successfully connected to the predefined network.
11. The System according to any one of daims 7-11, further comprising:

an interface output unit, configured to output a graphical interface to the first intercom terminal, wherein the graphical interface contains Sound volume information corresponding to the voice data collected.

30 12. A computer storage medium comprising one or more computer exécutable t

programs, the one or more computer exécutable programs are to be executed to perform the method for transmitting voice messages according to any one of daims 1 to 6.