WO2020221105A1 - Method and device for processing voice short messages, and medium - Google Patents

Info

Publication number
WO2020221105A1
Authority
WO
WIPO (PCT)
Prior art keywords
short message
voice short
voice
editing
text
Prior art date
Application number
PCT/CN2020/086508
Other languages
English (en)
Chinese (zh)
Inventor
杨静静
Original Assignee
上海掌门科技有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 上海掌门科技有限公司
Publication of WO2020221105A1

Classifications

    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04L TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L51/00 User-to-user messaging in packet-switching networks, transmitted according to store-and-forward or real-time protocols, e.g. e-mail
    • H04L51/04 Real-time or near real-time messaging, e.g. instant messaging [IM]
    • H04L51/046 Interoperability with other network applications or services
    • H04L51/06 Message adaptation to terminal or network requirements

Definitions

  • the invention relates to the field of computer technology, in particular to a method, equipment and medium for processing voice short messages.
  • In the prior art, most software products with social attributes have the function of sending voice short messages.
  • the function of sending voice short messages means that through corresponding software, users can record and send their own voice to other software users.
  • the existing voice short message sending function is relatively simple, and usually only supports the basic operations of input and sending.
  • the following embodiments of the present invention provide a method, device, and medium for processing voice short messages, which are used to avoid duplication of entry, thereby improving the communication efficiency based on voice short messages.
  • Some embodiments of the present invention provide a method for processing voice short messages, including: obtaining a trigger operation input by a user through a conversation interface of a chat session of social software; in response to the trigger operation, displaying an editing interface for a first voice short message, the first voice short message having been input by the user through the conversation interface; obtaining an editing operation based on the editing interface; and, in response to the editing operation, editing the first voice short message to obtain the edited voice short message.
  • Some embodiments of the present invention provide a method for processing voice short messages, including: a terminal obtaining a trigger operation input by a user through a conversation interface of a chat session of social software; in response to the trigger operation, displaying an editing interface for a first voice short message, the first voice short message having been input by the user through the conversation interface; obtaining an editing operation based on the editing interface; sending an editing instruction to the server, the editing instruction containing information corresponding to the editing operation; obtaining the processing result, fed back by the server, of editing the first voice short message; and obtaining the edited voice short message based on the processing result.
  • Some embodiments of the present invention provide a method for processing voice short messages, including: a server obtaining a first voice short message uploaded by a terminal, the first voice short message having been input to the terminal through the conversation interface of a chat session of social software on the terminal; obtaining an editing instruction sent by the terminal, the editing instruction being generated based on an editing operation input through the editing interface of the social software on the terminal; in response to the editing instruction, editing the first voice short message to obtain the edited voice short message; and sending the edited voice short message to the terminal.
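  • As an illustration of the terminal/server variant, the editing instruction could be serialized on the terminal and applied on the server roughly as sketched below. The JSON field names (`message_id`, `operation`, `start`, `end`), both function names, and the simplification of treating the audio as raw bytes at a fixed byte rate are assumptions made for this sketch; the text only states that the instruction carries information corresponding to the editing operation.

```python
import json


def build_edit_instruction(message_id: int, start: float, end: float) -> str:
    """Terminal side: describe a 'delete this time range' editing operation.

    All field names are hypothetical; they are not taken from the patent text.
    """
    return json.dumps(
        {"message_id": message_id, "operation": "delete", "start": start, "end": end}
    )


def handle_edit_instruction(raw: str, stored_audio: dict, bytes_per_second: int) -> bytes:
    """Server side: apply the instruction to the uploaded audio and return the result.

    `stored_audio` maps message ids to raw audio bytes uploaded earlier.
    """
    instruction = json.loads(raw)
    if instruction["operation"] != "delete":
        raise ValueError("only the 'delete' operation is sketched here")
    audio = stored_audio[instruction["message_id"]]
    first = int(instruction["start"] * bytes_per_second)
    last = int(instruction["end"] * bytes_per_second)
    # Keep everything before and after the selected range.
    return audio[:first] + audio[last:]
```

  • In this sketch the server would send the returned bytes back to the terminal as the edited voice short message.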
  • Some embodiments of the present invention provide a device for information processing, which includes a memory for storing computer program instructions and a processor for executing program instructions, wherein when the computer program instructions are executed by the processor, the device is triggered to execute the voice short message processing method described above.
  • Some embodiments of the present invention provide a computer-readable medium having computer-readable instructions stored thereon, and the computer-readable instructions can be executed by a processor to implement the voice short message processing method described above.
  • At least one of the technical solutions adopted by the above embodiments of the present invention breaks with the conventional thinking described above and can achieve the following beneficial effects: a trigger operation input by the user through the conversation interface of a chat session of social software is obtained; in response to the trigger operation, the editing interface for the first voice short message is displayed; an editing operation based on the editing interface is obtained; and, in response to the editing operation, the first voice short message is edited to obtain the edited voice short message. As a result, when a voice short message needs to be changed, the user can edit the original voice short message instead of entering a new one, which improves the communication efficiency based on voice short messages.
  • FIG. 1 is a schematic diagram of an application scenario of a voice short message processing method provided by some embodiments of the present invention
  • FIG. 2 is a schematic flowchart of a method for processing voice short messages according to some embodiments of the present invention
  • FIG. 3 is a schematic diagram of a conversation interface of a social software chat session provided by some embodiments of the present invention.
  • FIG. 4 is a schematic diagram of a conversation interface including a text editing interface for the first voice short message provided by some embodiments of the present invention
  • FIG. 5 is a schematic diagram of a trigger mode for a first voice short message that has been input but not sent according to some embodiments of the present invention
  • FIG. 6 is a schematic diagram of a method for triggering a first voice short message that has been sent according to some embodiments of the present invention
  • FIG. 7 is a schematic diagram of an editing manner for multiple first voice short messages provided by some embodiments of the present invention.
  • FIG. 8 is a schematic flowchart of a voice short message processing method based on the application scenario in FIG. 1 according to some embodiments of the present invention
  • FIG. 9 is a schematic structural diagram of a device for information processing provided by some embodiments of the present invention.
  • Fig. 1 is a schematic diagram of an application scenario of a voice short message processing method provided by some embodiments of the present invention.
  • the application scenario may include a server 101 and a terminal device 102.
  • the server 101 is a server of a service provider that can provide social network services.
  • the service provider can run an application 103 that supports the function of sending voice short messages.
  • the application 103 can be social software.
  • Based on the server 101 and the application 103, the service provider can provide users with social network services that support voice short message interaction.
  • The terminal device 102 may be equipped with the application 103 that supports the voice short message sending function. It should be noted that what is carried on the terminal device 102 may be the client part of the application 103, and what is carried on the server 101 may be the server part of the application 103. As an example, in FIG. 1, the terminal device 102 may be the client of the application 103, and correspondingly, the server 101 may be the server of the application 103. In practical applications, the terminal device 102 may be any of various terminal devices with a display screen, including but not limited to smart phones, tablet computers, portable computers, and desktop computers; the server 101 may be a service device that provides various services, including but not limited to integrated servers and distributed servers.
  • The number of servers and terminal devices in FIG. 1 is merely illustrative. According to implementation needs, any number of servers and terminal devices can be used.
  • FIG. 2 is a schematic flowchart of a method for processing voice short messages according to some embodiments of the present invention.
  • the execution subject of the process can be a terminal device equipped with an application program.
  • the process can include:
  • Step S201 Acquire the trigger operation input by the user through the conversation interface of the chat session of the social software.
  • the conversation interface of the chat session of the social software may be one of the application interfaces of the application program, and the conversation interface may be displayed on the user's terminal device.
  • the user can trigger the conversation interface to input the trigger operation according to the preset rules of the social software, and then edit the voice short message.
  • a voice short message editing button may be provided in the conversation interface, and the user can input the trigger operation by clicking the voice short message editing button.
  • Alternatively, the voice short message editing button may not be provided in the conversation interface by default. In that case, an option providing the voice short message editing function may be displayed after the user clicks a voice short message.
  • Fig. 3 is a schematic diagram of a conversation interface of a chat conversation of social software provided by some embodiments of the present invention.
  • a conversation interface of a chat conversation of social software is displayed on the terminal device 102 of the first user, and no voice short message editing button is set in the conversation interface.
  • the display screen of the terminal device 102 displays a conversation interface 301 of a chat session of social software.
  • The conversation message 302 and the conversation message 303 represent conversation messages input by the first user.
  • The conversation message 304 represents a conversation message input by the second user.
  • The second user is a chat partner of the first user, and the first user and the second user may be different users.
  • The conversation message 302 has a voice short message identifier in the form of a "sound wave" and voice duration information (where "2s" means that the voice short message lasts 2 seconds), which means that the conversation message 302 is a voice short message with a voice duration of 2 seconds.
  • the conversation message 303 is a voice short message with a voice duration of 6 seconds.
  • the conversation message 304 does not have a voice short message identifier and voice duration information, which means that the conversation message 304 is text information.
  • the conversation interface 301 may also be provided with a voice short message entry button 305, and the user can click the voice short message entry button 305 to trigger the voice entry function. When the user needs to edit the voice short message 303, the user can click on the voice short message 303.
  • After the click, the conversation interface 301 may be as shown in FIG. 3b: a voice short message edit button 306 is displayed on the conversation interface 301, and the user may click the voice short message edit button 306 to input the trigger operation.
  • Step S202 In response to the trigger operation, display an editing interface for the first voice short message; the first voice short message is input by the user through the conversation interface.
  • the editing interface is used to display information related to editing operations on the first voice short message, so that the user can perform corresponding editing operations on the first voice short message according to requirements.
  • Step S203 Obtain an editing operation based on the editing interface.
  • Step S204 In response to the editing operation, edit the first voice short message to obtain the edited voice short message.
  • In this method, the editing interface for the first voice short message is displayed and, after the user's editing operation based on the editing interface is obtained, the corresponding edited voice short message is generated. The user can thus edit the first voice short message and obtain the required voice short message without re-entering it, which improves both the communication efficiency based on voice short messages and the user experience.
  • this specification provides an implementation manner for obtaining the editing operation of the editing interface.
  • a conversion process for the first voice short message may also be included.
  • the conversion process can be executed locally by the terminal or by the server.
  • the following steps may be adopted: recognizing the first voice short message, and obtaining text information corresponding to the first voice short message.
  • Displaying an editing interface for the first voice short message in step S202 may include: displaying a text editing interface for the first voice short message, the text editing interface containing text corresponding to the text information.
  • obtaining an editing operation based on the editing interface in step S203 may include: obtaining a selection operation for a text on the text editing interface.
  • FIG. 4 is a schematic diagram of a conversation interface including a text editing interface for the first voice short message provided in some embodiments of the present invention.
  • a conversation interface of a social software chat session is displayed on the display screen of the terminal device 102.
  • the conversation interface includes a first voice short message 303 and a text editing interface 401 for the first voice short message.
  • the first voice short message 303 is the same as the conversation message 303 in FIG. 3.
  • the text editing interface 401 displays text corresponding to the first voice short message 303. Assume that the text corresponding to the first voice short message is "I plan to leave work tonight at 6 o’clock, and I am expected to be home at 5 o’clock tonight.” Since the home time is earlier than the off time, the home time is wrong.
  • The first voice short message 303 therefore needs to be edited to obtain a voice short message with correct content.
  • the user can perform a selection operation on the text on the text editing interface according to the preset rules of the social software.
  • the social software can obtain the user's selection operation on the text on the text editing interface.
  • The display state of the selected text can be changed, for example by changing the size, font, or shading of the selected text. Figure 4b takes changing the shading of the selected text as an example.
  • By displaying the text editing interface for the first voice short message, the text obtained by recognizing the first voice short message is shown to the user, so that the user knows the specific content of the first voice short message without playing it. This makes it more convenient for the user to identify problems in the first voice short message.
  • Because the editing operation is performed as a selection on the editing interface, the user can edit conveniently, quickly, and intuitively, which can improve the user experience.
  • Displaying the editing interface for the first voice short message may also be implemented by displaying an audio editing interface for the first voice short message. The audio editing interface may include the playback time axis corresponding to the first voice short message. The user can select two moments on the time axis to select the segment of the voice short message corresponding to the two selected moments.
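  • A minimal sketch of the time-axis selection, assuming the voice short message is available as uncompressed mono PCM bytes. The function name `select_segment` and its parameters are hypothetical, and the sketch reads the two selected moments as the boundaries of the selected segment.

```python
def select_segment(pcm: bytes, sample_rate: int, sample_width: int,
                   t_first: float, t_second: float) -> bytes:
    """Return the mono PCM bytes lying between two moments (in seconds)."""
    # The user may pick the two moments in either order.
    t_start, t_end = sorted((t_first, t_second))
    total_seconds = len(pcm) / (sample_rate * sample_width)
    # Clamp both moments to the playback time axis.
    t_start = max(0.0, t_start)
    t_end = min(total_seconds, t_end)
    # Convert seconds to byte offsets aligned to whole samples.
    first = int(round(t_start * sample_rate)) * sample_width
    last = int(round(t_end * sample_rate)) * sample_width
    return pcm[first:last]
```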
  • the editing interface in some embodiments of the present invention may be a text editing interface, an audio editing interface and other editing interfaces, which are not specifically limited here.
  • Obtaining the trigger operation input by the user in step S201 may be implemented in multiple ways.
  • One implementation manner may be: obtaining a trigger operation for a voice short message that has been input but not sent. Another implementation manner may be: obtaining a trigger operation for a voice short message in the sent state.
  • In the first implementation manner, obtaining the trigger operation input by the user through the conversation interface of the social software in step S201 may include: obtaining a first trigger operation for the first voice short message that has been input but not sent.
  • When the voice state of the first voice short message is input but not sent, the user needs to perform a sending operation before the first voice short message is sent out.
  • The conversation interface of the social software displays a trigger area for the first voice short message that has been input but not sent, and the user can trigger the trigger area according to the preset rules of the social software to perform the first trigger operation on that message. It should be noted that the specific operation steps for the user to perform the first trigger operation on the first voice short message that has been input but not sent can be implemented in multiple ways, which are not specifically limited here.
  • Fig. 5 is a schematic diagram of a trigger mode for a first voice short message that has been input but not sent in some embodiments of the present invention.
  • a conversation interface of a social software chat session is displayed on the display screen of the terminal device 102.
  • The first voice short message 501 in the conversation interface has a mark such as "not sent" at one end, which means that the first voice short message 501 is in the input-but-not-sent state. The voice short message 302 in the conversation interface has no "not sent" mark after it, which means that the voice short message 302 is in the sent state.
  • The user can click the voice short message input button 305 and input voice to generate a voice short message that has been input but not sent.
  • The conversation interface of the social software chat session can be as shown in Figure 5b.
  • The conversation interface displays a voice short message edit button 502, and the user can click the voice short message edit button 502 to input the first trigger operation for the first voice short message 501 that has been input but not sent.
  • The user can perform the first trigger operation on the first voice short message that has been input but not sent in order to edit it and obtain the desired voice short message, without having to enter the first voice short message again. Since the user can edit the voice short message before sending it, the user can largely avoid sending inaccurate voice short messages, which improves the receiver's experience.
  • In a voice short message-based chat scenario, after entering a voice short message, the user may not be sure whether the message is clearly expressed.
  • Some embodiments of the present invention therefore also provide a way for the user to check the content of an entered voice short message before sending it.
  • Specifically, the first voice short message that has been input but not sent can be played.
  • When the user has entered the first voice short message but has not performed the trigger operation for sending it, the first voice short message is a voice short message that has been input but not sent.
  • the user can perform a playback operation on the input but not sent first voice short message to control the terminal device to play the input but not sent first voice short message.
  • the specific operation steps for the user to perform the playback operation on the input but not sent first voice short message have multiple implementation manners, which are not specifically limited here.
  • the first user can directly click the input but not sent first voice short message 501 to perform the playback operation.
  • Alternatively, a play button can be displayed after the user clicks the first voice short message 501 that has been input but not sent, and the user clicks the play button to play the first voice short message 501.
  • the user can perform a playback operation on the first voice short message that has been input but not sent, so that the user can aurally obtain the content of the first voice short message, which is convenient and fast, and has good practicability.
  • In the second implementation manner, obtaining the trigger operation input by the user through the conversation interface of the social software in step S201 may include: obtaining a second trigger operation for the first voice short message that has been sent.
  • Before the editing interface for the first voice short message is displayed, the method may further include: withdrawing, in response to the second trigger operation, the first voice short message that has been sent.
  • After the first voice short message is sent, its voice state changes to the sent state. The user can perform a second trigger operation on the sent first voice short message (i.e., the voice short message whose voice state is the sent state); the second trigger operation may be a trigger operation instructing the social software to withdraw the sent first voice short message.
  • The user can input the second trigger operation by clicking the sent first voice short message and then the withdrawal option.
  • the specific operation steps of the user performing the second trigger operation on the sent voice short message may also have other implementation manners, which are not specifically limited here.
  • Fig. 6 is a schematic diagram of a triggering manner for a first voice short message that has been sent in some embodiments of the present invention.
  • a conversation interface of a social software chat conversation is displayed on the display screen of the terminal device 102.
  • The conversation interface is shown in Figure 6a.
  • The conversation interface can display a withdrawal button 601 for withdrawing the first voice short message 302 that has been sent, and the user can click the withdrawal button 601 to input the second trigger operation.
  • In response to the second trigger operation, the social software withdraws the first voice short message 302 that has been sent in the chat session.
  • After the withdrawal, the conversation interface of the social software can be as shown in Figure 6b. The voice short message 602 in Figure 6b can be followed by a "retracted" mark, which means that the voice short message 602 is in the withdrawn state; the voice short message 602 is the first voice short message 302 after withdrawal.
  • the social software may also display an editing interface for the short voice message 602 in the conversation interface.
  • The user can perform a second trigger operation on the first voice short message that has been sent. The social software can then withdraw the sent first voice short message selected by the user, so that the user can edit the withdrawn voice short message to obtain the desired voice short message without having to enter the corresponding first voice short message again, which reduces the user's voice entry time.
  • This implementation also provides users with a remedy for inaccurate voice short messages that have been sent.
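  • The two trigger paths described above can be summarized as a small state table. The sketch below is only an illustration: the enum `VoiceState`, the trigger labels `"send"`, `"first"`, and `"second"`, and the function `apply_trigger` are hypothetical names, and the sketch assumes that the editing interface is shown directly after a withdrawal.

```python
from enum import Enum


class VoiceState(Enum):
    INPUT_NOT_SENT = "input but not sent"
    SENT = "sent"
    WITHDRAWN = "withdrawn"


def apply_trigger(state: VoiceState, trigger: str):
    """Return (new_state, show_editing_interface) for a trigger on a message."""
    if state is VoiceState.INPUT_NOT_SENT and trigger == "send":
        # Sending moves the message to the sent state; nothing is edited.
        return VoiceState.SENT, False
    if state is VoiceState.INPUT_NOT_SENT and trigger == "first":
        # First trigger operation: edit a message that was entered but not sent.
        return VoiceState.INPUT_NOT_SENT, True
    if state is VoiceState.SENT and trigger == "second":
        # Second trigger operation: withdraw the sent message, then allow editing.
        return VoiceState.WITHDRAWN, True
    raise ValueError(f"trigger {trigger!r} is not valid in state {state.value!r}")
```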
  • this specification provides an implementation manner for performing editing operations on multiple first voice short messages.
  • Multiple first voice short messages may be displayed in the editing interface. In this case, obtaining the editing operation based on the editing interface in step S203 may include: obtaining selection operations on multiple first voice short messages in the editing interface.
  • In step S204, editing the first voice short message in response to the editing operation may include: generating a second voice short message. The second voice short message contains the multiple selected first voice short messages, and the playback order of the multiple selected first voice short messages within the second voice short message is consistent with the order in which they were selected.
  • the voice status of any one of the plurality of first voice short messages may be any one of input but not sent status, sent status, or withdrawn status.
  • Fig. 7 is a schematic diagram of an editing manner for multiple voice short messages in some embodiments of the present invention.
  • the terminal device 102 displays an editing interface 701 generated after the user inputs a trigger operation on the conversation interface of the chat session of social software.
  • the editing interface displays a first voice short message 302, a selection option 703 corresponding to the first voice short message 302, a first voice short message 303, and a selection option 704 corresponding to the first voice short message 303.
  • The content expressed by the first voice short message 302 is "I go home to cook tonight".
  • The content expressed by the first voice short message 303 is "I get off work at 5 o'clock this evening".
  • the internal filling color of the selection option 703 in FIG. 7a is white, which may mean that the first voice short message 302 corresponding to the selection option 703 is not selected.
  • the user can perform a selection operation on the first voice short message by clicking the selection option. After the user clicks the selection option, the display state of the selection option can be changed.
  • the fill color of the selection option 705 corresponding to the selection option 703 is black and has a check mark, which can mean that the first voice short message 302 is selected by the user.
  • the user first clicks the selection option 704 and then clicks the selection option 703 to select the first voice short message 303 and then the first voice short message 302.
  • the first voice short message 303 and the first voice short message 302 are spliced together to generate a second voice short message.
  • The content expressed when the second voice short message is played can be "I get off work at 5 o'clock this evening, and I go home to cook tonight".
  • A second voice short message containing multiple selected first voice short messages can be generated according to the multiple first voice short messages selected by the user, and the playback order of the multiple selected first voice short messages within the second voice short message is consistent with the order in which they were selected. The user can adjust the playback order of the entered voice short messages as needed, so that the chat partner can understand the content of the received voice short messages.
  • The user can splice multiple entered voice short messages into one voice short message and send it to the chat partner, so that the chat partner does not need to click on each voice short message to learn the content sent by the user. This reduces the operation steps of the user who is the chat partner and can improve the chat partner's experience with the social software.
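  • The splicing rule (playback order equals selection order) can be sketched as follows. The `VoiceMessage` type and the function name are hypothetical, and the sketch assumes that all messages hold raw audio samples in one shared format so that plain concatenation is valid.

```python
from dataclasses import dataclass
from typing import Iterable, Sequence


@dataclass(frozen=True)
class VoiceMessage:
    message_id: int
    pcm: bytes  # raw audio samples; one shared format is assumed


def splice_in_selection_order(messages: Iterable[VoiceMessage],
                              selected_ids: Sequence[int]) -> bytes:
    """Concatenate the selected messages in the order the user selected them."""
    by_id = {m.message_id: m for m in messages}
    # The order of `selected_ids` is the selection order, not the display order.
    return b"".join(by_id[i].pcm for i in selected_ids)
```

  • With the example of FIG. 7, selecting message 303 first and message 302 second yields audio in which the content of 303 plays before the content of 302.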
  • the input voice short message may contain content that needs to be retained or content that needs to be deleted.
  • In the implementation in which the editing operation is a selection operation on text in the text editing interface, step S204 can obtain the edited voice short message in various ways.
  • One implementation manner may be: deleting the voice short message corresponding to the text selected by the selection operation. Another implementation manner may be: keeping the voice short message corresponding to the text selected by the selection operation.
  • In the first implementation manner, editing the first voice short message in response to the editing operation in step S204 to obtain the edited voice short message can include: determining, according to a start time and an end time, the voice short message selected by the selection operation; and deleting the selected voice short message to obtain the remaining voice short message.
  • The start time is the pronunciation start time, in the first voice short message, of the first piece of text selected by the selection operation.
  • The end time is the pronunciation end time, in the first voice short message, of the last piece of text selected by the selection operation.
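  • A minimal sketch of how a text selection could be mapped to a start time and an end time. It assumes the speech recognizer returns a pronunciation start and end time for each piece of recognized text; that per-token timing data, the `TimedToken` type, and the function name are assumptions of this sketch rather than details given in the text.

```python
from typing import NamedTuple, Sequence, Tuple


class TimedToken(NamedTuple):
    text: str
    start: float  # pronunciation start time in seconds
    end: float    # pronunciation end time in seconds


def selection_time_range(tokens: Sequence[TimedToken],
                         first_index: int, last_index: int) -> Tuple[float, float]:
    """Map a selection (inclusive token indices) to (start time, end time)."""
    if not 0 <= first_index <= last_index < len(tokens):
        raise ValueError("selection is outside the recognized text")
    # Start of the first selected token, end of the last selected token.
    return tokens[first_index].start, tokens[last_index].end
```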
  • Deleting the selected voice short message to obtain the remaining voice short message may include: parsing the first audio file corresponding to the first voice short message to obtain first file header data and first audio data; determining, from the first audio data, the second audio data corresponding to the selected voice short message; determining third audio data, the third audio data being the audio data remaining after the second audio data is removed from the first audio data; determining second file header data according to the first file header data and the third audio data; and generating a second audio file according to the second file header data and the third audio data.
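  • The parse, remove, and regenerate steps above can be sketched with Python's standard `wave` module. The text does not name an audio format, so the use of uncompressed WAV here is an assumption, as is the function name `delete_segment`.

```python
import io
import wave


def delete_segment(wav_bytes: bytes, t_start: float, t_end: float) -> bytes:
    """Remove the audio between t_start and t_end (seconds) from a WAV file."""
    if t_start > t_end:
        raise ValueError("t_start must not exceed t_end")
    # Parse the first audio file into header parameters and first audio data.
    with wave.open(io.BytesIO(wav_bytes), "rb") as src:
        nchannels = src.getnchannels()
        sampwidth = src.getsampwidth()
        framerate = src.getframerate()
        nframes = src.getnframes()
        first_audio = src.readframes(nframes)
    frame_size = nchannels * sampwidth
    start_frame = min(max(int(t_start * framerate), 0), nframes)
    end_frame = min(max(int(t_end * framerate), 0), nframes)
    # Third audio data: the first audio data with the selected range removed.
    third_audio = (first_audio[:start_frame * frame_size]
                   + first_audio[end_frame * frame_size:])
    # Writing through `wave` produces a header whose sizes match the new data.
    out = io.BytesIO()
    with wave.open(out, "wb") as dst:
        dst.setnchannels(nchannels)
        dst.setsampwidth(sampwidth)
        dst.setframerate(framerate)
        dst.writeframes(third_audio)
    return out.getvalue()
```

  • Letting the library write the header is a design choice of this sketch; it plays the role of determining the second file header data from the first file header data and the third audio data.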
  • the audio file is parsed to obtain file header data and audio data
  • the file header data includes the total number of bytes of the audio file and the number of bytes of audio data.
  • the first file header data includes: the total number of bytes of the first audio file is 220 bytes, and the number of bytes of the first audio data is 200 bytes. Since the number of bytes of the first file header is the difference between the total number of bytes of the first audio file and the number of bytes of the first audio data, it can be known that the number of bytes of the first file header is 20 bytes. Among them, the number of bytes of audio data has a positive linear correlation with the duration of audio data.
  • the duration of the first audio data is 6 seconds
  • the determined duration of the second audio data is 3 seconds.
  • the number of bytes of the second audio data the duration of the second audio data/the duration of the first audio data Duration * The number of bytes of the first audio data
  • the number of bytes of the second audio data can be calculated to be 100 bytes
  • the number of bytes of the third audio data is the number of bytes of the first audio data and the second audio data
  • the difference in the number of bytes of the third audio data is 100 bytes. Since usually the number of bytes of the file header of the audio file can be the same, it can be determined that the number of bytes of the second file header is 20 bytes.
  • the total number of bytes of the second audio file is the sum of the number of bytes of the second file header and the number of bytes of the third audio data, that is, the total number of bytes of the second audio file is 120 bytes.
  • the corresponding second file header data may include: the total number of bytes of the second audio file, which is 120 bytes, and the number of bytes of the third audio data, which is 100 bytes; the second audio file can then be generated based on the second file header data and the third audio data.
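The byte arithmetic above can be illustrated with a minimal Python sketch. It assumes a hypothetical 20-byte header layout (a 4-byte tag, the total file size, the audio data size, and 8 padding bytes); the source does not specify a real audio format, so the layout, the tag `VSM0`, and the function names are illustrative assumptions only.

```python
import struct

# Hypothetical header layout (an assumption, not a real audio format):
# 4-byte tag, total file bytes, audio data bytes, 8 padding bytes.
HEADER_FMT = "<4sII8x"
HEADER_SIZE = struct.calcsize(HEADER_FMT)  # 20 bytes, as in the worked example


def build_file(audio: bytes) -> bytes:
    """Generate an audio file from audio data by prepending a fresh header."""
    header = struct.pack(HEADER_FMT, b"VSM0", HEADER_SIZE + len(audio), len(audio))
    return header + audio


def parse_file(blob: bytes):
    """Split an audio file into its header fields and its audio data."""
    _tag, total, audio_len = struct.unpack(HEADER_FMT, blob[:HEADER_SIZE])
    header = {"total_bytes": total, "audio_bytes": audio_len}
    return header, blob[HEADER_SIZE:HEADER_SIZE + audio_len]


def time_to_offset(t_s: float, duration_s: float, audio_len: int) -> int:
    """Map a time to a byte offset, assuming bytes grow linearly with duration."""
    return round(t_s / duration_s * audio_len)


def delete_segment(blob: bytes, start_s: float, end_s: float, duration_s: float) -> bytes:
    """Remove the selected span (second audio data) and rebuild the file."""
    header, first_audio = parse_file(blob)
    lo = time_to_offset(start_s, duration_s, header["audio_bytes"])
    hi = time_to_offset(end_s, duration_s, header["audio_bytes"])
    third_audio = first_audio[:lo] + first_audio[hi:]  # what remains after deletion
    return build_file(third_audio)


first_file = build_file(bytes(range(200)))          # 220 bytes in total
second_file = delete_segment(first_file, 1.0, 4.0, 6.0)
second_header, third_audio = parse_file(second_file)
```

With a 6-second message and a 3-second selection, the rebuilt header reports 120 total bytes and 100 audio bytes, which agrees with the figures in the example. A real codec would also need the cut points aligned to frame boundaries, which this sketch ignores.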
  • the voice short message selected by the selection operation can be determined, and the selected voice short message can be deleted to obtain the remaining voice short message. This suits the situation where only a small part of the first voice short message needs to be deleted: the user can complete the editing operation with a few selection operations, which is convenient and quick.
  • the method may further include: displaying a first operation option for deleting the selected voice short message; and acquiring a trigger operation for the first operation option.
  • the deleting the selected voice short message may include: deleting the selected voice short message after the first operation option is triggered.
  • if the user finds that the selection operation was performed incorrectly and text that does not need to be edited was selected, the user can click the operation option for canceling the selection to return to the text editing interface and perform the selection operation again.
  • the user can control the social software to delete the voice short message corresponding to the selected text by clicking the first operation option for deleting the selected voice short message.
  • the first operation option for deleting the selected voice short message is provided, and the selected voice short message is deleted after the first operation option is triggered, so that the user can confirm the edit before the first voice short message is edited accordingly. This improves the accuracy of the remaining voice short message obtained after editing the first voice short message, reduces the probability of generating an incorrect voice short message due to user operation errors, and thereby enhances the user experience.
  • step S204, in response to the editing operation, editing the first voice short message to obtain the edited voice short message, can include:
  • the start time is the pronunciation start time corresponding to the first text in the text selected by the selection operation in the first voice short message
  • the end time is the pronunciation end time corresponding to the last text in the text selected by the selection operation in the first voice short message.
  • the retaining of the selected voice short message may include: parsing the first audio file corresponding to the first voice short message to obtain first file header data and first audio data; determining, from the first audio data, the second audio data corresponding to the selected voice short message; determining second file header data according to the first file header data and the second audio data; and generating a second audio file according to the second file header data and the second audio data.
  • the way the second audio file is generated from the second file header data and the second audio data is basically the same as the way it is generated from the second file header data and the third audio data described above, so it is not repeated here.
  • the voice short message selected by the selection operation may be determined, and the selected voice short message may be retained as the edited voice short message. This suits the case where only a small part of the first voice short message needs to be retained: the user can complete the editing operation with a small number of selection operations, which is convenient and quick.
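As a rough illustration of the retain case, the Python sketch below derives the start time and end time from the selected text and keeps only the matching audio bytes. It assumes that a per-character pronunciation alignment is available (for example from the recognition step) and that byte offsets grow linearly with time; the `TimedChar` type and both function names are hypothetical.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass
class TimedChar:
    """One recognized character with its pronunciation start and end time (seconds)."""
    char: str
    start_s: float
    end_s: float


def selection_time_range(alignment: List[TimedChar], first_idx: int, last_idx: int) -> Tuple[float, float]:
    """Start time of the first selected character and end time of the last one."""
    return alignment[first_idx].start_s, alignment[last_idx].end_s


def retain_audio(audio: bytes, start_s: float, end_s: float, duration_s: float) -> bytes:
    """Keep only the audio bytes between start_s and end_s."""
    lo = round(start_s / duration_s * len(audio))
    hi = round(end_s / duration_s * len(audio))
    return audio[lo:hi]


# Six characters of one second each; the user selects characters 2 to 4.
alignment = [TimedChar(c, float(i), float(i + 1)) for i, c in enumerate("abcdef")]
start_s, end_s = selection_time_range(alignment, 2, 4)
first_audio = bytes(range(200))                      # 200 bytes for 6 seconds
second_audio = retain_audio(first_audio, start_s, end_s, 6.0)
```

Here the selection covers seconds 2 to 5, so half of the 200 audio bytes are kept. Writing a new header for the retained data would follow the same pattern as in the deletion case.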
  • the retaining the selected voice short message may further include:
  • the retaining the selected voice short message may include:
  • the selected short voice message is retained.
  • the first operation option for retaining the selected voice short message is provided, and the selected voice short message is retained after the first operation option is triggered, so that the user can confirm the edit before the first voice short message is edited accordingly. This improves the accuracy of the voice short message obtained after editing the first voice short message, reduces the probability of generating an incorrect voice short message due to user operation errors, and thereby enhances the user experience.
  • step S204 After obtaining the edited voice short message, it may also include:
  • a voice short message sending button may also be displayed on the conversation interface of the chat session of the social software, and the user can click the voice short message sending button to send the edited voice short message to the chat partner.
  • the edited voice short message can be presented in the input but not sent state.
  • the edited voice short message can be presented in the sent state.
  • in the conversation interface, this makes it convenient for the user to recognize the status of each voice short message.
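The two presentation states can be modeled with a small Python sketch. The names `VoiceState` and `VoiceMessage` are hypothetical, and how each state is rendered in the conversation interface is left out.

```python
from dataclasses import dataclass
from enum import Enum


class VoiceState(Enum):
    INPUT_NOT_SENT = "input but not sent"
    SENT = "sent"


@dataclass
class VoiceMessage:
    audio: bytes
    state: VoiceState = VoiceState.INPUT_NOT_SENT  # an edited message starts unsent

    def send(self) -> None:
        """Called when the user clicks the voice short message sending button."""
        if self.state is VoiceState.SENT:
            raise RuntimeError("message was already sent")
        self.state = VoiceState.SENT


edited = VoiceMessage(audio=b"\x00\x01")
state_before = edited.state
edited.send()
state_after = edited.state
```

Keeping the state on the message object lets the interface pick a different visual style per state without tracking it separately.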
  • some embodiments of the present invention also provide a method for processing a voice short message whose execution subject is a terminal device or a server.
  • FIG. 8 is a schematic flowchart of another method for processing voice short messages according to some embodiments of the present invention. As shown in Figure 8, the process may include:
  • Step S801 The server obtains the first short voice message uploaded by the terminal; the first short voice message is input to the terminal through the conversation interface of the chat session of the social software of the terminal.
  • Step S802 The terminal obtains the trigger operation input by the user through the conversation interface of the chat session of the social software.
  • Step S803 The terminal sends a voice recognition instruction to the server, where the voice recognition instruction is used to instruct to convert the first voice short message into corresponding text information.
  • the conversion process of the first voice short message can be executed locally by the terminal or by the server.
  • the terminal may send a voice recognition instruction to the server to instruct the server to generate a conversion result for the first voice short message.
  • Step S804 The server recognizes the first voice short message to obtain a conversion result; the conversion result includes text information corresponding to the first voice short message.
  • Step S805 The server sends the conversion result to the terminal.
  • Step S806 The terminal displays a text editing interface for the first voice short message according to the conversion result, and the text editing interface contains the text corresponding to the text information.
  • Step S807 The terminal obtains the editing operation input through the editing interface of the social software of the terminal, and generates an editing instruction based on the editing operation.
  • Step S808 The terminal sends an editing instruction to the server, and the editing instruction includes information corresponding to the editing operation.
  • the editing process for the first voice short message can be executed locally by the terminal or by the server.
  • the terminal may send an editing instruction to the server to instruct the server to edit the first voice short message.
  • Step S809 In response to the editing instruction, the server edits the first voice short message to obtain the edited voice short message.
  • Step S810 The server sends the edited voice short message to the terminal.
  • the terminal may obtain the processing result of editing the first voice short message fed back by the server; and obtain the edited voice short message based on the processing result.
  • the above describes the interaction between the terminal device and the server during execution of the method. Because the server recognizes, converts and edits the first voice short message, the method requires few changes to the existing application system, is easy to implement, and can improve the operating speed of the mobile terminal, thereby improving the user experience.
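The interaction in steps S801 to S810 can be sketched in Python as two in-process objects. This is only an illustration under several assumptions: the network transport is replaced by direct method calls, speech recognition is an injected placeholder callable, and the instruction fields (`msg_id`, `action`, `byte_range`) are hypothetical names.

```python
from typing import Callable, Dict, Tuple


class Server:
    """Stand-in for the server side (S801, S804-S805, S809-S810)."""

    def __init__(self, recognizer: Callable[[bytes], str]):
        self.recognizer = recognizer              # hypothetical speech-to-text function
        self.messages: Dict[str, bytes] = {}

    def upload(self, msg_id: str, audio: bytes) -> None:
        self.messages[msg_id] = audio             # S801: store the uploaded message

    def recognize(self, msg_id: str) -> dict:
        text = self.recognizer(self.messages[msg_id])   # S804: audio to text
        return {"msg_id": msg_id, "text": text}         # S805: conversion result

    def edit(self, instruction: dict) -> bytes:
        audio = self.messages[instruction["msg_id"]]
        lo, hi = instruction["byte_range"]
        if instruction["action"] == "delete":           # S809: drop the selection
            return audio[:lo] + audio[hi:]
        if instruction["action"] == "retain":           # S809: keep only the selection
            return audio[lo:hi]
        raise ValueError("unknown action: " + instruction["action"])


class Terminal:
    """Stand-in for the terminal side (S802-S803, S806-S808)."""

    def __init__(self, server: Server):
        self.server = server

    def record(self, msg_id: str, audio: bytes) -> None:
        self.server.upload(msg_id, audio)

    def on_trigger(self, msg_id: str) -> str:
        result = self.server.recognize(msg_id)    # S802-S803: request recognition
        return result["text"]                     # S806: text for the editing interface

    def on_edit(self, msg_id: str, action: str, byte_range: Tuple[int, int]) -> bytes:
        instruction = {"msg_id": msg_id, "action": action, "byte_range": byte_range}
        return self.server.edit(instruction)      # S807-S810: send and receive result


server = Server(recognizer=lambda audio: "hello world")
terminal = Terminal(server)
terminal.record("m1", bytes(range(10)))
shown_text = terminal.on_trigger("m1")
after_delete = terminal.on_edit("m1", "delete", (2, 5))
after_retain = terminal.on_edit("m1", "retain", (2, 5))
```

The terminal only forwards instructions and displays results, which mirrors the point that recognition, conversion and editing can all be carried out on the server.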
  • the present invention also provides a voice short message processing method, and the execution subject of the method is a terminal.
  • the steps of the method may include:
  • the terminal obtains the trigger operation input by the user through the conversation interface of the chat conversation of the social software.
  • an editing interface for the first voice short message is displayed; the first voice short message is input by the user through the conversation interface.
  • the displaying of the editing interface for the first voice short message may include: according to the conversion result, displaying a text editing interface for the first voice short message, the text editing interface containing the text corresponding to the text information.
  • in the above method for processing voice short messages, whose execution subject is a mobile terminal, the terminal sends to the server a voice recognition instruction, which instructs the server to convert the first voice short message into corresponding text information, and an editing instruction, which contains information about the user's editing operation on the first voice short message. The terminal then receives the processing result of editing the first voice short message fed back by the server to obtain the edited voice short message.
  • the terminal does not need to recognize, convert or edit the first voice short message itself, so its load is small, which helps improve the operating speed of the mobile terminal and thereby the user experience.
  • some embodiments of the present invention also provide another voice short message processing method, and the execution subject of the method is the server.
  • the method can include the following steps:
  • the server obtains the first voice short message uploaded by the terminal; the first voice short message is input to the terminal through a conversation interface of a chat session of the social software of the terminal.
  • an editing instruction sent by the terminal the editing instruction being generated based on an editing operation input through an editing interface of the social software of the terminal.
  • the first voice short message is edited to obtain the edited voice short message.
  • the server acquires a voice recognition instruction sent by the terminal, where the voice recognition instruction is used to instruct to convert the first voice short message into corresponding text information; recognize the first voice short message to obtain a conversion result; the conversion The result includes the text information corresponding to the first voice short message; the conversion result is sent to the terminal.
  • in the above method for processing voice short messages, whose execution subject is the server, the server receives and responds to the voice recognition instruction sent by the terminal, which instructs the server to convert the first voice short message into corresponding text information, and the editing instruction, which contains information about the user's editing operation on the first voice short message.
  • the server feeds back to the mobile terminal the recognition and conversion result for the first voice short message and the corresponding editing result, so that the terminal does not need to recognize, convert or edit the first voice short message itself. This reduces the load on the mobile terminal and helps improve its operating speed, thereby enhancing the user experience.
  • the device 900 includes a memory 930 for storing computer program instructions 920 and a processor 910 for executing the program instructions 920, wherein, when the computer program instructions are executed by the processor, the device is triggered to execute a voice short message processing method as provided in the foregoing embodiments.
  • the device when the computer program instructions are executed by the processor, the device is triggered to perform the following steps:
  • the first voice short message is edited to obtain the edited voice short message.
  • the device when the computer program instructions are executed by the processor, the device is triggered to perform the following steps:
  • the terminal obtains the trigger operation input by the user through the conversation interface of the chat conversation of social software
  • the device when the computer program instructions are executed by the processor, the device is triggered to perform the following steps:
  • the server obtains the first voice short message uploaded by the terminal; the first voice short message is input to the terminal through the conversation interface of the chat session of the social software of the terminal;
  • some embodiments of the present invention also provide a computer-readable medium corresponding to the above method, on which computer-readable instructions are stored, and the computer-readable instructions can be executed by a processor to implement the method for processing voice short messages provided in the above embodiments.
  • the storage medium such as ROM/RAM, magnetic disk, optical disk, etc.
  • a typical implementation device is a computer.
  • the computer may be, for example, a personal computer, a laptop computer, a cell phone, a camera phone, a smart phone, a personal digital assistant, a media player, a navigation device, an email device, a game console, a tablet computer, a wearable device, or any combination of these devices.
  • the embodiments of the present invention may be provided as methods, systems, or computer program products. Therefore, the present invention may adopt the form of a complete hardware embodiment, a complete software embodiment, or an embodiment combining software and hardware. Moreover, the present invention may adopt the form of a computer program product implemented on one or more computer-usable storage media (including but not limited to disk storage, CD-ROM, optical storage, etc.) containing computer-usable program codes.
  • These computer program instructions can also be stored in a computer-readable memory that can direct a computer or other programmable data processing equipment to work in a specific manner, so that the instructions stored in the computer-readable memory produce an article of manufacture including an instruction device. The instruction device implements the functions specified in one or more processes of the flowchart and/or one or more blocks of the block diagram.
  • These computer program instructions can also be loaded onto a computer or other programmable data processing equipment, so that a series of operation steps are executed on the computer or other programmable equipment to produce computer-implemented processing. The instructions executed on the computer or other programmable equipment thus provide steps for implementing the functions specified in one or more processes of the flowchart and/or one or more blocks of the block diagram.
  • the computing device includes one or more processors (CPUs), input/output interfaces, network interfaces, and memory.
  • the memory may include non-permanent memory in a computer-readable medium, random access memory (RAM) and/or non-volatile memory, such as read-only memory (ROM) or flash memory (flash RAM).
  • Computer-readable media include permanent and non-permanent, removable and non-removable media, and information storage can be realized by any method or technology.
  • the information can be computer-readable instructions, data structures, program modules, or other data.
  • Examples of computer storage media include, but are not limited to, phase change memory (PRAM), static random access memory (SRAM), dynamic random access memory (DRAM), other types of random access memory (RAM), read-only memory (ROM), electrically erasable programmable read-only memory (EEPROM), flash memory or other memory technology, CD-ROM, digital versatile disc (DVD) or other optical storage, magnetic cassettes, magnetic tape, magnetic disk storage or other magnetic storage devices, or any other non-transmission media that can be used to store information accessible by computing devices. As defined herein, computer-readable media do not include transitory media, such as modulated data signals and carrier waves.
  • the invention can be described in the general context of computer-executable instructions executed by a computer, such as a program module.
  • program modules include routines, programs, objects, components, data structures, etc. that perform specific tasks or implement specific abstract data types.
  • the present invention can also be practiced in distributed computing environments in which tasks are performed by remote processing devices connected through a communication network.
  • program modules can be located in local and remote computer storage media including storage devices.

Landscapes

  • Engineering & Computer Science (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Signal Processing (AREA)
  • Information Transfer Between Computers (AREA)
  • User Interface Of Digital Computer (AREA)
  • Telephonic Communication Services (AREA)

Abstract

The invention relates to a method and a device for processing voice short messages, and a medium. The solution comprises: acquiring a trigger operation input by a user through a conversation interface of a chat session of social software; in response to the trigger operation, displaying an editing interface for a first voice short message; acquiring an editing operation based on the editing interface; and in response to the editing operation, editing the first voice short message to obtain an edited voice short message. With the method, device or medium of the present invention, a user can perform an editing operation on a voice short message so as to obtain the required voice short message without having to re-input it, which improves the efficiency of communication based on voice short messages.
PCT/CN2020/086508 2019-04-30 2020-04-23 Method and device for processing voice short messages, and medium WO2020221105A1 (fr)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201910364866.3A CN110061910B (zh) 2019-04-30 2019-04-30 Method, device and medium for processing voice short messages
CN201910364866.3 2019-04-30

Publications (1)

Publication Number Publication Date
WO2020221105A1 true WO2020221105A1 (fr) 2020-11-05

Family

ID=67322089

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2020/086508 WO2020221105A1 (fr) 2019-04-30 2020-04-23 Method and device for processing voice short messages, and medium

Country Status (2)

Country Link
CN (1) CN110061910B (fr)
WO (1) WO2020221105A1 (fr)

Also Published As

Publication number Publication date
CN110061910B (zh) 2021-11-30
CN110061910A (zh) 2019-07-26

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 20798993

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

32PN Ep: public notification in the ep bulletin as address of the addressee cannot be established

Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC (EPO FORM 1205A DATED 04/02/2022)

122 Ep: pct application non-entry in european phase

Ref document number: 20798993

Country of ref document: EP

Kind code of ref document: A1