WO2019179014A1 - 语音消息搜索显示方法、装置、计算机设备及存储介质 - Google Patents

语音消息搜索显示方法、装置、计算机设备及存储介质 Download PDF

Info

Publication number
WO2019179014A1
WO2019179014A1 PCT/CN2018/101071 CN2018101071W WO2019179014A1 WO 2019179014 A1 WO2019179014 A1 WO 2019179014A1 CN 2018101071 W CN2018101071 W CN 2018101071W WO 2019179014 A1 WO2019179014 A1 WO 2019179014A1
Authority
WO
WIPO (PCT)
Prior art keywords
voice message
message
voice
preset
keyword
Prior art date
Application number
PCT/CN2018/101071
Other languages
English (en)
French (fr)
Inventor
张雨嘉
Original Assignee
平安科技(深圳)有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 平安科技(深圳)有限公司 filed Critical 平安科技(深圳)有限公司
Publication of WO2019179014A1 publication Critical patent/WO2019179014A1/zh

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/60Information retrieval; Database structures therefor; File system structures therefor of audio data
    • G06F16/68Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
    • G06F16/686Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using information manually generated, e.g. tags, keywords, comments, title or artist information, time, location or usage information, user ratings
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/60Information retrieval; Database structures therefor; File system structures therefor of audio data
    • G06F16/63Querying
    • G06F16/638Presentation of query results

Definitions

  • the present application relates to the field of data processing technologies, and in particular, to a voice message search and display method, apparatus, computer device, and storage medium.
  • the embodiment of the present application provides a voice message search and display method, device, computer device, and storage medium, which can search for a voice message and display the search result according to a preset format.
  • an embodiment of the present application provides a voice message search and display method, where the method includes:
  • the message search instruction includes a keyword; searching for a text message matching the keyword in the preset file, wherein the data saved in the preset file includes a text message corresponding to the voice message;
  • the voice message search result corresponding to the text message matched by the keyword is displayed in a preset format.
  • an embodiment of the present application provides a voice message search and display device, where the device includes a unit for performing the voice message search and display method according to the above first aspect.
  • an embodiment of the present application provides a computer device, where the computer device includes a memory, and a processor connected to the memory;
  • the memory is for storing a computer program for executing a computer program stored in the memory to perform the voice message search display method of the first aspect described above.
  • an embodiment of the present application provides a computer readable storage medium, where the computer readable storage medium stores a computer program, where the computer program includes program instructions, and when the program instructions are executed by a processor, implementing the foregoing The voice message search display method of the first aspect.
  • the embodiment of the present application can search for a voice message, obtain a voice message that matches the search keyword, and display the search result of the searched voice message according to a preset format, so that the user can conveniently view the voice message matched with the search keyword, and improve The efficiency of querying voice messages improves the user experience.
  • FIG. 1 is a schematic flowchart of a voice message search and display method according to an embodiment of the present application
  • FIG. 2 is a schematic flowchart of a voice message search display according to another embodiment of the present application.
  • FIG. 3 is a schematic diagram of a sub-flow of a voice message search and display method according to an embodiment of the present application
  • FIG. 4 is a diagram showing an example of displaying a voice message search result matching a keyword according to an embodiment of the present application
  • FIG. 5 is a schematic diagram of another sub-flow of a voice message search and display method according to an embodiment of the present application.
  • FIG. 6 is a schematic flowchart of a voice message search and display method according to another embodiment of the present application.
  • FIG. 7 is a schematic block diagram of a voice message search and display device provided by an embodiment of the present application.
  • FIG. 8 is a schematic block diagram of a voice message search and display device according to another embodiment of the present application.
  • FIG. 9 is a schematic block diagram of a display unit provided by an embodiment of the present application.
  • FIG. 10 is a schematic block diagram of a display unit according to another embodiment of the present application.
  • FIG. 11 is a schematic block diagram of a voice message search and display apparatus according to another embodiment of the present application.
  • FIG. 12 is a schematic block diagram of a computer device according to an embodiment of the present application.
  • first, second, etc. may be used herein to describe various elements, these elements should not be limited to these terms. These terms are only used to distinguish these elements from each other.
  • first acquisition unit may be referred to as a second acquisition unit without departing from the scope of the present application, and similarly, the second acquisition unit may be referred to as a first acquisition unit.
  • the first acquisition unit and the second acquisition unit are both acquisition units, but they are not the same acquisition unit.
  • FIG. 1 is a schematic flowchart diagram of a voice message search and display method according to an embodiment of the present application. The method includes the following steps S101-S103.
  • the search keyword may be input in a search query item on an instant communication tool such as a WeChat homepage, click a search button or detect completion of the input, generate a message search instruction; or open a specific communication object such as a chat object, in a specific In the corresponding interface of the communication object, find a related button such as "find chat record", click the button, input a search keyword, click a search button or detect the completion of the input, that is, generate a message search instruction, wherein the communication object can be a single contact People can also be groups. Among them, the way the keyword is input, including the text form, and the voice form. The keyword input in the form of voice needs to convert the voice into a corresponding keyword in the form of text according to the voice recognition.
  • the search instruction may further include target time segment information selected in two time periods, that is, the search instruction may further include time information; in some embodiments, the search instruction may further include at least The target contact information selected in the interface of the two contacts, that is, the search instruction may also include the target contact information.
  • the device obtains the user's communication record such as chat history. If the message in the chat record is a voice message, the device converts the voice message into a corresponding text message according to the voice recognition algorithm, and saves it in the preset file for use in searching.
  • the preset file includes a text message corresponding to the voice message. Searching for a text message matching the keyword in the preset file, such as the keyword "Zoo", searching in the text message saved in the preset file, and if the search includes a text message related to "Zoo", then it is considered
  • the text message is a text message that matches the keyword.
  • the search includes various ways of searching, such as fuzzy search, precise search and the like.
  • the preset format is related to the duration of the voice message, or the length of the display line of the device screen, the duration of the voice message corresponding to the voice message, and the number of words of the text message corresponding to the voice message.
  • the voice message in the instant communication tool is searched, the voice message matching the search keyword is obtained, and the search result of the searched voice message is displayed according to a preset format, which is convenient for the user to view the voice matching the search keyword.
  • the message improves the efficiency of querying voice messages and improves the user experience.
  • FIG. 2 is a schematic flowchart diagram of a voice message search and display method according to another embodiment of the present disclosure.
  • the method includes steps S201-S207.
  • steps S201-S204 before receiving the message search command.
  • steps S201-S204 will be described in detail below.
  • the steps S205-S207 correspond to the steps in the embodiment shown in FIG. 1. Referring to the description in the embodiment shown in FIG. 1, details are not described herein.
  • the preset voice message duration can be preset, such as 1 minute. It can also be set according to the user's habits, that is, to receive the user's settings. After the preset voice message duration is set, the modification may be performed. For example, the preset voice message duration modified by the user may be received, or another suitable duration set by the server may be received as the new preset voice message duration according to the feedback of the user. It can be understood that if the duration of the preset voice message is exceeded, it means that the user is not willing to read such a long voice at a time. If the user reads a piece of voice message, it is not very clear about a certain segment of the voice message. The user only wants to repeat the corresponding voice of the segment, and does not want to start from the beginning every time the voice is heard. In this case, if you start from scratch every time, it will affect the user's experience.
  • the long voice message refers to a voice message that exceeds the duration of the preset voice message. Segmenting the long voice message to form a plurality of voice messages, including: segmenting the long voice message according to time to form a plurality of voice messages, or detecting a speaking pause position in the long voice message, stopping the position according to time and speaking The long voice message is segmented to form a plurality of voice messages. The long voice message is segmented according to the time to form a plurality of voice messages. It can be understood that the long voice message is segmented at intervals, and the time is shorter than the preset voice message duration, such as every 30 seconds. The voice message is divided into multiple segments.
  • the segmentation point in combination with the time and the pause position of the speech. For example, when the recording time exceeds 30 s, if the speech pause position is detected at the same time, the speech pause position is used as the segmentation point, and the next segmentation point is the previous segment. The interval between segment points is above 30s. If the time interval between the last segment point and the end of the voice message is less than 30 s, the voice after the last segment point is also used as a segmented speech. Wherein, the speech pause position can be detected according to the sound wave change corresponding to the voice message.
  • the sound wave of the segment having a relatively low average amplitude can be taken.
  • the time corresponding to the intermediate position is used as the speaking pause position, and the long voice message is divided into a plurality of segments according to the time and the speaking pause position. It should be noted that in this embodiment, the 30s time is only an example, and in other embodiments, the time may be set to other values.
  • the preset voice message duration is exceeded, and multiple voice messages formed by the segmentation are converted into corresponding text messages, and in another case, if the preset voice is not exceeded
  • the message duration is converted into a corresponding text message.
  • the voice message can be converted to a corresponding text message using a speech recognition algorithm on the device.
  • the following operations are further performed: determining whether the interval between the current time and the time of receiving the voice message reaches the withdrawal message duration; if the withdrawal message duration is reached, The voice message is converted into a corresponding text message by voice recognition.
  • the withdrawal message length refers to the time set in the instant messaging tool to withdraw the message, such as 2s. It can be understood that if the interval between the current time and the time of receiving the voice message reaches the withdrawal message duration, the message will not be withdrawn by the user, and the voice message is first converted into a corresponding text message, and then the voice message is withdrawn. Case. This can reduce the amount of processing that the device converts the recalled voice message into a corresponding text message. It should be noted that if the withdrawal message duration is not reached, the voice message needs to be segmented and displayed. Just do not convert the voice message into a corresponding text message.
  • the long voice message is segmented to form multiple voice messages, and then the voice message is converted into a corresponding text message by voice recognition and saved. .
  • the long voice message exceeding the preset voice message duration is divided into multiple segments, and the user can segment or search for the content of the corresponding voice message.
  • the voice message can be played only without segmentation. Played from the beginning every time, improving the user experience.
  • the method further includes: detecting whether there are multiple text messages matching the keyword;
  • the keyword matching text message has multiple pieces, and the voice message search results corresponding to the plurality of text messages are sorted according to a preset rule.
  • the voice message search result corresponding to the text message matched with the keyword is displayed in a preset format, including: displaying the sorted voice message search result corresponding to the text message matched by the keyword according to a preset format.
  • the preset rule includes: following the time sequence of the voice message received, and/or sorting according to the matching degree of the text message corresponding to the voice message and the keyword, or according to the forgetting curve of the person according to the time of the different voice message time Sort the possibilities and so on. Specifically, the time of receiving the voice message is sorted from late to early, that is, the received voice message is sorted in the front; and the matching degree of the text message corresponding to the voice message and the keyword is sorted from high to low. That is, the sort with high matching degree is in front; the sorting possibility is sorted from high to low according to the possibility of forgetting, and the sorting with high possibility of forgetting is in front.
  • the voice message search result corresponding to the text message matched with the keyword is displayed in a preset format, that is, step S103, including the following steps S301-S304.
  • the preset format is related to the duration of the voice message.
  • the preset duration can be 30s.
  • the first preset format includes: a voice message, and a text content of a preset number of words corresponding to the keyword before and after the keyword in the voice message.
  • the first preset format may further include: sender information corresponding to the voice message, and time of sending the voice message.
  • the keyword may be highlighted, such as distinguishing color or bold, etc., the sender information includes a sender nickname and/or a sender avatar, etc., the voice information includes a voice and/or a voice message duration, etc.; the preset word number includes a keyword
  • the number of words, the preset number can be set to 16 words, or can be set to other words.
  • the text message corresponding to the voice message exceeds the preset number of words, other texts other than the preset number of words may be replaced by an ellipsis. If the keyword is: eat, the default number of words is 16, then the text message can be displayed as: ... where you eat, send a position to .... If the number of words of the text message corresponding to the voice message is less than the preset number of times, in addition to displaying the text message, other than the preset number of words may be replaced by an ellipsis. It can be understood that there are a lot of noises in the voice message, and there are not many voice messages.
  • the second preset format includes: voice information, and text content corresponding to the voice message.
  • the second preset format may further include: sender information corresponding to the voice message, and time of sending the voice message.
  • the keyword information is highlighted, such as distinguishing colors or bolding, and the sender information includes a sender nickname and/or a sender avatar, and the voice information includes a voice and/or a voice message duration.
  • the specific display effect can be seen in Figure 4.
  • FIG. 4 is a diagram showing an example of a voice message search result display matching a keyword.
  • a voice message search result matching the keyword is displayed on the screen 11 of the terminal 10.
  • the keyword 110 is "Zoo"
  • the sender information includes a sender image 120 and a sender nickname 130.
  • the voice message includes voice 160 and voice message duration 150.
  • the text content 140 corresponding to the voice message, wherein the keyword "Zoo” can be seen as a bold display.
  • the time 170 of the voice message is displayed as: 2018-01-01. In other embodiments, the time of sending the voice message may also be specific to minutes and the like.
  • the voice message search result corresponding to the text message matched with the keyword is displayed in a preset format, that is, step S103 includes the following steps S501-S504.
  • the preset format is related to the length of the display line of the search result interface, the duration of the voice message, and the number of words of the text message corresponding to the voice message. It can be understood that, in the search result interface display line, a voice message search result corresponding to the text message matching the keyword is displayed, that is, a search result of the voice message is displayed in one line of the search result interface, so as to facilitate the user to view.
  • S503. Determine whether the length of the voice message display and the length of the word display exceed the length of the display line of the search result interface.
  • the third preset format includes: a voice message, and a text content of a preset number of words corresponding to the keyword before and after the keyword in the voice message.
  • the third preset format may further include: sender information corresponding to the voice message, and a time when the voice message is sent.
  • the keyword is highlighted, such as distinguishing color or bold, etc.
  • the sender information includes the sender nickname and/or the sender avatar, etc.
  • the voice information includes the duration of the voice and voice message, etc.
  • the preset number of words includes the number of words of the keyword, The preset number of words is set according to the length of the display line of the search result interface and the length of the voice message.
  • the preset number of words (the length of the search result interface display line - the length of the voice message display) / the length of each word display - n.
  • n is the reserved length
  • the reserved length includes the white space of the border, and the reserved length can also be used to indicate the ellipsis. If you can replace the excess with an ellipsis, such as the keyword: eat, the default number of words is 16, then the text message can be displayed as: ... where you eat, send a position to ....
  • the fourth preset format includes: voice information, and text content corresponding to the voice message.
  • the fourth preset format may further include: sender information corresponding to the voice message, and a time when the voice message is sent.
  • the keyword information is highlighted, such as distinguishing color or bold, and the sender information includes a sender nickname and/or a sender avatar, and the voice information includes a voice and a voice message duration.
  • the data saved in the preset file includes a text message corresponding to the voice message.
  • the data saved in the preset file includes a plain text message in addition to the text message corresponding to the voice message.
  • a plain text message can be understood as a text message that is initially entered as a text input. That is, the data saved in the preset file includes a text message corresponding to the voice message and a plain text message.
  • FIG. 6 is a schematic flowchart diagram of a voice message search and display method according to another embodiment of the present application.
  • the method includes steps S601-S609.
  • This embodiment differs from the embodiment shown in FIG. 2 in that steps S607-608 and steps S606, S609 are added.
  • steps S607-608 and steps S606, S609 are added.
  • steps S607-608 and steps S606, S609 are added.
  • the differences between the embodiment and the embodiment shown in FIG. 2 will be described in detail below. For other steps, please refer to the description of the embodiment shown in FIG. 2, and details are not described herein again.
  • the device obtains the user's communication record such as chat history. If the message in the chat record is a voice message, the device converts the voice message into a corresponding text message according to the voice recognition algorithm, and saves it in the preset file; if the message in the chat record is a plain text message, then the file will be pure Text messages are saved in a preset file. Therefore, the data saved in the preset file includes a text message corresponding to the voice message and a plain text message.
  • the voice message search result corresponding to the plurality of text messages and the plain text message search result are sorted according to a preset rule.
  • the preset rule includes: following the time sequence of the voice message and the plain text message, and/or sorting according to the text message corresponding to the voice message and the matching degree of the plain text message and the keyword, or according to the human forgetting curve. The order of the forgetting possibility corresponding to the transmission time of different voice messages and plain text messages is sorted.
  • the time of receiving according to the voice message and the plain text message is sorted from late to early, that is, the received voice message and the plain text message are sorted in front; according to the text message corresponding to the voice message and the plain text message and The matching degree of the keywords is sorted from high to low, that is, the sorting with high matching degree is in front; the sorting according to the possibility of forgetting is sorted from high to low, and the sorting with high possibility of forgetting is in front.
  • the voice message search result corresponding to the sorted text message matching the keyword is displayed according to a preset format, and the plain text message search result matched with the keyword is displayed according to another preset format.
  • the voice message search result corresponding to the text message matched with the keyword is displayed in a preset format, and the content described in the embodiment shown in FIG. 3 and FIG. 5 may be referred to, and details are not described herein again.
  • the plain text message search result matched with the keyword is displayed according to another preset format, wherein the other preset format includes: sender information corresponding to the plain text information, plain text information, and time when the plain text message is sent.
  • the keyword in the plain text message is highlighted, such as distinguishing color or bold, the sender information includes the sender nickname and/or the sender avatar, and the plain text information includes the content of the plain text information.
  • the data saved in the preset file includes a text message corresponding to the voice message and a plain text message, and after searching for a text message matching the keyword in the preset file, determining a text message matching the keyword Whether there are multiple, if there are multiple, the searched voice message search results and the plain text message search results are sorted, and displayed in the sorted order.
  • the voice message search result corresponding to the text message matched with the keyword is displayed according to a preset format
  • the plain text message search result matched with the keyword is displayed according to another preset format. Further improving the efficiency of message search and improving the user experience.
  • FIG. 7 is a schematic block diagram of a voice message search and display device according to an embodiment of the present application. As shown in FIG. 7, the device 70 includes a receiving unit 701, a searching unit 702, and a display unit 703.
  • the receiving unit 701 is configured to receive a message search instruction, where the message search instruction includes a keyword.
  • the searching unit 702 is configured to search, in the preset file, a text message that matches the keyword, where the data saved in the preset file includes a text message corresponding to the voice message.
  • the display unit 703 is configured to display the voice message search result corresponding to the text message matched by the keyword according to a preset format.
  • the preset format is related to the duration of the voice message, or the length of the display line of the device screen, the duration of the voice message corresponding to the voice message, and the number of words of the text message corresponding to the voice message.
  • FIG. 8 is a schematic block diagram of a voice message search and display device according to another embodiment of the present application.
  • the apparatus 80 includes a judging unit 801, a segmentation unit 802, an identification unit 803, a saving unit 804, a receiving unit 805, a searching unit 806, and a display unit 807.
  • the difference between this embodiment and the embodiment shown in FIG. 7 is that the judging unit 801, the segmentation unit 802, the recognizing unit 803, and the saving unit 804 are added.
  • the receiving unit 805, the searching unit 806, and the display unit 807 are the same as those described in the embodiment shown in FIG. 7. For details, refer to the corresponding description in FIG. 7, and details are not described herein again.
  • the judging unit 801, the segmentation unit 802, the recognition unit 803, and the saving unit 804 will be described in detail below.
  • the determining unit 801 is configured to determine, after receiving the voice message, whether the duration of the voice message exceeds a preset voice message duration.
  • the segmentation unit 802 is configured to segment the long voice message to form a plurality of voice messages if the preset voice message duration is exceeded.
  • the long voice message refers to a voice message that exceeds the duration of the preset voice message.
  • the segmentation unit 802 is configured to segment the long voice message according to time to form a plurality of voice messages.
  • the segmentation unit includes a position detection unit, a voice segmentation unit.
  • the location detecting unit is configured to detect a speaking pause position in the long voice message.
  • a voice segmentation unit configured to segment the long voice message according to a time and a talk pause position to form a plurality of voice messages.
  • the identifying unit 803 is configured to convert the voice message into a corresponding text message if the preset voice message duration is exceeded, and convert the plurality of voice messages formed by the segment into corresponding text messages if the preset voice message duration is exceeded .
  • the determining unit 801 is further configured to determine whether the interval between the current time and the received voice message time reaches the withdrawal message duration.
  • the identifying unit 803 is configured to convert the voice message into a corresponding text message by voice recognition if the withdrawal message duration is reached.
  • the withdrawal message length refers to the time set in the instant messaging tool to withdraw the message, such as 2s.
  • the saving unit 804 is configured to save the voice message and the text message corresponding to the voice message in the preset file.
  • a voice message search and display device further includes a first detecting unit, a first sorting unit, and specifically, a first detecting unit, configured to detect whether there are multiple text messages matching the keyword.
  • the first sorting unit is configured to sort the voice messages corresponding to the plurality of text messages according to a preset rule if there are multiple text messages matching the keyword.
  • a display unit configured to display the sorted voice message search result corresponding to the text message matched by the keyword according to a preset format.
  • the voice message search result corresponding to the text message matched with the keyword is displayed according to a preset format, wherein the preset format is related to the duration of the voice message.
  • the display unit 807 includes a first acquiring unit 901 , a first determining unit 902 , and a first display unit 903 .
  • the first obtaining unit 901 is configured to acquire a voice message duration for the voice message corresponding to the text message that matches the keyword.
  • the first determining unit 902 is configured to determine whether the duration of the voice message exceeds a preset duration.
  • the preset duration can be 30s.
  • the first display unit 903 is configured to display a corresponding voice message search result according to the first preset format if the preset duration is exceeded.
  • the first display unit 903 is further configured to display a corresponding voice message search result according to the second preset format if the preset duration is not exceeded.
  • the voice message search result corresponding to the text message matched with the keyword is displayed according to a preset format, wherein the preset format and the search result interface display the length of the line, the duration of the voice message, and the text corresponding to the voice message.
  • the number of words in the message is related.
  • a voice message search result corresponding to the text message matching the keyword is displayed, that is, a search result of the voice message is displayed in one line of the search result interface, so as to facilitate the user to view.
  • the display unit 807 further includes a second acquisition unit 101, a calculation unit 102, a second determination unit 103, and a second display unit 104.
  • the second obtaining unit 101 is configured to acquire a voice message duration for the voice message corresponding to the text message matched by the keyword, and a word number of the text message corresponding to the voice message.
  • the calculating unit 102 is configured to calculate a length of the voice message display according to the duration of the voice message.
  • the second determining unit 103 is configured to determine whether the length of the voice message display and the length of the word display exceed the length of the display line of the search result interface.
  • the second display unit 104 is configured to display the corresponding voice message search result in the same row according to the third preset format if the length of the display line of the search result interface is exceeded.
  • the second display unit 104 is further configured to display the corresponding voice message search result according to the fourth preset format if the length of the display line of the search result interface is not exceeded.
  • FIG. 11 is a schematic block diagram of a voice message search and display device according to another embodiment.
  • the apparatus 110 includes a determining unit 111, a segmenting unit 112, an identifying unit 113, a saving unit 114, a receiving unit 115, a searching unit 116, a second detecting unit 117, a second sorting unit 118, and a display unit 119.
  • the difference between this embodiment and the embodiment shown in FIG. 8 is that the difference between the second detecting unit 117, the second sorting unit 118, and the search unit 116 and the display unit 119 is increased.
  • the unit is the same as that described in the embodiment shown in FIG. 8. For details, refer to the corresponding description in FIG. 8, and details are not described herein again.
  • the search unit 116, the second detecting unit 117, the second sorting unit 118, and the display unit 119 will be described in detail below.
  • the search unit 116 is configured to search for a text message that matches the keyword in the preset file, where the data saved in the preset file includes a text message corresponding to the voice message and a plain text message.
  • the second detecting unit 117 is configured to detect whether there are multiple text messages matching the keyword.
  • the second sorting unit 118 is configured to sort the voice message and the plain text message corresponding to the plurality of text messages according to a preset rule if there are multiple pieces.
  • the display unit 119 is configured to display the sorted voice message search result corresponding to the text message matched by the keyword according to a preset format, and the plain text message search result matched with the keyword is displayed according to another preset format. .
  • the above apparatus may be embodied in the form of a computer program that can be run on a computer device as shown in FIG.
  • FIG. 12 is a schematic block diagram of a computer device according to an embodiment of the present application.
  • the device 120 can also be in the form of a client.
  • the device 120 includes a processor 122, a memory, and a network interface 123 connected by a system bus 121, wherein the memory can include a non-volatile storage medium 124 and an internal memory 125.
  • the non-volatile storage medium 124 can store an operating system 1241 and a computer program 1242. When the computer program 1242 is executed, the processor 122 can be caused to perform a voice message search display method.
  • the processor 122 is configured to provide computing and control capabilities to support operation of the entire device 120.
  • the internal memory 125 provides an environment for the operation of a computer program in a non-volatile storage medium that, when executed by the processor 122, causes the processor 122 to perform a voice message search display method.
  • the network interface 123 is used for network communication, such as receiving instructions and the like. It will be understood by those skilled in the art that the structure shown in FIG.
  • the specific device 120 may be It includes more or fewer components than those shown in the figures, or some components are combined, or have different component arrangements.
  • the processor 122 is configured to execute a computer program stored in the memory to implement any of the foregoing voice message search and display methods.
  • the processor 122 may be a central processing unit (CPU), and the processor may also be another general-purpose processor, a digital signal processor (DSP). , Application Specific Integrated Circuit (ASIC), Field-Programmable Gate Array (FPGA) or other programmable logic device, discrete gate or transistor logic device, discrete hardware component, etc.
  • the general purpose processor may be a microprocessor or the processor or any conventional processor or the like.
  • a computer readable storage medium is stored, the computer readable storage medium storing a computer program, the computer program comprising program instructions, when executed by a processor, To implement any of the foregoing embodiments of the voice message search display method.
  • the computer readable storage medium may be an internal storage unit of the terminal described in any of the foregoing embodiments, such as a hard disk or a memory of the terminal.
  • the computer readable storage medium may also be an external storage device of the terminal, such as a plug-in hard disk equipped on the terminal, a smart memory card (SMC), and a Secure Digital (SD) card. Wait.
  • the computer readable storage medium may also include both an internal storage unit of the terminal and an external storage device.
  • the disclosed terminal and method may be implemented in other manners.
  • the terminal embodiment described above is only illustrative.
  • the division of the unit is only a logical function division, and the actual implementation may have another division manner.
  • a person skilled in the art can clearly understand that, for the convenience and brevity of the description, the specific working process of the terminal and the unit described above can be referred to the corresponding process in the foregoing method embodiment, and details are not described herein again.
  • the foregoing is only a specific embodiment of the present application, but the scope of protection of the present application is not limited thereto, and any equivalents can be easily conceived by those skilled in the art within the technical scope disclosed in the present application. Modifications or substitutions are intended to be included within the scope of the present application. Therefore, the scope of protection of this application should be determined by the scope of protection of the claims.

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Multimedia (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Library & Information Science (AREA)
  • User Interface Of Digital Computer (AREA)

Abstract

一种语音消息搜索显示方法、装置、设备及计算机可读存储介质。所述方法包括:接收消息搜索指令,其中,所述消息搜索指令中包括关键词(S101);在预设文件中搜索与关键词匹配的文本消息,其中,预设文件中保存的数据包括语音消息对应的文本消息(S102);将与所述关键词匹配的文本消息对应的语音消息搜索结果按照预设格式显示(S103)。

Description

语音消息搜索显示方法、装置、计算机设备及存储介质
本申请要求于2018年3月22日提交中国专利局、申请号为201810240238.X、发明名称为“语音消息搜索显示方法、装置、计算机设备及存储介质”的中国专利申请的优先权,其全部内容通过引用结合在本申请中。
技术领域
本申请涉及数据处理技术领域,尤其涉及一种语音消息搜索显示方法、装置、计算机设备及存储介质。
背景技术
即时通信工具如微信、QQ等,已经成为人们工作生活中必不可少的交流工具。我们在使用此类通信工具时,通过视觉和听觉感知的聊天内容在脑海里会留下印象,然而随着时间的流逝,内容不是很清晰。为了了解之前的聊天内容,我们经常会用搜索功能,从而定位到当时的聊天记录。对于聊天方式,一般会用文字消息和语音消息两种。与文字消息相比,语音消息方便快捷且能拉近距离。表现为:相同长度的句子,对于文字输入和语音输入所需要的时间长度来说,语音输入要快的多;且语音输入时,录音中会带有说话人的情感状态,传达信息更为全面、不易产生误解。因而语音输入的未来市场巨大。目前,即时通信工具例如微信,所能搜索的内容最多,如群成员、日期、图片及视频、文件、链接、音乐和文本信息。而对于喜欢语音消息的用户来说,想要回顾之前的、不太容易定位日期的、所讲某一话题内容的详细信息时,就存在一定的困难。为了方便用户查找、定位历史消息记录,大多数现有的通信工具都为用户提供了历史消息记录的查询功能,然而拥有这一功能的通讯工具均只能查询、定位到用户的文字消息记录,忽略了用户对查询、定位语音消息记录的需求,导致用户查找语音消息的过程极其繁琐,严重影响用户体验。
发明内容
本申请实施例提供一种语音消息搜索显示方法、装置、计算机设备及存储介质,可对语音消息进行搜索,并将搜索的结果按照预设格式显示。
第一方面,本申请实施例提供了一种语音消息搜索显示方法,该方法包括:
接收消息搜索指令,其中,所述消息搜索指令中包括关键词;在预设文件中搜索与关键词匹配的文本消息,其中,预设文件中保存的数据包括语音消息对应的文本消息;将与所述关键词匹配的文本消息对应的语音消息搜索结果按照预设格式显示。
第二方面,本申请实施例提供了一种语音消息搜索显示装置,该装置包括用于执行上述第一方面所述的语音消息搜索显示方法的单元。
第三方面,本申请实施例提供了一种计算机设备,所述计算机设备包括存储器,以及与所述存储器相连的处理器;
所述存储器用于存储计算机程序,所述处理器用于运行所述存储器中存储的计算机程序,以执行上述第一方面所述的语音消息搜索显示方法。
第四方面,本申请实施例提供了一种计算机可读存储介质,所述计算机可读存储介质存储有计算机程序,所述计算机程序包括程序指令,所述程序指令被处理器执行时,实现上述第一方面所述的语音消息搜索显示方法。
本申请实施例可对语音消息进行搜索,得到与搜索关键词匹配的语音消息,并将搜索得到的语音消息搜索结果按照预设格式显示,可方便用户查看与搜索关键词匹配的语音消息,提高了查询语音消息的效率,提升了用户的体验。
附图说明
图1是本申请实施例提供的一种语音消息搜索显示方法的流程示意图;
图2是本申请另一实施例提供的一种语音消息搜索显示的流程示意图;
图3是本申请实施例提供的一种语音消息搜索显示方法的子流程示意图;
图4是本申请实施例提供的与关键词匹配的语音消息搜索结果显示的示例图;
图5是本申请实施例提供的一种语音消息搜索显示方法的另一子流程示意图;
图6是本申请另一实施例提供的一种语音消息搜索显示方法的流程示意图;
图7是本申请施例提供的一种语音消息搜索显示装置的示意性框图;
图8是本申请另一实施例提供的一种语音消息搜索显示装置的示意性框图;
图9是本申请实施例提供的显示单元的示意性框图;
图10是本申请另一实施例提供的显示单元的示意性框图;
图11是本申请另一实施例提供的语音消息搜索显示装置的示意性框图;
图12是本申请实施例提供的一种计算机设备的示意性框图。
具体实施方式
在本申请中,应当理解,尽管术语第一、第二等可以在此用来描述各种元素,但这些元素不应该受限于这些术语。这些术语仅用来将这些元素彼此区分开。例如,在不脱离本申请范围的前提下,第一获取单元可以被称为第二获取单元,并且类似地,第二获取单元可以被称为第一获取单元。第一获取单元和第二获取单元均为获取单元,但它们并非同一获取单元。
以下描述的方法实施例可以应用于移动电话、膝上型计算机、平板计算机、台式计算机等设备中。需要注意的是,这些设备中安装有即时通信工具如微信、QQ等。
图1为本申请实施例提供的一种语音消息搜索显示方法的流程示意图。该方法包括以下步骤S101-S103。
S101,接收消息搜索指令,其中,所述消息搜索指令中包括关键词。
具体地,可在即时通信工具如微信主页上的搜索查询项中输入搜索关键词,点击搜索按钮或者检测到输入完成,生成消息搜索指令;也可打开具体的通信对象如聊天对象,在具体的通信对象相应的界面中找到“查找聊天记录”等相关按钮,点击该按钮后,输入搜索关键词,点击搜索按钮或者检测到输入完成,即生成消息搜索指令,其中,通信对象可以是单个的联系人,也可以是群组。其中,关键词输入的方式,包括文本形式,以及语音形式。语音形式输入的关键词,需要根据语音识别将语音转换为对应的文本形式的关键词。
在一些实施例中,搜索指令中还可以包括在两个时间段中选择的目标时间段信息,即搜索指令中还可以包括时间信息;在一些实施例中,搜索指令中还可以包括在有关至少两个联系人的界面中选择的目标联系人信息,即搜索指令中还可以包括目标联系人信息。
S102,在预设文件中搜索与关键词匹配的文本消息,其中,预设文件中保存的数据包括语音消息对应的文本消息。
用户在使用即时通信工具如微信的过程中,设备会获取用户的通信记录如聊天记录。若聊天记录中的消息为语音消息,那么设备会根据语音识别算法将语音消息转换成对应的文本消息,并保存在预设文件中,以供搜索时使用。其中,预设文件中包括语音消息对应的文本消息。在预设文件中搜索与关键词匹配的文本消息,如关键词为“动物园”,在预设文件中保存的文本消息中搜索,若搜索到包括与“动物园”相关的文本消息,那么就认为该文本消息是与关键词匹配的文本消息。其中,搜索包括各种方式的搜索,如模糊搜索、精确搜索等。
S103,将与所述关键词匹配的文本消息对应的语音消息搜索结果按照预设格式显示。其中,预设格式与语音消息时长有关,或者与设备屏幕显示行的长度、语音消息对应的语音消息时长、语音消息对应的文本消息的字数有关。
该实施例对即时通信工具中的语音消息进行搜索,得到与搜索关键词匹配的语音消息,并将搜索得到的语音消息搜索结果按照预设格式显示,可方便用户查看与搜索关键词匹配的语音消息,提高了查询语音消息的效率,提升了用户的体验。
图2为本申请另一实施例提供的一种语音消息搜索显示方法的流程示意图。该方法包括步骤S201-S207。该实施例与图1所示实施例的区别在于:在接收消息搜索指令之前,该实施例增加了步骤S201-S204。下面将详细描述步骤S201-S204。其中,步骤S205-S207与图1所示实施例中的步骤对应,请参看图1所示实施例中的描述,在此不在赘述。
S201,当接收到语音消息后,判断语音消息时长是否超过预设语音消息时长。
预设语音消息时长可以是预先设置的,如具体为1分钟等。也可以根据用 户的习惯进行设置,即接收用户的设置。预设语音消息时长设置好后,可以进行修改,如可以接收用户修改的预设语音消息时长,也可以根据用户的反馈,接收服务器设置的另一合适的时长作为新的预设语音消息时长。可以理解地,若超过预设语音消息时长,表示用户不太愿意一次阅读这么长的语音。如用户阅读了一段语音消息后,对语音消息中某一段不是很清楚,用户只想再重复听该段对应的语音,而不希望每次听语音时,都从头开始。在该种情况下,若每次都从头开始,会影响用户的体验。
S202,若超过预设语音消息时长,将该长语音消息进行分段形成多个语音消息。
其中,长语音消息指的是超过预设语音消息时长的语音消息。将该长语音消息进行分段形成多个语音消息,包括:根据时间将该长语音消息进行分段形成多个语音消息,或者检测该长语音消息中的说话停顿位置,根据时间和说话停顿位置将该长语音消息进行分段形成多个语音消息。其中,根据时间将该长语音消息进行分段形成多个语音消息,可以理解为,每隔一段时间将该长语音消息进行分段,该时间小于预设语音消息时长,如每隔30s将长语音消息分成多段。也可以结合时间和说话停顿位置来定位分段点,如当检测到录音时间超过30s时,若同时检测到说话停顿位置,将该说话停顿位置作为分段点,下一个分段点与上一个分段点之间的间隔在30s以上。若上一个分段点与语音消息结束之间的时间间隔小于30s,将上一个分段点之后的语音也作为分段后的一个语音。其中,可以根据语音消息对应的声波变化来检测说话停顿位置,如若检测到语音消息中的一段声波平均振幅比较高,而另一段声波平均振幅比较低,可以取平均振幅比较低的该段声波的中间位置对应的时间作为说话停顿位置,再根据时间和说话停顿位置来将该长语音消息分成多段。需要注意的是,在该实施例中,30s时间只是一个例子而已,在其他实施例中,时间也可以设置为其他的数值。
S203,通过语音识别将语音消息转换为对应的文本消息。
其中,这里包括了两种情况,一种情况是,超过了预设语音消息时长,将分段形成的多个语音消息转换为对应的文本消息,另一种情况是,若没有超过预设语音消息时长,将该语音消息转换为对应的文本消息。具体地,可以利用 设备上的语音识别算法将语音消息转换为对应的文本消息。
在一实施例中,在通过语音识别将语音消息转换为对应的文本消息之前,还需要进行如下操作:判断当前时间与接收语音消息时间的间隔是否达到撤回消息时长;若达到撤回消息时长,再通过语音识别将语音消息转换为对应的文本消息。其中,撤回消息时长指的是在即时通信工具中设置的可以撤回消息的时间,如2s等。可以理解为,若当前时间与接收到语音消息时间的间隔达到撤回消息时长,那么该条消息不会被用户撤回,避免出现先将语音消息转换为对应的文本消息,然后该条语音消息被撤回的情况。如此可以减少设备将被撤回的语音消息转换为对应的文本消息的处理量。需要注意的是,若未达到撤回消息时长,仍需要对语音消息进行分段,并显示。只是不将语音消息转换为对应的文本消息。
S204,将语音消息以及语音消息对应的文本消息保存在预设文件中。
该实施例是在接收到语音消息后,若语音消息超过预设语音消息时长,将该长语音消息进行分段形成多个语音消息,再通过语音识别将语音消息转换为对应的文本消息并保存。将超过预设语音消息时长的长语音消息分成多段,用户可以分段来获取或者搜索对应语音消息的内容,当其中某一段语音消息用户想再次阅读时,可只播放该段语音消息,而不必每次都从头播放,提高了用户的体验。
在一实施例中,在步骤S102之后,即在预设文件中搜索与关键词匹配的文本消息之后,所述方法还包括:检测与所述关键词匹配的文本消息是否有多条;若与所述关键词匹配的文本消息有多条,按照预设规则对多条文本消息对应的语音消息搜索结果进行排序。将与所述关键词匹配的文本消息对应的语音消息搜索结果按照预设格式显示,包括:将排序后的与所述关键词匹配的文本消息对应的语音消息搜索结果按照预设格式显示。其中,预设规则包括按照语音消息接收的时间前后顺序,和/或按照语音消息对应的文本消息与关键词的匹配度进行排序,或者根据人的遗忘曲线来根据不同语音消息时间所对应的遗忘可能性的高低进行排序等。具体地,如按照语音消息接收的时间由晚到早的顺序进行排序,即后接收到的语音消息排序在前面;按照语音消息对应的文本消息与关键词的匹配度由高到低进行排序,即匹配度高的排序在前面;按照遗忘可能 性由高到低进行排序,即将遗忘可能性高的排序在前面。
在一实施例中,如图3所示,将与所述关键词匹配的文本消息对应的语音消息搜索结果按照预设格式显示,即步骤S103,包括以下步骤S301-S304。其中,预设格式与语音消息时长有关。
S301,对与所述关键词匹配的文本消息对应的语音消息,获取语音消息时长。
S302,判断语音消息时长是否超过预设时长。其中,预设时长可以为30s。
S303,若超过预设时长,按照第一预设格式显示对应的语音消息搜索结果。其中,第一预设格式包括:语音消息、语音消息中关键词前后对应的预设字数的文本内容。第一预设格式还可以包括:语音消息对应的发送人信息、语音消息发送的时间。其中,关键词可以高亮显示,如区分颜色或者加粗等,发送人信息包括发送人昵称和/或发送人头像等,语音信息包括语音和/或语音消息时长等;预设字数包括关键词的字数,预设次数可以设置为16字,也可以设置为其他的字数。若语音消息对应的文本消息的总字数超过预设字数,预设字数以外的其他文本可以用省略号代替。如关键词为:吃饭,预设字数为16,那么文本消息可以显示为:...你在哪个地方吃饭,发个定位给...。若语音消息对应的文本消息的字数少于预设次数,除了显示文本消息之外,其他不足预设字数的也可以用省略号代替。可以理解为,语音消息中有很多的杂音,真正有语音消息的并不多。
S304,若未超过预设时长,按照第二预设格式显示对应的语音消息搜索结果。其中,第二预设格式包括:语音信息、语音消息对应的文本内容。第二预设格式还可以包括:语音消息对应的发送人信息、语音消息发送的时间。其中,关键词高亮显示,如区分颜色或者加粗等,发送人信息包括发送人昵称和/或发送人头像等,语音信息包括语音和/或语音消息时长等。具体的显示效果可参看图4。
图4为与关键词匹配的语音消息搜索结果显示的示例图。如图4所示,在终端10的屏幕11上显示有与关键词匹配的语音消息搜索结果。其中,关键词110为“动物园”,发送人信息包括发送人图像120和发送人昵称130。语音消息包括语音160和语音消息时长150。语音消息对应的文本内容140,其中,可 以看出关键词“动物园”为加粗显示。语音消息发送的时间170显示为:2018-01-01,在其他实施例中,语音消息发送的时间还可以具体到分钟等。
在一实施例中,如图5所示,将与所述关键词匹配的文本消息对应的语音消息搜索结果按照预设格式显示,即步骤S103包括以下步骤S501-S504。其中,预设格式与搜索结果界面显示行的长度、语音消息时长、语音消息对应文本消息的字数有关。可以理解为,在搜索结果界面显示行中,显示一条与所述关键词匹配的文本消息对应的语音消息搜索结果,即搜索结果界面一行中,显示一条语音消息搜索结果,以方便用户查看。
S501,对与所述关键词匹配的文本消息对应的语音消息,获取语音消息时长,以及语音消息对应文本消息的字数。
S502,根据语音消息时长计算语音消息显示的长度。
S503,判断语音消息显示的长度与字数显示长度是否超过搜索结果界面显示行的长度。
S504,若超过搜索结果界面显示行的长度,按照第三预设格式将对应的语音消息搜索结果显示于同一行中。其中,第三预设格式包括:语音消息、语音消息中关键词前后对应的预设字数的文本内容。第三预设格式还可以包括:语音消息对应的发送人信息、语音消息发送的时间。其中,关键词高亮显示,如区分颜色或者加粗等,发送人信息包括发送人昵称和/或发送人头像等,语音信息包括语音和语音消息时长等;预设字数包括关键词的字数,预设字数根据搜索结果界面显示行的长度、以及语音消息的长短来设定,预设字数=(搜索结果界面显示行的长度-语音消息显示的长度)/每个字显示的长度-n。其中,n为预留长度,预留长度包括了边框的留白等,预留长度还可以用来表示省略号等。如可以用省略号代替超过的部分,如关键词为:吃饭,预设字数为16,那么文本消息可以显示为:...你在哪个地方吃饭,发个定位给...。
S505,若未超过搜索结果界面显示行的长度,按照第四预设格式显示对应的语音消息搜索结果。其中,第四预设格式包括:语音信息、语音消息对应的文本内容。第四预设格式还可以包括:语音消息对应的发送人信息、语音消息发送的时间。其中,关键词高亮显示,如区分颜色或者加粗等,发送人信息包括发送人昵称和/或发送人头像等,语音信息包括语音和语音消息时长等。
以上实施例中,预设文件中保存的数据包括语音消息对应的文本消息,在其他一些实施例中,预设文件中保存的数据除了包括语音消息对应的文本消息之外,还包括纯文本消息,纯文本消息可理解为最开始输入即为文本形式输入的文本消息。即预设文件中保存的数据包括语音消息对应的文本消息和纯文本消息。
图6为本申请另一实施例提供的一种语音消息搜索显示方法的流程示意图。该方法包括步骤S601-S609。该实施例与图2所示的实施例的区别在于:增加了步骤S607-608,以及步骤S606、S609的不同。下面将详细该实施例与图2所示的实施例的区别之处,其他步骤请参看图2所示的实施例部分的描述,在此不再赘述。
S606,在预设文件中搜索与关键词匹配的文本消息,其中,预设文件中保存的数据包括语音消息对应的文本消息和纯文本消息。
用户在使用即时通信工具如微信的过程中,设备会获取用户的通信记录如聊天记录。若聊天记录中的消息为语音消息,那么设备会根据语音识别算法将语音消息转换成对应的文本消息,并保存在预设文件中;若聊天记录中的消息为纯文本消息,那么直接将纯文本消息保存在预设文件中。因此,预设文件中保存的数据包括语音消息对应的文本消息和纯文本消息。
S607,检测与所述关键词匹配的文本消息是否有多条。
S608,若与所述关键词匹配的文本消息有多条,将多条文本消息对应的语音消息搜索结果和纯文本消息搜索结果按照按照预设规则排序。其中,预设规则包括按照语音消息和纯文本消息接收的时间前后顺序,和/或按照语音消息对应的文本消息以及纯文本消息与关键词的匹配度进行排序,或者根据人的遗忘曲线来根据不同语音消息和纯文本消息发送时间所对应的遗忘可能性的高低进行排序等。具体地,如按照语音消息和纯文本消息接收的时间由晚到早的顺序进行排序,即后接收到的语音消息和纯文本消息排序在前面;按照语音消息对应的文本消息以及纯文本消息与关键词的匹配度由高到低进行排序,即匹配度高的排序在前面;按照遗忘可能性由高到低进行排序,即将遗忘可能性高的排序在前面。
S609,将排序后的与所述关键词匹配的文本消息对应的语音消息搜索结果 按照预设格式显示,与所述关键词匹配的纯文本消息搜索结果按照另一预设格式显示。
其中,将与所述关键词匹配的文本消息对应的语音消息搜索结果按照预设格式显示,可参看图3和图5所示实施例描述的内容,在此不再赘述。
将与所述关键词匹配的纯文本消息搜索结果按照另一预设格式显示,其中,另一预设格式包括:纯文本信息对应的发送人信息、纯文本信息、纯文本消息发送的时间。其中,纯文本消息中的关键词高亮显示,如区分颜色或者加粗等,发送人信息包括发送人昵称和/或发送人头像等,纯文本信息包括纯文本信息的内容。
在该实施例中,预设文件中保存的数据包括语音消息对应的文本消息和纯文本消息,在预设文件中搜索与关键词匹配的文本消息后,判断与所述关键词匹配的文本消息是否有多条,若有多条,将搜索到的语音消息搜索结果和纯文本消息搜索结果进行排序,按照排序后的顺序显示。其中,将与所述关键词匹配的文本消息对应的语音消息搜索结果按照预设格式显示,将与所述关键词匹配的纯文本消息搜索结果按照另一预设格式显示。更一步提高了消息搜索的效率,提升了用户的体验。
图7是本申请实施例提供的一种语音消息搜索显示装置的示意性框图。如图7所示,该装置70包括接收单元701、搜索单元702、显示单元703。
接收单元701,用于接收消息搜索指令,其中,所述消息搜索指令中包括关键词。
搜索单元702,用于在预设文件中搜索与关键词匹配的文本消息,其中,预设文件中保存的数据包括语音消息对应的文本消息。
显示单元703,用于将与所述关键词匹配的文本消息对应的语音消息搜索结果按照预设格式显示。其中,预设格式与语音消息时长有关,或者与设备屏幕显示行的长度、语音消息对应的语音消息时长、语音消息对应的文本消息的字数有关。
图8是本申请另一实施例提供的一种语音消息搜索显示装置的示意性框图。如图8所示,该装置80包括判断单元801、分段单元802、识别单元803、保存单元804、接收单元805、搜索单元806、显示单元807。其中,该实施例 与图7所示的实施例的区别在于:增加了判断单元801、分段单元802、识别单元803、保存单元804。其中,接收单元805、搜索单元806、显示单元807与图7所示实施例中描述的一致,具体请参看图7中相应的描述,在此不再赘述。下面将详细描述判断单元801、分段单元802、识别单元803、保存单元804。
判断单元801,用于接收到语音消息后,判断语音消息时长是否超过预设语音消息时长。
分段单元802,用于若超过预设语音消息时长,将该长语音消息进行分段形成多个语音消息。其中,长语音消息指的是超过预设语音消息时长的语音消息。分段单元802用于根据时间将该长语音消息进行分段形成多个语音消息。在另一实施例中,分段单元包括位置检测单元,语音分段单元。其中,位置检测单元用于检测该长语音消息中的说话停顿位置。语音分段单元,用于根据时间和说话停顿位置将该长语音消息进行分段形成多个语音消息。
识别单元803,用于若超过预设语音消息时长,将该语音消息转换为对应的文本消息,以及若超过了预设语音消息时长,将分段形成的多个语音消息转换为对应的文本消息。
在一实施例中,判断单元801,还用于判断当前时间与接收语音消息时间的间隔是否达到撤回消息时长。识别单元803,用于若达到撤回消息时长,再通过语音识别将语音消息转换为对应的文本消息。其中,撤回消息时长指的是在即时通信工具中设置的可以撤回消息的时间,如2s等。
保存单元804,用于将语音消息以及语音消息对应的文本消息保存在预设文件中。
在一实施例中,一种语音消息搜索显示装置还包括第一检测单元、第一排序单元,具体地,第一检测单元,用于检测与所述关键词匹配的文本消息是否有多条。第一排序单元,用于若与所述关键词匹配的文本消息有多条,按照预设规则对对多条文本消息对应的语音消息进行排序。显示单元,用于将排序后的与所述关键词匹配的文本消息对应的语音消息搜索结果按照预设格式显示。在一实施例中,将与所述关键词匹配的文本消息对应的语音消息搜索结果按照预设格式显示,其中,预设格式与语音消息时长有关。如图9所示,显示单元807包括第一获取单元901、第一判断单元902、第一显示单元903。
第一获取单元901,用于对与所述关键词匹配的文本消息对应的语音消息,获取语音消息时长。
第一判断单元902,用于判断语音消息时长是否超过预设时长。其中,预设时长可以为30s。
第一显示单元903,用于若超过预设时长,按照第一预设格式显示对应的语音消息搜索结果。
第一显示单元903,还用于若未超过预设时长,按照第二预设格式显示对应的语音消息搜索结果。
在一实施例中,将与所述关键词匹配的文本消息对应的语音消息搜索结果按照预设格式显示,其中,预设格式与搜索结果界面显示行的长度、语音消息时长、语音消息对应文本消息的字数有关。可以理解为,在搜索结果界面显示行中,显示一条与所述关键词匹配的文本消息对应的语音消息搜索结果,即搜索结果界面一行中,显示一条语音消息搜索结果,以方便用户查看。如图10所示,显示单元807还包括第二获取单元101、计算单元102、第二判断单元103、第二显示单元104。
第二获取单元101,用于对与所述关键词匹配的文本消息对应的语音消息,获取语音消息时长,以及语音消息对应文本消息的字数。
计算单元102,用于根据语音消息时长计算语音消息显示的长度。
第二判断单元103,用于判断语音消息显示的长度与字数显示长度是否超过搜索结果界面显示行的长度。
第二显示单元104,用于若超过搜索结果界面显示行的长度,按照第三预设格式将对应的语音消息搜索结果显示于同一行中。
第二显示单元104,还用于若未超过搜索结果界面显示行的长度,按照第四预设格式显示对应的语音消息搜索结果。
图11是另一实施例提供的一种语音消息搜索显示装置的示意性框图。如图11所示,该装置110包括判断单元111、分段单元112、识别单元113、保存单元114、接收单元115、搜索单元116、第二检测单元117、第二排序单元118、显示单元119。其中,该实施例与图8所示的实施例的区别在于:增加了第二检测单元117、第二排序单元118以及搜索单元116与显示单元119的不同。 其中单元请与图8所示实施例中描述的一致,具体请参看图8中相应的描述,在此不再赘述。下面将详细描述搜索单元116、第二检测单元117、第二排序单元118、显示单元119。
搜索单元116,用于在预设文件中搜索与关键词匹配的文本消息,其中,预设文件中保存的数据包括语音消息对应的文本消息和纯文本消息。
第二检测单元117,用于检测与所述关键词匹配的文本消息是否有多条。
第二排序单元118,用于若有多条,将多条文本消息对应的语音消息和纯文本消息按照按照预设规则排序。
显示单元119,用于将排序后的与所述关键词匹配的文本消息对应的语音消息搜索结果按照预设格式显示,与所述关键词匹配的纯文本消息搜索结果按照另一预设格式显示。
上述装置实施例的具体工作过程和达到的有益效果,请参看前述方法实施例对应的实施过程和有益效果,在此不再赘述。
上述装置可以实现为一种计算机程序的形式,计算机程序可以在如图12所示的计算机设备上运行。
图12为本申请实施例提供的一种计算机设备的示意性框图。该设备120也可以是以客户端的形式存在。该设备120包括通过系统总线121连接的处理器122、存储器和网络接口123,其中,存储器可以包括非易失性存储介质124和内存储器125。
该非易失性存储介质124可存储操作系统1241和计算机程序1242。该计算机程序1242被执行时,可使得处理器122执行语音消息搜索显示方法。该处理器122用于提供计算和控制能力,支撑整个设备120的运行。该内存储器125为非易失性存储介质中的计算机程序的运行提供环境,该计算机程序被处理器122执行时,可使得处理器122执行语音消息搜索显示方法。该网络接口123用于进行网络通信,如接收指令等。本领域技术人员可以理解,图12中示出的结构,仅仅是与本申请方案相关的部分结构的框图,并不构成对本申请方案所应用于其上的设备120的限定,具体的设备120可以包括比图中所示更多或更少的部件,或者组合某些部件,或者具有不同的部件布置。
其中,所述处理器122用于运行存储在存储器中的计算机程序,以实现前 述语音消息搜索显示方法的任一实施例。
应当理解,在本申请实施例中,所称处理器122可以是中央处理单元(Central Processing Unit,CPU),该处理器还可以是其他通用处理器、数字信号处理器(Digital Signal Processor,DSP)、专用集成电路(Application Specific Integrated Circuit,ASIC)、现成可编程门阵列(Field-Programmable Gate Array,FPGA)或者其他可编程逻辑器件、分立门或者晶体管逻辑器件、分立硬件组件等。通用处理器可以是微处理器或者该处理器也可以是任何常规的处理器等。
在本申请的另一实施例中提供了一种计算机可读存储介质,所述计算机可读存储介质存储有计算机程序,所述计算机程序包括程序指令,所述程序指令当被处理器执行时,以实现前述语音消息搜索显示方法的任一实施例。
所述计算机可读存储介质可以是前述任一实施例所述的终端的内部存储单元,例如终端的硬盘或内存。所述计算机可读存储介质也可以是所述终端的外部存储设备,例如所述终端上配备的插接式硬盘,智能存储卡(Smart Media Card,SMC),安全数字(Secure Digital,SD)卡等。进一步地,所述计算机可读存储介质还可以既包括所述终端的内部存储单元也包括外部存储设备。
在本申请所提供的几个实施例中,应该理解到,所揭露的终端和方法,可以通过其它的方式实现。例如,以上所描述的终端实施例仅仅是示意性的,例如,所述单元的划分,仅仅为一种逻辑功能划分,实际实现时可以有另外的划分方式。所属领域的技术人员可以清楚地了解到,为了描述的方便和简洁,上述描述的终端和单元的具体工作过程,可以参考前述方法实施例中的对应过程,在此不再赘述。以上所述,仅为本申请的具体实施方式,但本申请的保护范围并不局限于此,任何熟悉本技术领域的技术人员在本申请揭露的技术范围内,可轻易想到各种等效的修改或替换,这些修改或替换都应涵盖在本申请的保护范围之内。因此,本申请的保护范围应以权利要求的保护范围为准。

Claims (20)

  1. 一种语音消息搜索显示方法,其特征在于,所述方法包括:
    接收消息搜索指令,其中,所述消息搜索指令中包括关键词;
    在预设文件中搜索与关键词匹配的文本消息,其中,预设文件中保存的数据包括语音消息对应的文本消息;
    将与所述关键词匹配的文本消息对应的语音消息搜索结果按照预设格式显示。
  2. 根据权利要求1所述的方法,其特征在于,在所述接收消息搜索指令之前,所述方法还包括:
    当接收到语音消息后,判断语音消息时长是否超过预设语音消息时长;
    若超过预设语音消息时长,将该长语音消息进行分段形成多个语音消息,其中,长语音消息为语音消息时长超过预设语音消息时长的语音消息;
    通过语音识别将多个语音消息转换为对应的文本消息;
    将多个语音消息以及多个语音消息对应的文本消息保存在预设文件中。
  3. 根据权利要求2所述的方法,其特征在于,所述将该长语音消息进行分段形成多个语音消息,包括:
    根据时间将该长语音消息进行分段形成多个语音消息;或者
    检测该长语音消息中的说话停顿位置;
    根据时间和说话停顿位置将该长语音消息进行分段形成多个语音消息。
  4. 根据权利要求1所述的方法,其特征在于,所述预设文件中保存的数据还包括纯文本消息,所述将与所述关键词匹配的文本消息对应的语音消息搜索结果按照预设格式显示,包括:
    将与所述关键词匹配的文本消息对应的语音消息搜索结果按照预设格式显示;将与所述关键词匹配的纯文本消息搜索结果按照另一预设格式显示。
  5. 根据权利要求1所述的方法,其特征在于,所述将与所述关键词匹配的文本消息对应的语音消息搜索结果按照预设格式显示之前,所述方法还包括:
    检测与所述关键词匹配的文本消息是否有多条;
    若与所述关键词匹配的文本消息有多条,将多条文本消息对应的语音消息 搜索结果按照预设规则进行排序。
  6. 根据权利要求1所述的方法,其特征在于,所述将与所述关键词匹配的文本消息对应的语音消息搜索结果按照预设格式显示,包括:
    对与所述关键词匹配的文本消息对应的语音消息,获取语音消息时长;
    判断语音消息时长是否超过预设时长;
    若超过预设时长,按照第一预设格式显示对应的语音消息搜索结果;
    若未超过预设时长,按照第二预设格式显示对应的语音消息搜索结果。
  7. 根据权利要求1所述的方法,其特征在于,所述将与所述关键词匹配的文本消息对应的语音消息搜索结果按照预设格式显示,包括:
    对与所述关键词匹配的文本消息对应的语音消息,获取语音消息时长,以及语音消息对应文本消息的字数;
    根据语音消息时长计算语音消息显示的长度;
    判断语音消息显示的长度与字数显示长度是否超过搜索结果界面显示行的长度;
    若超过搜索结果界面显示行的长度,按照第三预设格式将对应的语音消息搜索结果显示于同一行中;
    若未超过搜索结果界面显示行的长度,按照第四预设格式显示对应的语音消息搜索结果。
  8. 一种语音消息搜索显示装置,其特征在于,所述装置包括:
    接收单元,用于接收消息搜索指令,其中,所述消息搜索指令中包括关键词;
    搜索单元,用于在预设文件中搜索与关键词匹配的文本消息,其中,预设文件中保存的数据包括语音消息对应的文本消息;
    显示单元,用于将与所述关键词匹配的文本消息对应的语音消息搜索结果按照预设格式显示。
  9. 根据权利要求8所述的装置,其特征在于,所述装置还包括:
    判断单元,用于当接收到语音消息后,判断语音消息时长是否超过预设语音消息时长;
    分段单元,用于若超过预设语音消息时长,将该长语音消息进行分段形成 多个语音消息,其中,长语音消息为语音消息时长超过预设语音消息时长的语音消息;
    识别单元,用于通过语音识别将多个语音消息转换为对应的文本消息;
    保存单元,用于将多个语音消息以及多个语音消息对应的文本消息保存在预设文件中。
  10. 根据权利要求8所述的装置,其特征在于,所述显示单元包括:
    第二获取单元,用于对与所述关键词匹配的文本消息对应的语音消息,获取语音消息时长,以及语音消息对应文本消息的字数;
    计算单元,用于根据语音消息时长计算语音消息显示的长度;
    第二判断单元,用于判断语音消息显示的长度与字数显示长度是否超过搜索结果界面显示行的长度;
    第二显示单元,用于若超过搜索结果界面显示行的长度,按照第三预设格式将对应的语音消息搜索结果显示于同一行中;若未超过搜索结果界面显示行的长度,按照第四预设格式显示对应的语音消息搜索结果。
  11. 一种计算机设备,其特征在于,所述计算机设备包括存储器,以及与所述存储器相连的处理器;
    所述存储器用于存储计算机程序;所述处理器用于运行所述存储器中存储的计算机程序,以执行如下步骤:
    接收消息搜索指令,其中,所述消息搜索指令中包括关键词;
    在预设文件中搜索与关键词匹配的文本消息,其中,预设文件中保存的数据包括语音消息对应的文本消息;
    将与所述关键词匹配的文本消息对应的语音消息搜索结果按照预设格式显示。
  12. 根据权利要求11所述的计算机设备,其特征在于,在所述接收消息搜索指令之前,所述处理器还执行如下步骤:
    当接收到语音消息后,判断语音消息时长是否超过预设语音消息时长;
    若超过预设语音消息时长,将该长语音消息进行分段形成多个语音消息,其中,长语音消息为语音消息时长超过预设语音消息时长的语音消息;
    通过语音识别将多个语音消息转换为对应的文本消息;
    将多个语音消息以及多个语音消息对应的文本消息保存在预设文件中。
  13. 根据权利要求12所述的计算机设备,其特征在于,所述处理器在执行所述将该长语音消息进行分段形成多个语音消息时,具体执行如下步骤:
    根据时间将该长语音消息进行分段形成多个语音消息;或者
    检测该长语音消息中的说话停顿位置;
    根据时间和说话停顿位置将该长语音消息进行分段形成多个语音消息。
  14. 根据权利要求11所述的计算机设备,其特征在于,所述预设文件中保存的数据还包括纯文本消息,所述处理器在执行所述将与所述关键词匹配的文本消息对应的语音消息搜索结果按照预设格式显示,包括:
    将与所述关键词匹配的文本消息对应的语音消息搜索结果按照预设格式显示;将与所述关键词匹配的纯文本消息搜索结果按照另一预设格式显示。
  15. 根据权利要求11所述的计算机设备,其特征在于,所述将与所述关键词匹配的文本消息对应的语音消息搜索结果按照预设格式显示之前,所述处理器还执行如下步骤:
    检测与所述关键词匹配的文本消息是否有多条;
    若与所述关键词匹配的文本消息有多条,将多条文本消息对应的语音消息搜索结果按照预设规则进行排序。
  16. 根据权利要求11所述的计算机设备,其特征在于,所述处理器在执行所述将与所述关键词匹配的文本消息对应的语音消息搜索结果按照预设格式显示时,具体执行如下步骤:
    对与所述关键词匹配的文本消息对应的语音消息,获取语音消息时长;
    判断语音消息时长是否超过预设时长;
    若超过预设时长,按照第一预设格式显示对应的语音消息搜索结果;
    若未超过预设时长,按照第二预设格式显示对应的语音消息搜索结果。
  17. 根据权利要求11所述的计算机设备,其特征在于,所述处理器在执行所述将与所述关键词匹配的文本消息对应的语音消息搜索结果按照预设格式显示时,具体执行如下步骤:
    对与所述关键词匹配的文本消息对应的语音消息,获取语音消息时长,以及语音消息对应文本消息的字数;
    根据语音消息时长计算语音消息显示的长度;
    判断语音消息显示的长度与字数显示长度是否超过搜索结果界面显示行的长度;
    若超过搜索结果界面显示行的长度,按照第三预设格式将对应的语音消息搜索结果显示于同一行中;
    若未超过搜索结果界面显示行的长度,按照第四预设格式显示对应的语音消息搜索结果。
  18. 一种计算机可读存储介质,其特征在于,所述计算机可读存储介质存储有计算机程序,所述计算机程序包括程序指令,所述程序指令被处理器执行时,实现如下步骤:
    接收消息搜索指令,其中,所述消息搜索指令中包括关键词;
    在预设文件中搜索与关键词匹配的文本消息,其中,预设文件中保存的数据包括语音消息对应的文本消息;
    将与所述关键词匹配的文本消息对应的语音消息搜索结果按照预设格式显示。
  19. 根据权利要求18所述的计算机可读存储介质,其特征在于,在所述接收消息搜索指令之前,所述处理器还实现如下步骤:
    当接收到语音消息后,判断语音消息时长是否超过预设语音消息时长;
    若超过预设语音消息时长,将该长语音消息进行分段形成多个语音消息,其中,长语音消息为语音消息时长超过预设语音消息时长的语音消息;
    通过语音识别将多个语音消息转换为对应的文本消息;
    将多个语音消息以及多个语音消息对应的文本消息保存在预设文件中。
  20. 根据权利要求18所述的计算机可读存储介质,其特征在于,所述处理器在执行所述将与所述关键词匹配的文本消息对应的语音消息搜索结果按照预设格式显示时,具体实现如下步骤:
    对与所述关键词匹配的文本消息对应的语音消息,获取语音消息时长,以及语音消息对应文本消息的字数;
    根据语音消息时长计算语音消息显示的长度;
    判断语音消息显示的长度与字数显示长度是否超过搜索结果界面显示行的 长度;
    若超过搜索结果界面显示行的长度,按照第三预设格式将对应的语音消息搜索结果显示于同一行中;
    若未超过搜索结果界面显示行的长度,按照第四预设格式显示对应的语音消息搜索结果。
PCT/CN2018/101071 2018-03-22 2018-08-17 语音消息搜索显示方法、装置、计算机设备及存储介质 WO2019179014A1 (zh)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201810240238.X 2018-03-22
CN201810240238.XA CN108446389B (zh) 2018-03-22 2018-03-22 语音消息搜索显示方法、装置、计算机设备及存储介质

Publications (1)

Publication Number Publication Date
WO2019179014A1 true WO2019179014A1 (zh) 2019-09-26

Family

ID=63196129

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2018/101071 WO2019179014A1 (zh) 2018-03-22 2018-08-17 语音消息搜索显示方法、装置、计算机设备及存储介质

Country Status (2)

Country Link
CN (1) CN108446389B (zh)
WO (1) WO2019179014A1 (zh)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112988794A (zh) * 2019-12-02 2021-06-18 深圳云天励飞技术有限公司 一种动态调整搜索策略的数据搜索方法、装置及电子设备

Families Citing this family (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109274586A (zh) * 2018-11-14 2019-01-25 深圳市云歌人工智能技术有限公司 聊天信息的存储方法、装置及存储介质
CN109977130B (zh) * 2019-03-29 2021-09-28 珠海豹好玩科技有限公司 一种热词展示方法及系统
CN111625701B (zh) * 2020-05-25 2024-01-26 Oppo广东移动通信有限公司 搜索方法、装置、服务器及存储介质
CN113360636B (zh) * 2021-06-30 2023-08-01 北京百度网讯科技有限公司 一种内容显示方法、装置、设备以及存储介质

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102750365A (zh) * 2012-06-14 2012-10-24 华为软件技术有限公司 即时语音消息的检索方法和系统,以及用户设备和服务器
CN104714981A (zh) * 2013-12-17 2015-06-17 腾讯科技(深圳)有限公司 语音消息搜索方法、装置及系统
CN107622137A (zh) * 2017-10-23 2018-01-23 腾讯音乐娱乐科技(深圳)有限公司 查找语音消息的方法和装置
CN107786719A (zh) * 2016-08-25 2018-03-09 中兴通讯股份有限公司 语音文件转换方法、装置及移动终端

Family Cites Families (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP2637391B1 (en) * 2012-03-09 2019-06-19 BlackBerry Limited Message search method and electronic device
CN105049637A (zh) * 2015-08-25 2015-11-11 努比亚技术有限公司 一种控制即时通讯的装置和方法
CN105791087A (zh) * 2016-02-27 2016-07-20 深圳市金立通信设备有限公司 一种媒体分割方法及终端
CN105719642A (zh) * 2016-02-29 2016-06-29 黄博 连续长语音识别方法及系统、硬件设备
CN107305541B (zh) * 2016-04-20 2021-05-04 科大讯飞股份有限公司 语音识别文本分段方法及装置
CN106940618A (zh) * 2017-03-31 2017-07-11 珠海市魅族科技有限公司 一种语音信息的播放方法以及装置
CN107798143A (zh) * 2017-11-24 2018-03-13 珠海市魅族科技有限公司 一种信息搜索方法、装置、终端及可读存储介质

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102750365A (zh) * 2012-06-14 2012-10-24 华为软件技术有限公司 即时语音消息的检索方法和系统,以及用户设备和服务器
CN104714981A (zh) * 2013-12-17 2015-06-17 腾讯科技(深圳)有限公司 语音消息搜索方法、装置及系统
CN107786719A (zh) * 2016-08-25 2018-03-09 中兴通讯股份有限公司 语音文件转换方法、装置及移动终端
CN107622137A (zh) * 2017-10-23 2018-01-23 腾讯音乐娱乐科技(深圳)有限公司 查找语音消息的方法和装置

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112988794A (zh) * 2019-12-02 2021-06-18 深圳云天励飞技术有限公司 一种动态调整搜索策略的数据搜索方法、装置及电子设备
CN112988794B (zh) * 2019-12-02 2024-05-03 深圳云天励飞技术有限公司 一种动态调整搜索策略的数据搜索方法、装置及电子设备

Also Published As

Publication number Publication date
CN108446389A (zh) 2018-08-24
CN108446389B (zh) 2021-12-24

Similar Documents

Publication Publication Date Title
WO2019179014A1 (zh) 语音消息搜索显示方法、装置、计算机设备及存储介质
CN108874904B (zh) 语音消息搜索方法、装置、计算机设备及存储介质
US20100087169A1 (en) Threading together messages with multiple common participants
US8555156B2 (en) Inferring that a message has been read
KR20210000326A (ko) 모바일 비디오 서치 기법
CN106991179B (zh) 数据删除方法、装置及移动终端
CN108447509B (zh) 一种生成多媒体文件的方法和装置
EP3350756A1 (en) Providing collaboration communication tools within document editor
WO2022161431A1 (zh) 显示方法、装置及电子设备
US11010050B1 (en) Systems and methods for swipe-to-like
CN105094603B (zh) 一种关联输入的方法与装置
US10664482B2 (en) Providing relevance based dynamic hashtag navigation
WO2021093333A1 (zh) 音频播放方法、电子设备及存储介质
US10909146B2 (en) Providing automated hashtag suggestions to categorize communication
US9176639B1 (en) Collaborative communication system with voice and touch-based interface for content discovery
US20220019803A1 (en) Method and apparatus for analyzing video scenario
WO2017100010A1 (en) Organization and discovery of communication based on crowd sourcing
US20190349324A1 (en) Providing rich preview of communication in communication summary
US11153250B2 (en) Controlling communication of notifications to a user
CN112748828B (zh) 一种信息处理方法、装置、终端设备及介质
WO2021242381A1 (en) Machine learning-assisted graphical user interface for content organization
WO2018121487A1 (zh) 界面过滤的方法及系统
US20210158188A1 (en) Recommending news in conversation
WO2020140395A1 (zh) 一种电子设备的应用登录方法、装置、电子设备及介质
US10122666B2 (en) Retrieving and reusing stored message content

Legal Events

Date Code Title Description
NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 18911131

Country of ref document: EP

Kind code of ref document: A1