WO2020238186A1 - 语音信息的播放方法、装置及存储介质 - Google Patents

语音信息的播放方法、装置及存储介质 Download PDF

Info

Publication number
WO2020238186A1
WO2020238186A1 PCT/CN2019/128158 CN2019128158W WO2020238186A1 WO 2020238186 A1 WO2020238186 A1 WO 2020238186A1 CN 2019128158 W CN2019128158 W CN 2019128158W WO 2020238186 A1 WO2020238186 A1 WO 2020238186A1
Authority
WO
WIPO (PCT)
Prior art keywords
voice information
playing
playback
adjustment control
playback progress
Prior art date
Application number
PCT/CN2019/128158
Other languages
English (en)
French (fr)
Inventor
聂怡玲
Original Assignee
珠海格力电器股份有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 珠海格力电器股份有限公司 filed Critical 珠海格力电器股份有限公司
Publication of WO2020238186A1 publication Critical patent/WO2020238186A1/zh

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/01Input arrangements or combined input and output arrangements for interaction between user and computer
    • G06F3/048Interaction techniques based on graphical user interfaces [GUI]
    • G06F3/0484Interaction techniques based on graphical user interfaces [GUI] for the control of specific functions or operations, e.g. selecting or manipulating an object, an image or a displayed text element, setting a parameter value or selecting a range
    • G06F3/0485Scrolling or panning
    • G06F3/04855Interaction with scrollbars
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/16Sound input; Sound output
    • G06F3/165Management of the audio stream, e.g. setting of volume, audio stream path
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L51/00User-to-user messaging in packet-switching networks, transmitted according to store-and-forward or real-time protocols, e.g. e-mail
    • H04L51/04Real-time or near real-time messaging, e.g. instant messaging [IM]
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L51/00User-to-user messaging in packet-switching networks, transmitted according to store-and-forward or real-time protocols, e.g. e-mail
    • H04L51/07User-to-user messaging in packet-switching networks, transmitted according to store-and-forward or real-time protocols, e.g. e-mail characterised by the inclusion of specific contents
    • H04L51/18Commands or executable codes

Definitions

  • This application relates to the field of instant messaging technology, and in particular to a method, device and storage medium for playing voice information.
  • instant messaging application software With the development of network technology, the application of instant messaging application software is becoming more and more extensive. The popularity of instant messaging applications meets people's daily social needs, and people can interact with other users anytime and anywhere through instant messaging applications.
  • various instant messaging applications generally support the interaction of voice information, that is, users can send voice information to other users through instant messaging software, and the receiver can click to play the voice information after receiving the voice information.
  • the voice information interaction function of instant messaging software still has some defects, which cause inconvenience to users and affect user experience.
  • this application provides a method, device and storage medium for playing voice information.
  • a method for playing voice information including:
  • the playback progress bar including a playback progress adjustment control
  • the playback progress adjustment control is used to adjust the voice information to be played at the position indicated by the user.
  • the method for playing the voice information further includes: when a trigger operation to stop playing is received during the playing of the voice information, recording the stop point of the voice information;
  • the trigger operation to stop playback includes one of the following: a trigger operation to pause playback and a trigger operation to exit playback.
  • the method when the playback progress bar is generated, the method further includes: generating a double-speed playback adjustment control, the double-speed playback adjustment control being used to set the playback speed of the voice information.
  • the method for playing the voice information further includes: parsing the semantic content of the voice information, and displaying expressions matching the keywords extracted from the semantic content.
  • the displaying the emoticons matching the keywords extracted from the semantic content includes:
  • the emoticons matching the keywords extracted from the semantics in the instant messaging interface are displayed for a preset period of time or displayed as instant messaging information.
  • this application also provides a voice information playback device, including:
  • a generating module configured to generate a playback progress bar corresponding to the voice information according to the duration of the received voice information, the playback progress bar including a playback progress adjustment control;
  • the playback progress adjustment control is used to adjust the voice information to be played at the position indicated by the user.
  • the device for playing the voice information further includes a playing module, which is used to record the stop playing point of the voice information when a trigger operation to stop playing is received during the playing of the voice information; When the trigger operation to continue playing is reached, the voice information continues to be played from the recorded stop playing point.
  • a playing module which is used to record the stop playing point of the voice information when a trigger operation to stop playing is received during the playing of the voice information; When the trigger operation to continue playing is reached, the voice information continues to be played from the recorded stop playing point.
  • the device for playing the voice information further includes a display module for parsing the semantic content of the voice information and displaying expressions matching the keywords extracted from the semantic content.
  • this application also provides a voice information playback device, including: a memory and a processor, wherein:
  • the memory is used to store one or more computer instructions, and when the one or more computer instructions are executed by the processor, the foregoing method for playing voice information is realized.
  • this application also provides a computer storage medium, the storage medium is used to store a computer program, and the computer program is used to enable the computer to implement the above-mentioned method for playing voice information when executed.
  • the instant messaging application software when the instant messaging application software receives the voice information, it generates a corresponding playback progress bar based on the duration of the voice information, and the playback progress bar includes a playback progress adjustment control.
  • the playback progress of the voice information can be obtained based on the playback progress bar, and the playback position of the voice information can be adjusted based on the playback progress adjustment control, that is, the user can adjust the voice information to the indicated position through the playback progress adjustment control.
  • the flexibility of the voice information playback in the instant messaging application software is increased through the solutions of the embodiments of the present application.
  • Figure 1 is a flowchart of a method for playing voice information provided by this application
  • FIG. 2 is a flowchart of another method for playing voice information provided by this application.
  • FIG. 3 is a specific schematic diagram of a method for playing voice information provided by this application.
  • Figure 4 is a schematic structural diagram of a voice information playback device provided by this application.
  • Fig. 5 is a schematic structural diagram of a terminal device corresponding to the device for playing voice information shown in Fig. 4.
  • the embodiment of the application provides a method for playing voice information, in which a playing progress bar corresponding to the voice information is generated according to the duration of the received voice information, and the playing progress bar includes a playing progress adjustment control;
  • the playback progress adjustment control is used to adjust the voice information to be played at the position indicated by the user. Therefore, in the process of playing the voice information, the user can adjust the voice information to the indicated position through the playing progress adjustment control, which increases the flexibility of the voice information playing in the instant messaging application software.
  • Fig. 1 is a flowchart of a method for playing voice information provided by this application.
  • the voice information playback method shown in Figure 1 is executed by a terminal device installed with instant messaging application software, and the instant messaging application software supports the interaction of voice information, such as mobile phones and computers installed with various instant messaging application software , Bracelets and other terminal equipment.
  • the method for playing the voice information includes:
  • S101 Generate a playback progress bar corresponding to the voice information according to the duration of the received voice information, where the playback progress bar includes a playback progress adjustment control.
  • the instant messaging application software on the terminal device After the instant messaging application software on the terminal device receives the voice information, it parses the received voice information to obtain the duration of the voice information, and generates a playback progress bar corresponding to the voice information according to the duration of the voice information.
  • the generated playback progress bar It includes playback progress adjustment controls.
  • the foregoing generating a playback progress bar corresponding to the voice information includes: the instant messaging application software generates a playback progress bar and a playback progress adjustment control corresponding to the voice information after receiving the voice information, and displays it on the instant messaging interface Always display the playback progress bar and playback progress adjustment controls; alternatively, the instant messaging application software will generate the corresponding playback progress bar and playback progress adjustment controls after receiving the voice information, but will only display it after receiving the user's designated action such as a playback trigger operation The playback progress bar and the playback progress adjustment controls, if the user's designated trigger operation is not received, the playback progress bar and the playback progress adjustment controls are in a hidden state.
  • the instant messaging application software generates the playback progress bar and the playback progress adjustment control corresponding to the voice information only after receiving the user's action to play the voice information.
  • the playback progress bar and the playback progress adjustment control are always displayed, or , It will be displayed on the instant messaging interface only after receiving the user's designated action. If the user's designated action is not received, the playback progress bar and playback progress adjustment controls are in a hidden state.
  • the playback progress bar is used to display the playback progress of the voice information, and the playback progress adjustment control is used to adjust the voice information to the user. Play at the indicated position.
  • the playback progress of the voice information can be learned during the playback of the voice information and the playback position can be adjusted as needed.
  • the user can skip fuzzy voice content and skip based on the playback progress adjustment control. Turn to key content, etc., to provide users with flexible voice playback methods to meet users' individual needs for playing voice information.
  • FIG. 2 is a flowchart of a method for playing voice information provided by still other embodiments of the application. As shown in Figure 2, the method includes:
  • the instant messaging application software After receiving the voice information, the instant messaging application software parses the received voice information to determine the duration of the voice information.
  • S202 Generate a playback progress bar corresponding to the voice information according to the duration of the voice information, where the playback progress bar includes a playback progress adjustment control.
  • the initial position of the playback progress adjustment control is located at the start position of the playback progress bar
  • the end position is located at the end of the playback progress bar
  • the playback progress adjustment control can move within the range of the start and end positions.
  • the instant messaging application software after receiving the user's instruction to click to play voice information, the instant messaging application software generates a playback progress bar corresponding to the voice information, that is, a playback progress adjustment control.
  • the instant messaging application software in addition to generating a voice playback progress bar and a playback progress adjustment control, the instant messaging application software also generates a double-speed playback adjustment control, and the double-speed playback adjustment control is used to set the playback speed of the voice information.
  • the double-speed playback adjustment control includes multiple double-speed selection gears, such as 1x, 1.5x, 2x, and so on.
  • the voice message is played at 1x speed.
  • the voice message is played at the double speed selected by the user.
  • the double-speed selection gear includes multiple touch display methods, such as displaying all double-speed gears in a list, or expressing the double-speed gear by the number of consecutive user clicks. For example, the user clicks the double-speed playback adjustment control once, and the double-speed value is based on 1 times. Increase by 0.5.
  • the display position of the double-speed playback adjustment control can be flexibly set, for example, at the end of the playback progress bar, or set at other positions according to actual needs.
  • the playback progress bar is used to display the playback progress of the voice information
  • the playback progress adjustment control is used to adjust the voice information to the user. Instruct the location to play, and if the user selects multiple speed playback, the voice information is played at the multiple speed selected by the user.
  • the trigger operation to stop playback includes the trigger operation to pause playback or the trigger operation to exit playback. For example, after exiting the current instant messaging interface or switching to other applications or clicking on the voice message again during the voice message playback, the voice message stops. Play.
  • the double-speed playback of voice information is supported, and the playback point of the voice information can be memorized.
  • the voice message can be played again. Continue to play the voice information from the playback point of the recorded voice information.
  • the method for playing voice information of the present application further includes: parsing the semantic content of the voice information and displaying the keywords that match the keywords extracted from the semantic content. expression. For example, in the process of playing the voice information, the semantics of the played voice content is analyzed, and expressions matching the keywords extracted from the semantics are displayed.
  • the emoticons are personalized texts, graphics, etc. that match keywords in the semantic content
  • the personalized texts/graphics include statically displayed texts/graphics or dynamic texts/graphics.
  • the displaying the expressions matching the keywords extracted from the semantic content includes: displaying the dynamic expressions matching the keywords extracted from the semantics in the instant messaging interface for a preset duration or as an instant Communication information is displayed.
  • the preset duration such as a few seconds
  • the played voice information is parsed, and the keywords in the parsed voice content are matched with expressions.
  • the matched expressions are only displayed for a period of time in the instant messaging interface, such as a few seconds to increase the voice The fun of communication.
  • the aforementioned emoticons matching the keywords in the semantic content can also be displayed as instant messaging information, and displaying as instant messaging information refers to displaying the matched emoticons as information sent by the sender.
  • FIG. 3 is a specific schematic diagram of a method for playing voice information provided by an embodiment of the application.
  • a playback progress bar corresponding to the voice information is displayed.
  • the playback progress bar includes a playback progress adjustment control and a double-speed playback adjustment control. The user can play the voice at multiple speeds based on the above-mentioned controls and freely adjust Voice message progress.
  • the other party sends a voice of up to 60 seconds, and there is no focus when speaking.
  • the playback progress bar displayed under the voice message click 1.5 or click again to adjust the speed to 2 times.
  • the playback speed is accelerated, and the speech becomes less procrastinated.
  • you want to repeat which key sentence to listen to drag the progress bar to determine the key word, quickly and accurately.
  • the user can also find the playback point when the playback is exited due to an operation error through the freeze display of the playback progress bar.
  • the user clicks on a piece of voice message to play he accidentally touches and exits the voice play interface during the play, which can memorize the user's play progress.
  • the voice clicks to enter again the voice will automatically start playing from the position where it stopped last time, and the playback progress bar below the voice message will display the recording position at the time of the last exit. There is no need to play again from the starting position, which improves the efficiency of voice communication and prompts the user Interactive experience.
  • the instant messaging application software when the voice message is played, extracts keywords based on the meaning of the words in the voice, and displays dynamic emoticons corresponding to the keywords on the chat interface.
  • the dynamic emoticon package is displayed in the chat interface for a preset period of time, such as staying for 5 seconds.
  • the dynamic emoticon package is displayed as a piece of interactive information, and the displayed dynamic emoticon package supports emoticon collection. Displaying dynamic expression packs when playing voice information can personalize user information, enhance emotional communication between users, and increase the interest of voice information interaction.
  • FIG. 4 is a schematic structural diagram of a voice information playback device provided by an embodiment of the application. As shown in Figure 4, the device shown includes:
  • the generating module 11 is configured to generate a playback progress bar corresponding to the voice information according to the duration of the received voice information.
  • the playback progress bar includes a playback progress adjustment control; the playback progress adjustment control is used to adjust the voice information to Play at the location indicated by the user.
  • the device further includes a playback module 12, which is used to record the stop playback point of the voice information when a trigger operation to stop playback is received during the playback of the voice information; When the triggering operation is performed, the voice information continues to be played from the recorded stop playing point.
  • a playback module 12 which is used to record the stop playback point of the voice information when a trigger operation to stop playback is received during the playback of the voice information; When the triggering operation is performed, the voice information continues to be played from the recorded stop playing point.
  • the trigger operation to stop playback includes one of the following: a trigger operation to pause playback and a trigger operation to exit playback.
  • the generating module 11 when the generating module 11 generates the playback progress bar, it is also used to generate a double-speed playback adjustment control, and the double-speed playback adjustment control is used to set the playback speed of the voice information.
  • the device further includes a display module 13 for parsing the semantic content of the voice information and displaying expressions matching the keywords extracted from the semantic content.
  • the display module 13 displays the expressions matching the keywords extracted from the semantic content, which specifically includes: displaying the expressions matching the keywords extracted from the semantics in the instant messaging interface for a preset duration Or display it as instant messaging information.
  • the voice information playback device shown in FIG. 4 can execute the voice information playback method of the embodiments shown in FIGS. 1-3.
  • the parts not described in detail in this embodiment please refer to the relevant descriptions of the embodiments shown in FIGS. 1-3 .
  • the structure of the device for playing voice information may be implemented as a terminal device.
  • the terminal device includes: a processor 21 and a memory 22.
  • the memory 22 is used to store a program that supports the terminal device to execute the voice information playback method provided in the embodiments shown in FIGS. 1 to 3, and the processor 21 is configured to execute based on the program stored in the memory 22 The method for playing the voice information.
  • the structure of the terminal device further includes a communication interface 23 for the terminal device to communicate with other devices, such as a storage node or a communication network.
  • an embodiment of the present application provides a computer storage medium for storing computer software instructions used by the above-mentioned terminal device, which includes instructions for executing the voice information playback method in the above-mentioned method embodiments shown in FIGS. 1 to 3 program.
  • Computer-readable media include permanent and non-permanent, removable and non-removable media, and information storage can be realized by any method or technology.
  • Information includes computer-readable instructions, data structures, program modules, or other data.
  • Examples of computer storage media include, but are not limited to, phase change memory (PRAM), static random access memory (SRAM), dynamic random access memory (DRAM), other types of random access memory (RAM), read-only memory (ROM), electrically erasable programmable read-only memory (EEPROM), flash memory or other memory technologies, CD-ROM, digital versatile disc (DVD) or other optical storage, magnetic Cassette tape, magnetic tape magnetic disk storage or other magnetic storage devices or any other non-transmission media can be used to store information that can be accessed by computing devices.
  • computer-readable media does not include transitory media, such as modulated data signals and carrier waves.

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • General Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Signal Processing (AREA)
  • General Physics & Mathematics (AREA)
  • Human Computer Interaction (AREA)
  • Multimedia (AREA)
  • General Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Health & Medical Sciences (AREA)
  • User Interface Of Digital Computer (AREA)

Abstract

本申请涉及一种语音信息的播放方法、装置及存储介质。其中,所述语音信息的播放方法包括:根据接收到的语音信息的时长生成与所述语音信息对应的播放进度条,所述播放进度条包括播放进度调节控件;所述播放进度调节控件用于将语音信息调节到用户指示位置处播放。基于本申请,在即时通信应用中播放语音信息时可以通过播放进度调节控件将语音信息调节到指示位置处播放,便于即时通信应用软件中语音信息的灵活播放,提升用户交互体验。

Description

语音信息的播放方法、装置及存储介质
相关申请
本申请要求2019年05月24日申请的,申请号为201910441998.1,名称为“一种语音信息的播放方法、装置及存储介质”的中国专利申请的优先权,在此将其全文引入作为参考。
技术领域
本申请涉及即时通信技术领域,尤其涉及一种语音信息的播放方法、装置及存储介质。
背景技术
随着网络技术的发展,即时通信应用软件的应用越来越广泛。即时通信应用的普及满足了人们日常社交的需求,人们可以通过即时通信应用随时随地与其他用户进行信息交互。在目前的各类即时通信应用中普遍支持语音信息的交互,即用户可以通过即时通信软件向其他用户发送语音信息,接收方接收到语音信息后点击即可播放语音信息。
相关技术中,即时通信软件的语音信息交互功能还存在着一些缺陷,给用户带来不便,影响用户体验效果。
发明内容
为了解决即时通信软件的语音信息交互功能还存在一些缺陷,给用户带来不便的问题,本申请提供了一种语音信息的播放方法、装置及存储介质。
一种语音信息的播放方法,包括:
根据接收到的语音信息的时长生成与所述语音信息对应的播放进度条,所述播放进度条包括播放进度调节控件;
所述播放进度调节控件用于将语音信息调节到用户指示位置处播放。
在一实施例中,所述语音信息的播放方法还包括:在所述语音信息的播放过程中接收到停止播放的触发操作时,记录所述语音信息的停止播放点;
当接收到继续播放的触发操作时,从记录的所述停止播放点继续播放所述语音信息。
在一实施例中,所述停止播放的触发操作包括以下一种:暂停播放触发操作和退出播放触发操作。
在一实施例中,在生成所述播放进度条时,还包括:生成倍速播放调节控件,所述倍 速播放调节控件用于用于设置所述语音信息的播放倍速。
在一实施例中,所述语音信息的播放方法还包括:解析所述语音信息的的语义内容,并且显示与从所述语义内容中提取的关键词匹配的表情。
在一实施例中,所述显示与从所述语义内容中提取的关键词匹配的表情,包括:
在即时通信界面中与从所述语义中提取的关键词匹配的表情显示预设时长或者作为即时通信信息进行显示。
基于同一申请思路,本申请还提供了一种语音信息的播放装置,包括:
生成模块,用于根据接收到的语音信息的时长生成与所述语音信息对应的播放进度条,所述播放进度条包括播放进度调节控件;
所述播放进度调节控件用于将语音信息调节到用户指示位置处播放。
在一实施例中,所述语音信息的播放装置还包括播放模块,用于在所述语音信息的播放过程中接收到停止播放的触发操作时,记录所述语音信息的停止播放点;当接收到继续播放的触发操作时,从记录的所述停止播放点继续播放所述语音信息。
在一实施例中,所述语音信息的播放装置还包括:显示模块,用于解析所述语音信息的语义内容,并且显示与从所述语义内容中提取的关键词匹配的表情。
基于同一申请思路,本申请还提供了一种语音信息的播放装置,包括:存储器、处理器,其中:
所述存储器用于存储一条或多条计算机指令,所述一条或多条计算机指令被所述处理器执行时实现上述的语音信息的播放方法。
基于同一申请思路,本申请还提供了一种计算机存储介质,所述存储介质用于存储计算机程序,所述计算机程序用于使计算机执行时实现上述的语音信息的播放方法。
在本申请提供的技术方案中,即时通信应用软件接收到语音信息时,基于语音信息的时长生成对应的播放进度条,在播放进度条上包括播放进度调节控件。在语音信息播放的过程中,基于播放进度条能够获知语音信息的播放进度,基于播放进度调节控件能够调节语音信息的播放位置,即用户能够通过播放进度调节控件将语音信息调节到指示位置处播放,通过本申请实施例方案增加了即时通信应用软件中语音信息播放的灵活性。
附图说明
为了更清楚地说明本申请实施例中的技术方案,下面将对实施例描述中所需要使用的附图作简单地介绍,显而易见地,对于本领域普通技术人员而言,在不付出创造性劳动性的前提下,还可以根据这些附图获得其他的附图。
图1为本申请提供的语音信息的播放方法的流程图;
图2为本申请提供的又一语音信息的播放方法的流程图;
图3为本申请提供的一种语音信息的播放方法的具体示意图;
图4为本申请提供的一种语音信息的播放装置的结构示意图;
图5为图4所示语音信息的播放装置对应的终端设备的结构示意图。
具体实施方式
为使本申请实施例的目的、技术方案和优点更加清楚,下面将结合本申请实施例中的附图,对本申请实施例中的技术方案进行清楚、完整地描述,显然,所描述的实施例是本申请的一部分实施例,而不是全部的实施例。基于本申请中的实施例,本领域普通技术人员在没有做出创造性劳动的前提下所获得的所有其他实施例,都属于本申请保护的范围。
为了解决在即时通信中播放语音信息时每次均只能从开始位置播放,对用户造成语音信息播放不便的技术问题。本申请实施例提供了一种语音信息的播放方法,在该方法中根据接收到的语音信息的时长生成与所述语音信息对应的播放进度条,所述播放进度条包括播放进度调节控件;所述播放进度调节控件用于将语音信息调节到用户指示位置处播放。由此在语音信息播放的过程中,用户能够通过播放进度调节控件将语音信息调节到指示位置处播放,增加了即时通信应用软件中语音信息播放的灵活性。
以下将结合具体实施例对本申请的语音信息的播放方法进行详细说明。
图1为本申请提供的语音信息的播放方法的流程图。图1所示语音信息的播放方法的执行主体为安装有即时通信应用软件的终端设备,并且所述的即时通信应用软件支持语音信息的交互,如安装有各类即时通信应用软件的手机、电脑、手环等终端设备。如图1所示,该语音信息的播放方法包括:
S101:根据接收到的语音信息的时长生成与所述语音信息对应的播放进度条,所述播放进度条包括播放进度调节控件。
终端设备上的即时通信应用软件接收到语音信息之后,对接收到的语音信息进行解析得到语音信息的时长,并且根据语音信息的时长生成与语音信息对应的播放进度条,在生成的播放进度条上包括播放进度调节控件。
在一些实施例中,上述生成与语音信息对应的播放进度条包括:即时通信应用软件在接收到语音信息后即生成与语音信息对应的播放进度条及播放进度调节控件,并且在即时通信界面上始终显示播放进度条和播放进度调节控件;或者,即时通信应用软件接收到语音信息后即生成对应的播放进度条和播放进度调节控件,但在接收到用户的指定动作如播 放触发操作后才显示播放进度条和播放进度调节控件,若未接收到的用户的指定触发操作,播放进度条和播放进度调节控件处于隐藏状态。在其它一些实施例中,即时通信应用软件在接收到用户播放语音信息的动作触发后才生成与语音信息对应的播放进度条和播放进度调节控件,播放进度条和播放进度调节控件始终显示,或者,仅在接收到用户的指定动作后才在即时通信界面显示,若未接收到用户的指定动作播放进度条和播放进度调节控件处于隐藏状态。
S102:响应于用户触发的播放操作,在所述语音信息的播放过程中,所述播放进度条用于显示所述语音信息的播放进度,所述播放进度调节控件用于将语音信息调节到用户指示位置处播放。
基于与语音信息对应的播放进度条和播放进度调节控件,在语音信息的播放过程中能够获知语音信息的播放进度并且根据需要调节播放位置,例如用户基于播放进度调节控件跳过模糊语音内容、跳转到关键内容等,为用户提供灵活的语音播放方式满足用户播放语音信息的个性化需求。
图2为本申请又一些实施例提供的语音信息的播放方法的流程图。如图2所示,该方法包括:
S201:即时通信应用软件接收到语音信息后,对接收到的语音信息进行解析,确定语音信息的时长。
S202:根据语音信息的时长生成与语音信息对应的播放进度条,所述播放进度条包括播放进度调节控件。在一些实施例中,播放进度调节控件的初始位置位于播放进度条的起点位置,终点位置位于播放进度条的终点,并且播放进度调节控件能够在起始位置和终点位置的行程范围内移动。
在一些具体实施例中,当接收到用户点击播放语音信息的指令后,即时通信应用软件生成与该语音信息对应的播放进度条即播放进度调节控件。
S203:基于所述语音信息除了生成播放进度条之外还生成倍速播放调节控件。
在一些实施例中,即时通信应用软件除了生成语音播放进度条和播放进度调节控件之外,还生成倍速播放调节控件,所述倍速播放调节控件用于设置所述语音信息的播放倍速。
在一些实施例中,倍速播放调节控件包括多个倍速选择档位,如1倍、1.5倍、2倍等。在默认方式下语音信息以1倍速播放,当用户选择其他倍速档位时语音信息以用户所选择的倍速播放。倍速选择档位包括多种触控显示方式,如以列表方式显示所有倍速档位,或者以用户连续点击次数表示倍速档位,例如用户点击一次倍速播放调节控件,倍速值在1倍的基础上增加0.5。
在一些实施例中,倍速播放调节控件的显示位置能够灵活设置,如设置在播放进度条的尾部,或者根据实际需要设置在其他位置。
S204:响应于用户触发的播放操作,在所述语音信息的播放过程中,所述播放进度条用于显示所述语音信息的播放进度,所述播放进度调节控件用于将语音信息调节到用户指示位置处播放,并且若用户选择了多倍速播放则以用户选择的多倍速播放语音信息。
S205:在所述语音信息的播放过程中判断是否接收到停止播放的触发操作。其中,停止播放的触发操作包括暂停播放的触发操作或者是退出播放的触发操作,例如退出了当前即时通信界面或者切换到其它应用或者在语音信息播放过程中再次点击了语音信息后,语音信息停止播放。
S206:如果在语音信息的播放过程中接收到停止播放的触发操作,则记录所述语音信息的停止播放点。
S207:当接收到继续播放所述语音信息的触发操作时,从记录的所述停止播放点继续播放所述语音信息。
在本申请实施例的上述方案中,支持对语音信息的倍速播放,并且能够记忆语音信息的播放点,当在语音信息播放过程中误操作退出了语音播放当再次播放该条语音信息时,能够从记录的语音信息播放点继续对语音信息进行播放。
另外为了增加语音信息沟通过程中的趣味性,在一些实施例中,本申请的语音信息的播放方法还包括:解析语音信息的语义内容并且显示与从所述语义内容中提取的关键词匹配的表情。例如,在语音信息播放的过程中解析所播放的语音内容的语义,并显示与从语义中提取的关键词匹配的表情。需要说明的是,所述表情是与语义内容中的关键词所匹配的个性化文字、图形等,所述个性化文字/图形包括静态展示的文字/图形或动态的文字/图形。
在一些实施例中,所述显示与从语义内容中提取的关键词匹配的表情,包括:在即时通信界面中与从所述语义中提取的关键词匹配的动态表情显示预设时长或者作为即时通信信息进行显示。在一些实施例中,在对与语义内容中的关键词匹配的表情进行显示时,仅显示预设时长如几秒钟。例如,在语音播放过程中对播放的语音信息进行解析,并对解析出的语音内容中的关键词匹配表情,匹配出的表情在即时通信界面中仅显示一段时间,如几秒钟以增加语音通信过程的乐趣。
在一些实施例中,上述与语义内容中的关键词匹配的表情也能够作为即时通信信息进行显示,作为即时通信信息进行显示是指将匹配出的表情作为发送方发送的信息进行显示。
图3为本申请实施例提供的一种语音信息的播放方法的具体示意图。如图3所示,当用户点击播放语音信息时,显示与语音信息对应的播放进度条、播放进度条上包括播放进度调节控件和倍速播放调节控件,用户能够基于上述控件倍速播放语音,自由调节语音信息进度。
例如,在即时通信聊天过程中对方发送了最长60秒的语音,说话拖拖拉拉没有重点,此时通过语音信息下显示的播放进度条,点击1.5或再次点击倍速调整到2倍速播放,语音播放的速度加快,说话变得不拖沓。当想重复听哪一段关键语句时,拖动进度条确定关键词,快速准确。
在一些实施例中,用户还能够通过播放进度条的定格显示,找到由于操作失误退出播放时的播放点。当用户点击一段语音信息进行播放时,播放中不小心误触退出了语音播放界面,能够记忆用户的播放进度。当用户再次点击进入时,语音自动从上一次停止的位置开始播放,并且语音信息下方的播放进度条会显示上次退出时的记录位置,无需从开始位置再次播放,提高语音沟通效率,提示用户交互体验。
在其它一些实施例中,在播放语音信息时,即时通信应用软件根据语音中的词义,提取关键词,并在聊天界面显示与关键词对应的动态表情包。在一些实施例中,动态表情包在聊天界面中显示预设时长,如停留5秒。在其它一些实施例中,动态表情包作为一条交互信息进行显示,并且显示的动态表情包支持表情收藏。在播放语音信息时显示动态表情包能够个性化表达用户信息,增强用户间的感情交流,增加语音信息交互的趣味性。
上述描述了本申请实施例的语音信息的播放方法,以下将详细描述本申请的语音信息的播放装置。本领域技术人员可以理解,这些装置均可使用市售的硬件组件通过本方案所教导的步骤进行配置来构成。
图4为本申请实施例提供的一种语音信息的播放装置的结构示意图。如图4所示,所示装置包括:
生成模块11,用于根据接收到的语音信息的时长生成与所述语音信息对应的播放进度条,所述播放进度条包括播放进度调节控件;所述播放进度调节控件用于将语音信息调节到用户指示位置处播放。
在一些实施例中,所述装置还包括播放模块12,用于在所述语音信息的播放过程中接收到停止播放的触发操作时,记录所述语音信息的停止播放点;当接收到继续播放的触发操作时,从记录的所述停止播放点继续播放所述语音信息。
在一些实施例中,所述停止播放的触发操作包括以下一种:暂停播放触发操作和退出播放触发操作。
在一些实施例中,所述生成模块11在生成所述播放进度条时,还用于生成倍速播放调节控件,所述倍速播放调节控件用于设置所述语音信息的播放倍速。
在一些实施例中,所述装置还包括显示模块13,用于解析所述语音信息的语义内容,并且显示与从所述语义内容中提取的关键词匹配的表情。
在一些实施例中,显示模块13显示与从所述语义内容中提取的关键词匹配的表情,具体包括:在即时通信界面中与从所述语义中提取的关键词匹配的表情显示预设时长或者作为即时通信信息进行显示。
图4所示语音信息的播放装置可以执行图1-图3所示实施例的语音信息的播放方法,本实施例未详细描述的部分,参考对图1-图3所示实施例的相关说明。该技术方案的执行过程和技术效果参见图1-图3所示实施例中的描述,在此不再赘述。
以上描述了语音信息的播放装置的内部功能和结构,在一些实施例中,该语音信息的播放装置的结构可实现为一终端设备。在一些实施例中,如图5所示,该终端设备包括:处理器21和存储器22。其中,所述存储器22用于存储支持该终端设备执行上述图1-图3所示实施例中提供的语音信息播放方法的程序,所述处理器21被配置为基于存储器22中存储的程序执行所述语音信息的播放方法。在一些实施例中,所述终端设备的结构中还包括通信接口23,用于该终端设备与其他设备比如存储节点或通信网络通信。
另外,本申请实施例提供了一种计算机存储介质,用于储存上述终端设备所用的计算机软件指令,其包含用于执行上述图1-图3所示方法实施例中语音信息播放方法所涉及的程序。
计算机可读介质包括永久性和非永久性、可移动和非可移动媒体可以由任何方法或技术来实现信息存储。信息包括计算机可读指令、数据结构、程序的模块或其他数据。计算机的存储介质的例子包括但不限于相变内存(PRAM)、静态随机存取存储器(SRAM)、动态随机存取存储器(DRAM)、其他类型的随机存取存储器(RAM)、只读存储器(ROM)、电可擦除可编程只读存储器(EEPROM)、快闪记忆体或其他内存技术、只读光盘只读存储器(CD-ROM)、数字多功能光盘(DVD)或其他光学存储、磁盒式磁带,磁带磁磁盘存储或其他磁性存储设备或任何其他非传输介质,可用于存储能够被计算设备访问的信息。按照本文中的界定,计算机可读介质不包括暂存电脑可读媒体(transitory media),如调制的数据信号和载波。
需要说明的是,在本文中,诸如“第一”和“第二”等之类的关系术语仅仅用来将一个实体或者操作与另一个实体或操作区分开来,而不一定要求或者暗示这些实体或操作之间存在任何这种实际的关系或者顺序。而且,术语“包括”、“包含”或者其任何其他变体 意在涵盖非排他性的包含,从而使得包括一系列要素的过程、方法、物品或者设备不仅包括那些要素,而且还包括没有明确列出的其他要素,或者是还包括为这种过程、方法、物品或者设备所固有的要素。在没有更多限制的情况下,由语句“包括一个……”限定的要素,并不排除在包括所述要素的过程、方法、物品或者设备中还存在另外的相同要素。
以上所述仅是本申请的具体实施方式,使本领域技术人员能够理解或实现本申请。对这些实施例的多种修改对本领域的技术人员来说将是显而易见的,本文中所定义的一般原理可以在不脱离本申请的精神或范围的情况下,在其它实施例中实现。因此,本申请将不会被限制于本文所示的这些实施例,而是要符合与本文所申请的原理和新颖特点相一致的最宽的范围。

Claims (11)

  1. 一种语音信息的播放方法,其特征在于,包括:
    根据接收到的语音信息的时长生成与所述语音信息对应的播放进度条,所述播放进度条包括播放进度调节控件;
    所述播放进度调节控件用于将语音信息调节到用户指示位置处播放。
  2. 根据权利要求1所述的语音信息的播放方法,其特征在于,还包括:
    在所述语音信息的播放过程中接收到停止播放的触发操作时,记录所述语音信息的停止播放点;
    当接收到继续播放的触发操作时,从记录的所述停止播放点继续播放所述语音信息。
  3. 根据权利要求2所述的语音信息的播放方法,其特征在于,所述停止播放的触发操作包括以下一种:暂停播放触发操作和退出播放触发操作。
  4. 根据权利要求1至3中任一项所述的语音信息的播放方法,其特征在于,在生成所述播放进度条时,还包括:
    生成倍速播放调节控件,所述倍速播放调节控件用于设置所述语音信息的播放倍速。
  5. 根据权利要求1所述的语音信息的播放方法,其特征在于,还包括:
    解析所述语音信息的语义内容,并且显示与从所述语义内容中提取的关键词匹配的表情。
  6. 根据权利要求5所述的语音信息的播放方法,其特征在于,所述显示与从所述语义内容中提取的关键词匹配的表情,包括:
    在即时通信界面中与从所述语义中提取的关键词匹配的表情显示预设时长或者作为即时通信信息进行显示。
  7. 一种语音信息的播放装置,其特征在于,包括:
    生成模块,用于根据接收到的语音信息的时长生成与所述语音信息对应的播放进度条,所述播放进度条包括播放进度调节控件;
    所述播放进度调节控件用于将语音信息调节到用户指示位置处播放。
  8. 根据权利要求7所述的语音信息的播放装置,其特征在于,还包括播放模块,用于在所述语音信息的播放过程中接收到停止播放的触发操作时,记录所述语音信息的停止播放点;当接收到继续播放的触发操作时,从记录的所述停止播放点继续播放所述语音信息。
  9. 根据权利要求7所述的语音信息的播放装置,其特征在于,还包括:显示模块,用于解析所述语音信息的语义内容,并且显示与从所述语义内容中提取的关键词匹配的表 情。
  10. 一种语音信息的播放装置,其特征在于,包括:存储器、处理器,其中:
    所述存储器用于存储一条或多条计算机指令,所述一条或多条计算机指令被所述处理器执行时实现如权利要求1至6中任一项所述的语音信息的播放方法。
  11. 一种计算机存储介质,其特征在于,所述存储介质用于存储计算机程序,所述计算机程序用于使计算机执行时实现如权利要求1至6中任一项所述的语音信息的播放方法。
PCT/CN2019/128158 2019-05-24 2019-12-25 语音信息的播放方法、装置及存储介质 WO2020238186A1 (zh)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201910441998.1 2019-05-24
CN201910441998.1A CN110365574A (zh) 2019-05-24 2019-05-24 一种语音信息的播放方法、装置及存储介质

Publications (1)

Publication Number Publication Date
WO2020238186A1 true WO2020238186A1 (zh) 2020-12-03

Family

ID=68215301

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2019/128158 WO2020238186A1 (zh) 2019-05-24 2019-12-25 语音信息的播放方法、装置及存储介质

Country Status (2)

Country Link
CN (1) CN110365574A (zh)
WO (1) WO2020238186A1 (zh)

Families Citing this family (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110365574A (zh) * 2019-05-24 2019-10-22 珠海格力电器股份有限公司 一种语音信息的播放方法、装置及存储介质
CN110830654B (zh) * 2019-11-07 2020-12-08 腾讯科技(深圳)有限公司 一种播放语音消息的方法及设备
CN111026358B (zh) * 2019-12-24 2023-05-02 北京明略软件系统有限公司 一种语音消息的播放方法、播放装置及可读存储介质
CN111641551A (zh) * 2020-05-27 2020-09-08 维沃移动通信有限公司 语音播放方法、语音播放装置和电子设备
CN112511407B (zh) * 2020-10-30 2022-04-29 国网山东省电力公司泰安供电公司 自适应语音播放方法和系统
CN112511406B (zh) * 2020-10-30 2022-04-29 国网山东省电力公司泰安供电公司 即时通讯软件的语音播放方法和系统
CN113778370A (zh) * 2021-09-13 2021-12-10 周鹏程 一种语音消息播放方法、装置、电子设备及存储介质
CN117014397A (zh) * 2022-09-02 2023-11-07 腾讯科技(深圳)有限公司 基于语音消息的交互方法、装置、计算机设备和存储介质
CN115497489A (zh) * 2022-09-02 2022-12-20 深圳传音通讯有限公司 语音交互方法、智能终端及存储介质
CN115499401A (zh) * 2022-10-18 2022-12-20 康键信息技术(深圳)有限公司 一种播放语音数据的方法、系统、计算机设备及介质

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104918121A (zh) * 2014-03-13 2015-09-16 阿里巴巴集团控股有限公司 媒体流播放控制方法及客户端
WO2017186015A1 (zh) * 2016-04-26 2017-11-02 厦门幻世网络科技有限公司 一种为视听化数字媒体配音的方法及装置
CN109379497A (zh) * 2018-12-28 2019-02-22 努比亚技术有限公司 语音信息播放方法、移动终端及计算机可读存储介质
CN109407944A (zh) * 2018-09-29 2019-03-01 传线网络科技(上海)有限公司 多媒体资源播放调节方法及装置
CN109768913A (zh) * 2018-12-11 2019-05-17 平安科技(深圳)有限公司 信息处理方法、装置、计算机设备及存储介质
CN110365574A (zh) * 2019-05-24 2019-10-22 珠海格力电器股份有限公司 一种语音信息的播放方法、装置及存储介质

Family Cites Families (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20050096909A1 (en) * 2003-10-29 2005-05-05 Raimo Bakis Systems and methods for expressive text-to-speech
CN101345790A (zh) * 2007-07-09 2009-01-14 上海基信通讯技术有限公司 在手机上对音频文件进行编辑的方法
CN102811182A (zh) * 2012-08-10 2012-12-05 上海量明科技发展有限公司 即时通信中播放音频消息的方法、客户端及系统
CN106412272A (zh) * 2016-09-23 2017-02-15 珠海格力电器股份有限公司 提示移动终端位置的方法、装置及移动终端
CN106789581A (zh) * 2016-12-23 2017-05-31 广州酷狗计算机科技有限公司 即时通讯方法、装置及系统
CN109067973A (zh) * 2018-06-27 2018-12-21 维沃移动通信有限公司 一种语音信息播放方法及移动终端

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104918121A (zh) * 2014-03-13 2015-09-16 阿里巴巴集团控股有限公司 媒体流播放控制方法及客户端
WO2017186015A1 (zh) * 2016-04-26 2017-11-02 厦门幻世网络科技有限公司 一种为视听化数字媒体配音的方法及装置
CN109407944A (zh) * 2018-09-29 2019-03-01 传线网络科技(上海)有限公司 多媒体资源播放调节方法及装置
CN109768913A (zh) * 2018-12-11 2019-05-17 平安科技(深圳)有限公司 信息处理方法、装置、计算机设备及存储介质
CN109379497A (zh) * 2018-12-28 2019-02-22 努比亚技术有限公司 语音信息播放方法、移动终端及计算机可读存储介质
CN110365574A (zh) * 2019-05-24 2019-10-22 珠海格力电器股份有限公司 一种语音信息的播放方法、装置及存储介质

Also Published As

Publication number Publication date
CN110365574A (zh) 2019-10-22

Similar Documents

Publication Publication Date Title
WO2020238186A1 (zh) 语音信息的播放方法、装置及存储介质
US11227124B2 (en) Context-aware human-to-computer dialog
US11790114B2 (en) Threshold-based assembly of automated assistant responses
US10102854B2 (en) Dialog system with automatic reactivation of speech acquiring mode
US10553209B2 (en) Systems and methods for hands-free notification summaries
KR101617665B1 (ko) 핸즈-프리 상호작용을 위한 자동 적응식 사용자 인터페이스
US10255921B2 (en) Managing dialog data providers
RU2637874C2 (ru) Генерирование диалоговых рекомендаций для чатовых информационных систем
CN107113222B (zh) 基于环境的主动聊天信息系统
US20200210649A1 (en) Transitioning between prior dialog contexts with automated assistants
US11836180B2 (en) System and management of semantic indicators during document presentations
US20140095171A1 (en) Systems and methods for providing a voice agent user interface
BR102012024861A2 (pt) uso de informações de contexto para facilitar o processamento de comandos em um assistente virtual
WO2020221105A1 (zh) 一种语音短消息的处理方法、设备及介质
KR101891496B1 (ko) 사용자간 대화 세션에 대한 능동적 모니터링 및 개입을 제공하는 대화형 ai 에이전트 시스템, 방법 및 컴퓨터 판독가능 기록 매체
KR101834624B1 (ko) 핸즈 프리 상호작용을 위한 사용자 인터페이스 자동 적응
US20100241429A1 (en) Systems And Methods For Punctuating Voicemail Transcriptions
US10431216B1 (en) Enhanced graphical user interface for voice communications
WO2018040040A1 (zh) 消息通信方法及装置
CN106899486B (zh) 一种消息显示方法及装置
CN110311858A (zh) 一种发送会话消息的方法与设备
KR101914583B1 (ko) 보안 등과 관련된 서비스를, 사용자간 대화 세션에 대한 모니터링에 기초하고 대화 세션 또는 별도의 세션을 통해, 능동적으로 제공하는 대화형 ai 에이전트 시스템, 방법 및 컴퓨터 판독가능 기록 매체
WO2023246275A1 (zh) 语音消息的播放方法、装置、终端及存储介质
US20110263228A1 (en) Pre-recorded voice responses for portable communication devices
KR102017544B1 (ko) 메신저 플랫폼에 관계없이 복수의 메신저를 이용하는 사용자간 다양한 형식의 채팅 서비스를 제공하는 대화형 ai 에이전트 시스템, 방법 및 컴퓨터 판독가능 기록 매체

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 19931440

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 19931440

Country of ref document: EP

Kind code of ref document: A1