WO2019205119A1 - Voice playback method and device, and client - Google Patents

Voice playback method and device, and client Download PDF

Info

Publication number
WO2019205119A1
WO2019205119A1 PCT/CN2018/085027 CN2018085027W WO2019205119A1 WO 2019205119 A1 WO2019205119 A1 WO 2019205119A1 CN 2018085027 W CN2018085027 W CN 2018085027W WO 2019205119 A1 WO2019205119 A1 WO 2019205119A1
Authority
WO
WIPO (PCT)
Prior art keywords
voice
objects
progress bar
play
playback progress
Prior art date
Application number
PCT/CN2018/085027
Other languages
French (fr)
Chinese (zh)
Inventor
陈飞
杨磊
廖彬彬
Original Assignee
海能达通信股份有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 海能达通信股份有限公司 filed Critical 海能达通信股份有限公司
Priority to PCT/CN2018/085027 priority Critical patent/WO2019205119A1/en
Publication of WO2019205119A1 publication Critical patent/WO2019205119A1/en

Links

Images

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/26Speech to text systems
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L17/00Speaker identification or verification techniques

Definitions

  • the present application relates to the field of voice processing technologies, and in particular, to a voice playing method, apparatus, and client.
  • voice playback functions (which can be understood as: pre-storing voices, and then playing back stored voices when needed) are realized, and are widely used in communication, education, scientific research, and the like. field.
  • the voice playback function needs to determine the voice of a specified object in a certain segment of speech, so that the voice of the specified object cannot be quickly located, and the voice playback efficiency is low.
  • the embodiment of the present application provides a voice playing method, device, and client, so as to improve the playback speed of a specified object, thereby improving the efficiency of voice playback.
  • the technical solution is as follows:
  • a voice playing method includes:
  • the voice play segment of each of the voice objects is marked on the voice playback progress bar, and includes:
  • the voice play segments of each of the voice objects are marked on the voice playback progress bar in different shading patterns, and the voice object identifiers are marked on the voice play segments of the respective voice objects.
  • the at least one voice play segment is selected from the voice play segments of the voice objects marked on the voice playback progress bar to perform voice play, including:
  • the voice play segments of the respective voice objects marked on the voice playback progress bar are sequentially played in a voice play.
  • the at least one voice play segment is selected from the voice play segments of the voice objects marked on the voice playback progress bar, and before the voice play is performed, the method further includes:
  • the unselected voice play segments are displayed in a gray color.
  • the obtaining a voice packet of each voice object from the server includes:
  • the method further includes:
  • the voice, the voice object identity, the start time and the end time of each voice object are encapsulated into voice packets, and sent to the server for storage.
  • a voice playback device includes:
  • a first obtaining module configured to obtain a voice packet from a server
  • a generating module configured to generate a voice playback progress bar according to voice, voice object identity, voice start time, and voice end time of each voice object in the voice packet;
  • a marking module configured to mark a voice playing segment of each of the voice objects on the voice playback progress bar
  • the playing module is configured to select at least one voice playing segment from the voice playing segments of each of the voice objects marked on the voice playback progress bar to perform voice playback.
  • the marking module comprises:
  • a first marking unit configured to mark a voice playing segment of each of the voice objects in different colors on the voice playback progress bar, and mark a voice object identifier on a voice playing segment of each of the voice objects;
  • a second marking unit configured to mark a voice playing segment of each of the voice objects in different shading patterns on the voice playback progress bar, and mark a voice object on a voice playing segment of each of the voice objects logo.
  • the playing module includes:
  • the first playing unit is configured to perform voice playback on the voice playing segments of each of the voice objects marked on the voice playback progress bar in sequence.
  • the marking module further comprises:
  • a display unit configured to perform a gray color display on the unselected voice play segment of the voice play segments of each of the voice objects marked on the voice playback progress bar.
  • the first acquiring module includes:
  • a sending unit configured to send a voice packet request to the server
  • a receiving unit configured to receive a voice packet returned by the server in response to the voice packet request.
  • the device further comprises:
  • a second acquiring module configured to acquire voices of each of the voice objects, and record start time and end time of voices of each of the voice objects;
  • the sending module is configured to encapsulate the voice, the voice object identity, the start time and the end time of each voice object into a voice packet, and send the message to the server for storage.
  • a client comprising: a processor, a memory, and a data bus, wherein the processor and the memory communicate via the data bus;
  • the memory is configured to store a program
  • the processor configured to execute the program
  • the program when executed by the processor, implements the following method steps:
  • the voice play segments of the respective voice objects may be marked on the voice playback progress bar, and at least one voice play is selected from the voice play segments of the respective voice objects marked on the voice playback progress bar.
  • the selected voice playing end can be a voice playing segment of the specified object, and then the voice of the specified object can be directly played, and the voice of the specified object is not required to be heard from the beginning of the voice, and the voice of the specified object is improved.
  • the playback speed which in turn improves the efficiency of voice playback.
  • FIG. 1 is a flow chart of a voice playing method provided by the present application.
  • FIG. 2 is a schematic structural diagram of a voice playback progress bar provided by the present application.
  • FIG. 3 is another flow chart of a voice playing method provided by the present application.
  • FIG. 5 is a schematic diagram of a logical structure of a voice playback apparatus provided by the present application.
  • the embodiment of the present application discloses a voice playing method, which generates a voice packet of each voice object from a server, and generates a voice object identity identifier, a voice start time, and a voice end time according to voice packets of each voice object.
  • a speech playback progress bar and marking a voice play segment of each of the voice objects on the voice playback progress bar, and playing a voice corresponding to a voice playback progress bar of the voice play segment marked with each of the voice objects To achieve the playback of voice.
  • the voice playback method disclosed in the embodiment of the present application is introduced. Referring to FIG. 1, the method may include:
  • Step S11 Obtain a voice packet from the server.
  • the server is configured to store voice packets, which can reduce the occupation of the client memory and reduce the running load of the client.
  • the voice packet may include at least one voice object voice, a voice object identity, a voice start time, and a voice end time.
  • the voice object identity can be used for identifying the voice object identity.
  • Step S12 Generate a voice playback progress bar according to the voice, voice object identity, voice start time, and voice end time of each voice object in the voice packet, and mark each voice object on the voice playback progress bar. Voice playback segment.
  • Generating a voice playback progress bar according to the voice object identity, the voice start time, and the voice end time in the voice packets of the voice object which can be understood as: according to the voice object identity in the voice packet of each voice object,
  • the voice start time and the voice end time generate a voice playback progress bar in the form of a time axis.
  • the voice play segments of the voice objects are marked on the voice playback progress bar, and the voice time segments of the voice objects can be displayed intuitively, so as to quickly locate the voice of the voice object of interest. Position, improve the efficiency of voice playback.
  • Step S13 Select at least one voice play segment from the voice play segments of each of the voice objects marked on the voice playback progress bar to perform voice play.
  • the voice play segments of the respective voice objects may be marked on the voice playback progress bar, and at least one voice play is selected from the voice play segments of the respective voice objects marked on the voice playback progress bar.
  • the selected voice playing end can be a voice playing segment of the specified object, and then the voice of the specified object can be directly played, and the voice of the specified object is not required to be heard from the beginning of the voice, and the voice of the specified object is improved.
  • the playback speed which in turn improves the efficiency of voice playback.
  • the voice play segment of each of the voice objects is marked on the voice playback progress bar, and specifically includes:
  • a voice play segment of each of the voice objects is marked in a different color on the voice playback progress bar, and a voice object identifier is marked on a voice play segment of each of the voice objects.
  • the voice play segment marks of different voice objects have different colors, and are used to distinguish voice play segments of different voice objects, so that the voice time segments of each voice object are displayed very intuitively.
  • the voice playback segment of the different voice objects in the voice playback progress bar has different colors, and the voice segment of each voice object is marked with a voice object identifier.
  • the voice segment of the voice object A is marked with a person A.
  • the voice play segment of the voice object B is marked with a person B
  • the voice play segment of the voice object C is marked with a person C.
  • another embodiment of the voice play segment of each of the voice objects is marked on the voice playback progress bar, and specifically includes:
  • the voice play segments of each of the voice objects are marked on the voice playback progress bar in different shading patterns, and the voice object identifiers are marked on the voice play segments of the respective voice objects.
  • the voice play segment marks of different voice objects have different shading patterns, and are used to distinguish voice play segments of different voice objects, so as to display the voice time segments of each voice object very intuitively.
  • the manner in which the voice play segments of each of the voice objects are marked on the voice playback progress bar is not limited to the above-mentioned manners marked with different colors and marked with different shading patterns, and any difference may be distinguished.
  • the marking manner of the voice playing segment of the voice object needs to be protected by the present invention.
  • the voice play segments of each of the voice objects may be marked in a combination of a color and a shading pattern. Specifically, the voice play segments of different voice objects are marked with different colors and the same shading pattern. Or, the voice segments of different voice objects are marked with the same color and different shading patterns; or, the voice segments of different voice objects are marked with different colors and different shading patterns.
  • At least one voice play segment is selected from the voice play segments of the voice objects marked on the voice playback progress bar, and the voice play is performed for introduction.
  • the voice play segments of the respective voice objects marked on the voice playback progress bar are sequentially played in a voice play.
  • the voice play segment of each of the voice objects marked on the voice playback progress bar is played in sequence, and can be understood as: a full voice play mode, that is, a voice of all voice objects is played.
  • another implementation manner of performing voice playback is performed on at least one voice play segment of the voice play segments of each of the voice objects marked on the voice playback progress bar.
  • Introduction specifically can include:
  • a voice play segment of the specified voice object is selected for voice play.
  • a voice play segment of the specified voice object to perform voice play, which can be understood as: playing only the voice play segment of the specified voice object Voice, which automatically skips the voice playback segment of a non-specified voice object.
  • the user can drag the scroll bar on the voice playback progress bar to the voice play segment of the specified voice object, and the client correspondingly specifies the voice object on the voice playback progress bar marked with the voice play segment of each of the voice objects.
  • the voice corresponding to the voice playback segment is played.
  • the method may include:
  • Step S21 Obtain a voice packet from the server.
  • Step S22 Generate a voice playback progress bar according to the voice, voice object identity, voice start time, and voice end time of each voice object in the voice packet, and mark each voice object on the voice playback progress bar. Voice playback segment.
  • the steps S21-S22 are the same as the steps S11-S12 in the foregoing embodiment, and the detailed process of the steps S21-S22 can be referred to the related description of the steps S11-S12, and details are not described herein again.
  • step S23 in the voice play segment of each of the voice objects marked on the voice playback progress bar, the unselected voice play segment is displayed in a gray color.
  • the unselected voice play segment is displayed in a gray color, and the time period of the voice to be played can be displayed more intuitively.
  • Step S24 Select at least one voice play segment from the voice play segments of each of the voice objects marked on the voice playback progress bar to perform voice play.
  • step S24 refer to the voice play segment of each of the voice objects marked on the voice playback progress bar in the foregoing embodiment, and select at least one voice play segment to perform related introduction of voice play. Let me repeat.
  • the voice packet of each voice object is obtained from the server, and specifically includes:
  • A1 Send a voice packet request to the server.
  • a voice packet request may be sent to the server according to time and/or voice object to request a time and/or a voice packet corresponding to the voice object.
  • A2 Receive a voice packet returned by the server in response to the voice packet request.
  • the method may include:
  • Step S31 Acquire the voices of the respective voice objects, and record the start time and the end time of the voices of the respective voice objects.
  • the client acquires the voice of each voice object, and records the start time and the end time of the voice of each voice object.
  • step S32 the voice, the voice object identity, the start time and the end time of each voice object are encapsulated into voice packets, and sent to the server for storage.
  • Step S33 Obtain a voice packet from the server.
  • Step S34 Generate a voice playback progress bar according to the voice, voice object identity, voice start time, and voice end time of each voice object in the voice packet, and mark each voice object on the voice playback progress bar. Voice playback segment.
  • Step S35 Select at least one voice play segment from the voice play segments of each of the voice objects marked on the voice playback progress bar to perform voice play.
  • the steps S33-S35 are the same as the steps S11-S13 in the foregoing embodiment, and the detailed process of the steps S33-S35 can be referred to the related description of the steps S11-S13, and details are not described herein again.
  • the voice playback device described below and the voice playback method described above can be referred to each other.
  • the voice playback apparatus includes: a first acquisition module 11, a generation module 12, a marking module 13, and a playback module 14.
  • the first obtaining module 11 is configured to obtain a voice packet from a server.
  • the generating module 12 is configured to generate a voice playback progress bar according to the voice, the voice object identity, the voice start time, and the voice end time of each voice object in the voice packet.
  • the marking module 13 is configured to mark a voice playing segment of each of the voice objects on the voice playback progress bar.
  • the playing module 14 is configured to select at least one voice playing segment from the voice playing segments of the voice objects marked on the voice playback progress bar to perform voice playing.
  • the marking module 13 may include: a first marking unit or a second marking unit.
  • the first marking unit is configured to mark the voice playing segments of each of the voice objects in different colors on the voice playback progress bar, and mark the voice object identifiers on the voice playing segments of the voice objects.
  • a second marking unit configured to mark the voice playing segments of each of the voice objects in different shading patterns on the voice playback progress bar, and mark the voice object identifiers on the voice playing segments of the voice objects.
  • the playing module 14 may include: a first playing unit or a second playing unit.
  • the first playing unit is configured to perform voice playback on the voice playing segments of each of the voice objects marked on the voice playback progress bar in sequence.
  • a second playing unit configured to select a voice playing segment of the specified voice object from the voice playing segment of each of the voice objects marked on the voice playback progress bar, and perform voice playing.
  • the marking module 13 may further include:
  • a display unit configured to perform a gray color display on the unselected voice play segment of the voice play segments of each of the voice objects marked on the voice playback progress bar.
  • the first obtaining module 11 may include: a sending unit and a receiving unit.
  • a sending unit configured to send a voice packet request to the server.
  • a receiving unit configured to receive a voice packet returned by the server in response to the voice packet request.
  • the voice playback device may further include: a second acquiring module and a sending module.
  • a second acquiring module configured to acquire voices of each of the voice objects, and record a start time and an end time of voices of the voice objects.
  • the sending module is configured to encapsulate the voice, the voice object identity, the start time and the end time of each voice object into a voice packet, and send the message to the server for storage.
  • a client in another embodiment, includes a processor, a memory, and a data bus, the processor and the memory being in communication over the data bus.
  • the memory is used to store a program.
  • the processor is configured to execute the program.
  • the program when executed by the processor, implements the following method steps:
  • the present application can be implemented by means of software plus a necessary general hardware platform. Based on such understanding, the technical solution of the present application may be embodied in the form of a software product in essence or in the form of a software product, which may be stored in a storage medium such as a ROM/RAM or a disk. , an optical disk, etc., includes instructions for causing a computer device (which may be a personal computer, server, or network device, etc.) to perform the methods described in various embodiments of the present application or portions of the embodiments.
  • a computer device which may be a personal computer, server, or network device, etc.

Landscapes

  • Engineering & Computer Science (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Computational Linguistics (AREA)
  • Telephonic Communication Services (AREA)

Abstract

The present application provides a voice playback method and device, and a client. The voice playback method comprises: obtaining a voice packet from a server; generating a voice playback progress bar according to the voice of each voice object, a voice object identifier, the voice start time, and the voice end time in the voice packet, and marking voice playback segments of each voice object on the voice playback progress bar; and selecting at least one voice playback segment from the voice playback segments of each voice object marked on the voice playback progress bar for voice playback. In the present application, by means of the manners above, the voice playback speed of the specified object is improved, and thus the voice playback efficiency is improved.

Description

一种语音播放方法、装置及客户端Voice playing method, device and client 技术领域Technical field
本申请涉及语音处理技术领域,特别涉及一种语音播放方法、装置及客户端。The present application relates to the field of voice processing technologies, and in particular, to a voice playing method, apparatus, and client.
背景技术Background technique
随着电子计算机和计算机网络的发展,语音回放功能(可以理解为:预先对语音进行存储,等需要的时候再对存储的语音进行播放)得以实现,并广泛应用在通信、教育、科研等诸多领域。With the development of electronic computers and computer networks, voice playback functions (which can be understood as: pre-storing voices, and then playing back stored voices when needed) are realized, and are widely used in communication, education, scientific research, and the like. field.
但是,目前的语音回放功能在需要确定某一段语音中指定对象的语音时,需要采用从头听到尾的方式,导致无法快速定位指定对象的语音,语音回放效率低。However, when the voice playback function needs to determine the voice of a specified object in a certain segment of speech, the method of listening to the tail from the beginning is required, so that the voice of the specified object cannot be quickly located, and the voice playback efficiency is low.
发明内容Summary of the invention
为解决上述技术问题,本申请实施例提供一种语音播放方法、装置及客户端,以达到提高指定对象的语音的回放速度,进而提高语音回放的效率的目的,技术方案如下:To solve the above technical problem, the embodiment of the present application provides a voice playing method, device, and client, so as to improve the playback speed of a specified object, thereby improving the efficiency of voice playback. The technical solution is as follows:
一种语音播放方法,包括:A voice playing method includes:
从服务器中获取语音包;Get a voice packet from the server;
根据所述语音包中各个语音对象的语音、语音对象身份标识、语音开始时间及语音结束时间,生成语音回放进度条,并在所述语音回放进度条上标记出各个所述语音对象的语音播放段;Generating a voice playback progress bar according to the voice, voice object identity, voice start time, and voice end time of each voice object in the voice packet, and marking the voice play of each of the voice objects on the voice playback progress bar. segment;
从所述语音回放进度条上标记出的各个所述语音对象的语音播放段中,选取至少一个语音播放段,进行语音播放。And selecting at least one voice play segment from the voice play segments of each of the voice objects marked on the voice playback progress bar to perform voice play.
优选的,所述在所述语音回放进度条上标记出各个所述语音对象的语音播放段,包括:Preferably, the voice play segment of each of the voice objects is marked on the voice playback progress bar, and includes:
在所述语音回放进度条上以不同颜色标记出各个所述语音对象的语音播 放段,并在各个所述语音对象的语音播放段上标记出语音对象标识;And displaying a voice play segment of each of the voice objects in different colors on the voice playback progress bar, and marking a voice object identifier on a voice play segment of each of the voice objects;
或,在所述语音回放进度条上以不同底纹图案标记出各个所述语音对象的语音播放段,并在各个所述语音对象的语音播放段上标记出语音对象标识。Or, the voice play segments of each of the voice objects are marked on the voice playback progress bar in different shading patterns, and the voice object identifiers are marked on the voice play segments of the respective voice objects.
优选的,所述从所述语音回放进度条上标记出的各个所述语音对象的语音播放段中,选取至少一个语音播放段,进行语音播放,包括:Preferably, the at least one voice play segment is selected from the voice play segments of the voice objects marked on the voice playback progress bar to perform voice play, including:
按照顺序对所述语音回放进度条上标记出的各个所述语音对象的语音播放段,进行语音播放。The voice play segments of the respective voice objects marked on the voice playback progress bar are sequentially played in a voice play.
优选的,所述从所述语音回放进度条上标记出的各个所述语音对象的语音播放段中,选取至少一个语音播放段,进行语音播放之前,还包括:Preferably, the at least one voice play segment is selected from the voice play segments of the voice objects marked on the voice playback progress bar, and before the voice play is performed, the method further includes:
对所述语音回放进度条上标记出的各个所述语音对象的语音播放段中,未被选取的语音播放段,进行灰暗颜色显示。In the voice play segments of the respective voice objects marked on the voice playback progress bar, the unselected voice play segments are displayed in a gray color.
优选的,所述从服务器中获取各个语音对象的语音包,包括:Preferably, the obtaining a voice packet of each voice object from the server includes:
向所述服务器发送语音包请求;Sending a voice packet request to the server;
接收所述服务器响应所述语音包请求返回的语音包。Receiving a voice packet returned by the server in response to the voice packet request.
优选的,所述从服务器中获取语音包之前,还包括:Preferably, before the obtaining the voice packet from the server, the method further includes:
获取各个所述语音对象的语音,并记录各个所述语音对象的语音的开始时间和结束时间;Obtaining voices of each of the voice objects, and recording start and end times of voices of the respective voice objects;
将各个所述语音对象的语音、语音对象身份标识、语音的开始时间及结束时间封装成语音包,发送至所述服务器进行保存。The voice, the voice object identity, the start time and the end time of each voice object are encapsulated into voice packets, and sent to the server for storage.
一种语音播放装置,包括:A voice playback device includes:
第一获取模块,用于从服务器中获取语音包;a first obtaining module, configured to obtain a voice packet from a server;
生成模块,用于根据所述语音包中各个语音对象的语音、语音对象身份标识、语音开始时间及语音结束时间,生成语音回放进度条;a generating module, configured to generate a voice playback progress bar according to voice, voice object identity, voice start time, and voice end time of each voice object in the voice packet;
标记模块,用于在所述语音回放进度条上标记出各个所述语音对象的语音播放段;a marking module, configured to mark a voice playing segment of each of the voice objects on the voice playback progress bar;
播放模块,用于从所述语音回放进度条上标记出的各个所述语音对象的语音播放段中,选取至少一个语音播放段,进行语音播放。The playing module is configured to select at least one voice playing segment from the voice playing segments of each of the voice objects marked on the voice playback progress bar to perform voice playback.
优选的,所述标记模块包括:Preferably, the marking module comprises:
第一标记单元,用于在所述语音回放进度条上以不同颜色标记出各个所述语音对象的语音播放段,并在各个所述语音对象的语音播放段上标记出语音对 象标识;a first marking unit, configured to mark a voice playing segment of each of the voice objects in different colors on the voice playback progress bar, and mark a voice object identifier on a voice playing segment of each of the voice objects;
或,第二标记单元,用于在所述语音回放进度条上以不同底纹图案标记出各个所述语音对象的语音播放段,并在各个所述语音对象的语音播放段上标记出语音对象标识。Or a second marking unit, configured to mark a voice playing segment of each of the voice objects in different shading patterns on the voice playback progress bar, and mark a voice object on a voice playing segment of each of the voice objects Logo.
优选的,所述播放模块包括:Preferably, the playing module includes:
第一播放单元,用于按照顺序对所述语音回放进度条上标记出的各个所述语音对象的语音播放段,进行语音播放。The first playing unit is configured to perform voice playback on the voice playing segments of each of the voice objects marked on the voice playback progress bar in sequence.
优选的,所述标记模块还包括:Preferably, the marking module further comprises:
显示单元,用于对所述语音回放进度条上标记出的各个所述语音对象的语音播放段中,未被选取的语音播放段,进行灰暗颜色显示。And a display unit, configured to perform a gray color display on the unselected voice play segment of the voice play segments of each of the voice objects marked on the voice playback progress bar.
优选的,所述第一获取模块包括:Preferably, the first acquiring module includes:
发送单元,用于向所述服务器发送语音包请求;a sending unit, configured to send a voice packet request to the server;
接收单元,用于接收所述服务器响应所述语音包请求返回的语音包。And a receiving unit, configured to receive a voice packet returned by the server in response to the voice packet request.
优选的,所述装置还包括:Preferably, the device further comprises:
第二获取模块,用于获取各个所述语音对象的语音,并记录各个所述语音对象的语音的开始时间和结束时间;a second acquiring module, configured to acquire voices of each of the voice objects, and record start time and end time of voices of each of the voice objects;
发送模块,用于将各个所述语音对象的语音、语音对象身份标识、语音的开始时间及结束时间封装成语音包,发送至所述服务器进行保存。The sending module is configured to encapsulate the voice, the voice object identity, the start time and the end time of each voice object into a voice packet, and send the message to the server for storage.
一种客户端,包括:处理器、存储器和数据总线,所述处理器和所述存储器通过所述数据总线通信;A client comprising: a processor, a memory, and a data bus, wherein the processor and the memory communicate via the data bus;
所述存储器,用于存放程序;The memory is configured to store a program;
所述处理器,用于执行所述程序;The processor, configured to execute the program;
所述程序当由所述处理器执行时实现以下方法步骤:The program, when executed by the processor, implements the following method steps:
从服务器中获取语音包;Get a voice packet from the server;
根据所述语音包中各个语音对象的语音、语音对象身份标识、语音开始时间及语音结束时间,生成语音回放进度条,并在所述语音回放进度条上标记出各个所述语音对象的语音播放段;Generating a voice playback progress bar according to the voice, voice object identity, voice start time, and voice end time of each voice object in the voice packet, and marking the voice play of each of the voice objects on the voice playback progress bar. segment;
从所述语音回放进度条上标记出的各个所述语音对象的语音播放段中,选取至少一个语音播放段,进行语音播放。And selecting at least one voice play segment from the voice play segments of each of the voice objects marked on the voice playback progress bar to perform voice play.
与现有技术相比,本申请的有益效果为:Compared with the prior art, the beneficial effects of the present application are:
在本申请中,可以在语音回放进度条上标记出各个语音对象的语音播放段,并从所述语音回放进度条上标记出的各个所述语音对象的语音播放段中,选取至少一个语音播放段,选取的语音播放端可以为指定对象的语音播放段,进而可以直接对指定对象的语音进行播放,不需要对语音从头听到尾来确定指定对象的语音并播放,提高了指定对象的语音的回放速度,进而提高了语音回放的效率。In the present application, the voice play segments of the respective voice objects may be marked on the voice playback progress bar, and at least one voice play is selected from the voice play segments of the respective voice objects marked on the voice playback progress bar. In the segment, the selected voice playing end can be a voice playing segment of the specified object, and then the voice of the specified object can be directly played, and the voice of the specified object is not required to be heard from the beginning of the voice, and the voice of the specified object is improved. The playback speed, which in turn improves the efficiency of voice playback.
附图说明DRAWINGS
为了更清楚地说明本申请实施例中的技术方案,下面将对实施例描述中所需要使用的附图作简单地介绍,显而易见地,下面描述中的附图仅仅是本申请的一些实施例,对于本领域普通技术人员来讲,在不付出创造性劳动性的前提下,还可以根据这些附图获得其他的附图。In order to more clearly illustrate the technical solutions in the embodiments of the present application, the drawings used in the description of the embodiments will be briefly described below. It is obvious that the drawings in the following description are only some embodiments of the present application. Other drawings may also be obtained from those of ordinary skill in the art in view of the drawings.
图1是本申请提供的语音播放方法的一种流程图;1 is a flow chart of a voice playing method provided by the present application;
图2是本申请提供的语音回放进度条的一种结构示意图;2 is a schematic structural diagram of a voice playback progress bar provided by the present application;
图3是本申请提供的语音播放方法的另一种流程图;3 is another flow chart of a voice playing method provided by the present application;
图4是本申请提供的语音播放方法的再一种流程图;4 is still another flowchart of the voice playing method provided by the present application;
图5是本申请提供的语音播放装置的一种逻辑结构示意图。FIG. 5 is a schematic diagram of a logical structure of a voice playback apparatus provided by the present application.
具体实施方式detailed description
下面将结合本申请实施例中的附图,对本申请实施例中的技术方案进行清楚、完整地描述,显然,所描述的实施例仅仅是本申请一部分实施例,而不是全部的实施例。基于本申请中的实施例,本领域普通技术人员在没有做出创造性劳动前提下所获得的所有其他实施例,都属于本申请保护的范围。The technical solutions in the embodiments of the present application are clearly and completely described in the following with reference to the drawings in the embodiments of the present application. It is obvious that the described embodiments are only a part of the embodiments of the present application, and not all of the embodiments. All other embodiments obtained by a person of ordinary skill in the art based on the embodiments of the present application without departing from the inventive scope are the scope of the present application.
本申请实施例公开了一种语音播放方法,通过从服务器中获取各个语音对象的语音包,及根据各个所述语音对象的语音包中的语音对象身份标识、语音开始时间、语音结束时间,生成语音回放进度条,并在所述语音回放进度条上标记出各个所述语音对象的语音播放段,及对标记有各个所 述语音对象的语音播放段的语音回放进度条对应的语音,进行播放,实现语音的播放。The embodiment of the present application discloses a voice playing method, which generates a voice packet of each voice object from a server, and generates a voice object identity identifier, a voice start time, and a voice end time according to voice packets of each voice object. a speech playback progress bar, and marking a voice play segment of each of the voice objects on the voice playback progress bar, and playing a voice corresponding to a voice playback progress bar of the voice play segment marked with each of the voice objects To achieve the playback of voice.
接下来对本申请实施例公开的语音播放方法进行介绍,请参见图1,可以包括:The voice playback method disclosed in the embodiment of the present application is introduced. Referring to FIG. 1, the method may include:
步骤S11、从服务器中获取语音包。Step S11: Obtain a voice packet from the server.
本实施例中,服务器用于存储语音包,可以减少对客户端内存的占用,减轻客户端的运行负担。In this embodiment, the server is configured to store voice packets, which can reduce the occupation of the client memory and reduce the running load of the client.
语音包中可以包括至少一个语音对象的语音、语音对象身份标识、语音开始时间及语音结束时间。其中,语音对象身份标识可以用于语音对象身份的识别。The voice packet may include at least one voice object voice, a voice object identity, a voice start time, and a voice end time. The voice object identity can be used for identifying the voice object identity.
步骤S12、根据所述语音包中各个语音对象的语音、语音对象身份标识、语音开始时间及语音结束时间,生成语音回放进度条,并在所述语音回放进度条上标记出各个所述语音对象的语音播放段。Step S12: Generate a voice playback progress bar according to the voice, voice object identity, voice start time, and voice end time of each voice object in the voice packet, and mark each voice object on the voice playback progress bar. Voice playback segment.
根据各个所述语音对象的语音包中的语音对象身份标识、语音开始时间及语音结束时间,生成语音回放进度条,可以理解为:根据各个所述语音对象的语音包中的语音对象身份标识、语音开始时间、语音结束时间,以时间轴的形式生成语音回放进度条。Generating a voice playback progress bar according to the voice object identity, the voice start time, and the voice end time in the voice packets of the voice object, which can be understood as: according to the voice object identity in the voice packet of each voice object, The voice start time and the voice end time generate a voice playback progress bar in the form of a time axis.
在生成语音回放进度条后,在所述语音回放进度条上标记出各个所述语音对象的语音播放段,可以直观的显示各个语音对象的语音时间段,便于快速定位所关注的语音对象的语音位置,提高语音回放的效率。After the voice playback progress bar is generated, the voice play segments of the voice objects are marked on the voice playback progress bar, and the voice time segments of the voice objects can be displayed intuitively, so as to quickly locate the voice of the voice object of interest. Position, improve the efficiency of voice playback.
步骤S13、从所述语音回放进度条上标记出的各个所述语音对象的语音播放段中,选取至少一个语音播放段,进行语音播放。Step S13: Select at least one voice play segment from the voice play segments of each of the voice objects marked on the voice playback progress bar to perform voice play.
在本申请中,可以在语音回放进度条上标记出各个语音对象的语音播放段,并从所述语音回放进度条上标记出的各个所述语音对象的语音播放段中,选取至少一个语音播放段,选取的语音播放端可以为指定对象的语音播放段,进而可以直接对指定对象的语音进行播放,不需要对语音从头听到尾来确定指定对象的语音并播放,提高了指定对象的语音的回放速度,进而提高了语音回放的效率。In the present application, the voice play segments of the respective voice objects may be marked on the voice playback progress bar, and at least one voice play is selected from the voice play segments of the respective voice objects marked on the voice playback progress bar. In the segment, the selected voice playing end can be a voice playing segment of the specified object, and then the voice of the specified object can be directly played, and the voice of the specified object is not required to be heard from the beginning of the voice, and the voice of the specified object is improved. The playback speed, which in turn improves the efficiency of voice playback.
在本申请的另一个实施例中,对在所述语音回放进度条上标记出各个所述语音对象的语音播放段进行介绍,具体可以包括:In another embodiment of the present application, the voice play segment of each of the voice objects is marked on the voice playback progress bar, and specifically includes:
在所述语音回放进度条上以不同颜色标记出各个所述语音对象的语音播放段,并在各个所述语音对象的语音播放段上标记出语音对象标识。A voice play segment of each of the voice objects is marked in a different color on the voice playback progress bar, and a voice object identifier is marked on a voice play segment of each of the voice objects.
不同语音对象的语音播放段标记的颜色不同,用于区分不同的语音对象的语音播放段,便于非常直观的显示各个语音对象的语音时间段。The voice play segment marks of different voice objects have different colors, and are used to distinguish voice play segments of different voice objects, so that the voice time segments of each voice object are displayed very intuitively.
请参见图2,语音回放进度条中不同语音对象的语音播放段的颜色不同,并且各个语音对象的语音播放段上标记有语音对象标识,如,语音对象A的语音播放段上标记有人员A,语音对象B的语音播放段上标记有人员B,语音对象C的语音播放段上标记有人员C。Referring to FIG. 2, the voice playback segment of the different voice objects in the voice playback progress bar has different colors, and the voice segment of each voice object is marked with a voice object identifier. For example, the voice segment of the voice object A is marked with a person A. The voice play segment of the voice object B is marked with a person B, and the voice play segment of the voice object C is marked with a person C.
在本申请的另一个实施例中,对在所述语音回放进度条上标记出各个所述语音对象的语音播放段的另外一种实施方式进行介绍,具体可以包括:In another embodiment of the present application, another embodiment of the voice play segment of each of the voice objects is marked on the voice playback progress bar, and specifically includes:
在所述语音回放进度条上以不同底纹图案标记出各个所述语音对象的语音播放段,并在各个所述语音对象的语音播放段上标记出语音对象标识。The voice play segments of each of the voice objects are marked on the voice playback progress bar in different shading patterns, and the voice object identifiers are marked on the voice play segments of the respective voice objects.
不同语音对象的语音播放段标记的底纹图案不同,用于区分不同的语音对象的语音播放段,便于非常直观的显示各个语音对象的语音时间段。The voice play segment marks of different voice objects have different shading patterns, and are used to distinguish voice play segments of different voice objects, so as to display the voice time segments of each voice object very intuitively.
需要说明的是,在所述语音回放进度条上标记出各个所述语音对象的语音播放段的实施方式并不局限于上述以不同颜色标记和以不同底纹图案标记的方式,任何可以区分不同语音对象的语音播放段的标记方式均需要被本发明保护。It should be noted that, the manner in which the voice play segments of each of the voice objects are marked on the voice playback progress bar is not limited to the above-mentioned manners marked with different colors and marked with different shading patterns, and any difference may be distinguished. The marking manner of the voice playing segment of the voice object needs to be protected by the present invention.
当然,本实施例可以以颜色和底纹图案结合的方式标记出各个所述语音对象的语音播放段,具体地,不同的语音对象的语音播放段采用不同的颜色和相同的底纹图案进行标记;或,不同的语音对象的语音播放段采用相同的颜色和不同的底纹图案进行标记;或,不同的语音对象的语音播放段采用不同的颜色和不同的底纹图案进行标记。Certainly, in this embodiment, the voice play segments of each of the voice objects may be marked in a combination of a color and a shading pattern. Specifically, the voice play segments of different voice objects are marked with different colors and the same shading pattern. Or, the voice segments of different voice objects are marked with the same color and different shading patterns; or, the voice segments of different voice objects are marked with different colors and different shading patterns.
在本申请的另一个实施例中,对从所述语音回放进度条上标记出的各个所述语音对象的语音播放段中,选取至少一个语音播放段,进行语音播放进行介绍,具体可以包括:In another embodiment of the present application, at least one voice play segment is selected from the voice play segments of the voice objects marked on the voice playback progress bar, and the voice play is performed for introduction.
按照顺序对所述语音回放进度条上标记出的各个所述语音对象的语音播放段,进行语音播放。The voice play segments of the respective voice objects marked on the voice playback progress bar are sequentially played in a voice play.
按照顺序对所述语音回放进度条上标记出的各个所述语音对象的语音播放段,进行语音播放,可以理解为:全语音播放方式,即播放所有语音对象的语音。The voice play segment of each of the voice objects marked on the voice playback progress bar is played in sequence, and can be understood as: a full voice play mode, that is, a voice of all voice objects is played.
在本申请的另一个实施例中,对从所述语音回放进度条上标记出的各个所述语音对象的语音播放段中,选取至少一个语音播放段,进行语音播放的另外一种实施方式进行介绍,具体可以包括:In another embodiment of the present application, another implementation manner of performing voice playback is performed on at least one voice play segment of the voice play segments of each of the voice objects marked on the voice playback progress bar. Introduction, specifically can include:
从所述语音回放进度条上标记出的各个所述语音对象的语音播放段中,选取指定语音对象的语音播放段,进行语音播放。From the voice play segments of each of the voice objects marked on the voice playback progress bar, a voice play segment of the specified voice object is selected for voice play.
从所述语音回放进度条上标记出的各个所述语音对象的语音播放段中,选取指定语音对象的语音播放段,进行语音播放,可以理解为:仅播放指定语音对象的语音播放段对应的语音,自动跳过非指定语音对象的语音播放段。Selecting, from the voice play segment of each of the voice objects marked on the voice playback progress bar, a voice play segment of the specified voice object to perform voice play, which can be understood as: playing only the voice play segment of the specified voice object Voice, which automatically skips the voice playback segment of a non-specified voice object.
仅播放指定语音对象的语音播放段对应的语音,自动跳过非指定语音对象的语音播放段,可以节省时间,进一步提高语音回放效率。Only the voice corresponding to the voice segment of the specified voice object is played, and the voice segment of the non-designated voice object is automatically skipped, which can save time and further improve the voice playback efficiency.
具体地,用户可以拖动语音回放进度条上的滚动条至指定语音对象的语音播放段,客户端则相应的对标记有各个所述语音对象的语音播放段的语音回放进度条上指定语音对象的语音播放段对应的语音,进行播放。Specifically, the user can drag the scroll bar on the voice playback progress bar to the voice play segment of the specified voice object, and the client correspondingly specifies the voice object on the voice playback progress bar marked with the voice play segment of each of the voice objects. The voice corresponding to the voice playback segment is played.
在本申请的另一个实施例中,提供了另外一种语音播放方法,请参见图3,可以包括:In another embodiment of the present application, another voice playing method is provided. Referring to FIG. 3, the method may include:
步骤S21、从服务器中获取语音包。Step S21: Obtain a voice packet from the server.
步骤S22、根据所述语音包中各个语音对象的语音、语音对象身份标识、语音开始时间及语音结束时间,生成语音回放进度条,并在所述语音回放进度条上标记出各个所述语音对象的语音播放段。Step S22: Generate a voice playback progress bar according to the voice, voice object identity, voice start time, and voice end time of each voice object in the voice packet, and mark each voice object on the voice playback progress bar. Voice playback segment.
步骤S21-S22与前述实施例中的步骤S11-S12相同,步骤S21-S22的详细过程可以参见步骤S11-S12的相关介绍,在此不再赘述。The steps S21-S22 are the same as the steps S11-S12 in the foregoing embodiment, and the detailed process of the steps S21-S22 can be referred to the related description of the steps S11-S12, and details are not described herein again.
步骤S23、对所述语音回放进度条上标记出的各个所述语音对象的语音播放段中,未被选取的语音播放段,进行灰暗颜色显示。In step S23, in the voice play segment of each of the voice objects marked on the voice playback progress bar, the unselected voice play segment is displayed in a gray color.
对所述语音回放进度条上标记出的各个所述语音对象的语音播放段中,未被选取的语音播放段,进行灰暗颜色显示,,可以更为直观的显示需要播放的语音的时间段。In the voice play segment of each of the voice objects marked on the voice playback progress bar, the unselected voice play segment is displayed in a gray color, and the time period of the voice to be played can be displayed more intuitively.
步骤S24、从所述语音回放进度条上标记出的各个所述语音对象的语音播放段中,选取至少一个语音播放段,进行语音播放。Step S24: Select at least one voice play segment from the voice play segments of each of the voice objects marked on the voice playback progress bar to perform voice play.
步骤S24的详细过程可以参见前述实施例中从所述语音回放进度条上标记出的各个所述语音对象的语音播放段中,选取至少一个语音播放段,进行语音播放的相关介绍,在此不再赘述。For the detailed process of step S24, refer to the voice play segment of each of the voice objects marked on the voice playback progress bar in the foregoing embodiment, and select at least one voice play segment to perform related introduction of voice play. Let me repeat.
在本申请的另一个实施例中,对从服务器中获取各个语音对象的语音包进行介绍,具体可以包括:In another embodiment of the present application, the voice packet of each voice object is obtained from the server, and specifically includes:
A1、向所述服务器发送语音包请求。A1. Send a voice packet request to the server.
具体地,可以根据时间和/或语音对象向所述服务器发送语音包请求,以请求时间和/或语音对象对应的语音包。Specifically, a voice packet request may be sent to the server according to time and/or voice object to request a time and/or a voice packet corresponding to the voice object.
A2、接收所述服务器响应所述语音包请求返回的语音包。A2. Receive a voice packet returned by the server in response to the voice packet request.
基于前述各个实施例的内容,在本申请的另一个实施例中,提供了另外一种语音播放方法,请参见图4,可以包括:Based on the content of the foregoing various embodiments, in another embodiment of the present application, another voice playing method is provided. Referring to FIG. 4, the method may include:
步骤S31、获取各个语音对象的语音,并记录各个语音对象的语音的开始时间和结束时间。Step S31: Acquire the voices of the respective voice objects, and record the start time and the end time of the voices of the respective voice objects.
在语音对象发起语音时,客户端则获取各个语音对象的语音,并记录各个语音对象的语音的开始时间和结束时间。When the voice object initiates the voice, the client acquires the voice of each voice object, and records the start time and the end time of the voice of each voice object.
步骤S32、将各个所述语音对象的语音、语音对象身份标识、语音的开始时间及结束时间封装成语音包,发送至所述服务器进行保存。In step S32, the voice, the voice object identity, the start time and the end time of each voice object are encapsulated into voice packets, and sent to the server for storage.
步骤S33、从服务器中获取语音包。Step S33: Obtain a voice packet from the server.
步骤S34、根据所述语音包中各个语音对象的语音、语音对象身份标识、语音开始时间及语音结束时间,生成语音回放进度条,并在所述语音回放进度条上标记出各个所述语音对象的语音播放段。Step S34: Generate a voice playback progress bar according to the voice, voice object identity, voice start time, and voice end time of each voice object in the voice packet, and mark each voice object on the voice playback progress bar. Voice playback segment.
步骤S35、从所述语音回放进度条上标记出的各个所述语音对象的语音播放段中,选取至少一个语音播放段,进行语音播放。Step S35: Select at least one voice play segment from the voice play segments of each of the voice objects marked on the voice playback progress bar to perform voice play.
步骤S33-S35与前述实施例中的步骤S11-S13相同,步骤S33-S35的详细过程可以参见步骤S11-S13的相关介绍,在此不再赘述。The steps S33-S35 are the same as the steps S11-S13 in the foregoing embodiment, and the detailed process of the steps S33-S35 can be referred to the related description of the steps S11-S13, and details are not described herein again.
接下来对本申请提供的语音播放装置进行介绍,下文描述的语音播放装置与上文描述的语音播放方法可相互对应参照。Next, the voice playback device provided by the present application will be described. The voice playback device described below and the voice playback method described above can be referred to each other.
请参见图5,其示出了本申请提供的语音播放装置的一种逻辑结构示意图,语音播放装置包括:第一获取模块11、生成模块12、标记模块13和播放模块14。Referring to FIG. 5, a schematic diagram of a logical structure of a voice playback apparatus provided by the present application is shown. The voice playback apparatus includes: a first acquisition module 11, a generation module 12, a marking module 13, and a playback module 14.
第一获取模块11,用于从服务器中获取语音包。The first obtaining module 11 is configured to obtain a voice packet from a server.
生成模块12,用于根据所述语音包中各个语音对象的语音、语音对象身份标识、语音开始时间及语音结束时间,生成语音回放进度条。The generating module 12 is configured to generate a voice playback progress bar according to the voice, the voice object identity, the voice start time, and the voice end time of each voice object in the voice packet.
标记模块13,用于在所述语音回放进度条上标记出各个所述语音对象的语音播放段。The marking module 13 is configured to mark a voice playing segment of each of the voice objects on the voice playback progress bar.
播放模块14,用于从所述语音回放进度条上标记出的各个所述语音对象的语音播放段中,选取至少一个语音播放段,进行语音播放。The playing module 14 is configured to select at least one voice playing segment from the voice playing segments of the voice objects marked on the voice playback progress bar to perform voice playing.
本实施例中,标记模块13可以包括:第一标记单元或第二标记单元。In this embodiment, the marking module 13 may include: a first marking unit or a second marking unit.
第一标记单元,用于在所述语音回放进度条上以不同颜色标记出各个所述语音对象的语音播放段,并在各个所述语音对象的语音播放段上标记出语音对象标识。The first marking unit is configured to mark the voice playing segments of each of the voice objects in different colors on the voice playback progress bar, and mark the voice object identifiers on the voice playing segments of the voice objects.
第二标记单元,用于在所述语音回放进度条上以不同底纹图案标记出各个所述语音对象的语音播放段,并在各个所述语音对象的语音播放段上标记出语音对象标识。And a second marking unit, configured to mark the voice playing segments of each of the voice objects in different shading patterns on the voice playback progress bar, and mark the voice object identifiers on the voice playing segments of the voice objects.
本实施例中,播放模块14可以包括:第一播放单元或第二播放单元。In this embodiment, the playing module 14 may include: a first playing unit or a second playing unit.
第一播放单元,用于按照顺序对所述语音回放进度条上标记出的各个所述语音对象的语音播放段,进行语音播放。The first playing unit is configured to perform voice playback on the voice playing segments of each of the voice objects marked on the voice playback progress bar in sequence.
第二播放单元,用于从所述语音回放进度条上标记出的各个所述语音对象的语音播放段中,选取指定语音对象的语音播放段,进行语音播放。。And a second playing unit, configured to select a voice playing segment of the specified voice object from the voice playing segment of each of the voice objects marked on the voice playback progress bar, and perform voice playing. .
本实施例中,上述标记模块13还可以包括:In this embodiment, the marking module 13 may further include:
显示单元,用于对所述语音回放进度条上标记出的各个所述语音对象的语音播放段中,未被选取的语音播放段,进行灰暗颜色显示。And a display unit, configured to perform a gray color display on the unselected voice play segment of the voice play segments of each of the voice objects marked on the voice playback progress bar.
本实施例中,第一获取模块11可以包括:发送单元和接收单元。In this embodiment, the first obtaining module 11 may include: a sending unit and a receiving unit.
发送单元,用于向所述服务器发送语音包请求。And a sending unit, configured to send a voice packet request to the server.
接收单元,用于接收所述服务器响应所述语音包请求返回的语音包。And a receiving unit, configured to receive a voice packet returned by the server in response to the voice packet request.
本实施例中,上述语音播放装置还可以包括:第二获取模块和发送模块。In this embodiment, the voice playback device may further include: a second acquiring module and a sending module.
第二获取模块,用于获取各个所述语音对象的语音,并记录各个所述语音对象的语音的开始时间和结束时间。And a second acquiring module, configured to acquire voices of each of the voice objects, and record a start time and an end time of voices of the voice objects.
发送模块,用于将各个所述语音对象的语音、语音对象身份标识、语音的开始时间及结束时间封装成语音包,发送至所述服务器进行保存。The sending module is configured to encapsulate the voice, the voice object identity, the start time and the end time of each voice object into a voice packet, and send the message to the server for storage.
在本申请的另一个实施例中,提供了一种客户端,包括:处理器、存储器和数据总线,所述处理器和所述存储器通过所述数据总线通信。In another embodiment of the present application, a client is provided that includes a processor, a memory, and a data bus, the processor and the memory being in communication over the data bus.
所述存储器,用于存放程序。The memory is used to store a program.
所述处理器,用于执行所述程序。The processor is configured to execute the program.
所述程序当由所述处理器执行时实现以下方法步骤:The program, when executed by the processor, implements the following method steps:
从服务器中获取语音包;Get a voice packet from the server;
根据所述语音包中各个语音对象的语音、语音对象身份标识、语音开始时间及语音结束时间,生成语音回放进度条,并在所述语音回放进度条上标记出各个所述语音对象的语音播放段;Generating a voice playback progress bar according to the voice, voice object identity, voice start time, and voice end time of each voice object in the voice packet, and marking the voice play of each of the voice objects on the voice playback progress bar. segment;
从所述语音回放进度条上标记出的各个所述语音对象的语音播放段中,选取至少一个语音播放段,进行语音播放。And selecting at least one voice play segment from the voice play segments of each of the voice objects marked on the voice playback progress bar to perform voice play.
需要说明的是,本说明书中的各个实施例均采用递进的方式描述,每个实施例重点说明的都是与其他实施例的不同之处,各个实施例之间相同相似的部分互相参见即可。对于装置类实施例而言,由于其与方法实施例基本相似,所以描述的比较简单,相关之处参见方法实施例的部分说明即可。It should be noted that each embodiment in the specification is described in a progressive manner, and each embodiment focuses on differences from other embodiments, and the same similar parts between the embodiments are referred to each other. can. For the device type embodiment, since it is basically similar to the method embodiment, the description is relatively simple, and the relevant parts can be referred to the description of the method embodiment.
最后,还需要说明的是,在本文中,诸如第一和第二等之类的关系术语仅仅用来将一个实体或者操作与另一个实体或操作区分开来,而不一定要求或者暗示这些实体或操作之间存在任何这种实际的关系或者顺序。而且,术语“包括”、“包含”或者其任何其他变体意在涵盖非排他性的包含,从而使得包括一系列要素的过程、方法、物品或者设备不仅包括那些要素,而且还包括没有明确列出的其他要素,或者是还包括为这种过程、方法、物品或者设备所固有的要素。在没有更多限制的情况下,由语句“包括一个……”限定的要素,并不排除在包括所述要素的过程、方法、物品或者设备中还存在另外的相同要素。Finally, it should also be noted that in this context, relational terms such as first and second are used merely to distinguish one entity or operation from another entity or operation, and do not necessarily require or imply these entities. There is any such actual relationship or order between operations. Furthermore, the term "comprises" or "comprises" or "comprises" or any other variations thereof is intended to encompass a non-exclusive inclusion, such that a process, method, article, or device that comprises a plurality of elements includes not only those elements but also Other elements, or elements that are inherent to such a process, method, item, or device. An element that is defined by the phrase "comprising a ..." does not exclude the presence of additional equivalent elements in the process, method, item, or device that comprises the element.
为了描述的方便,描述以上装置时以功能分为各种单元分别描述。当然,在实施本申请时可以把各单元的功能在同一个或多个软件和/或硬件中实现。For the convenience of description, the above devices are described separately by function into various units. Of course, the functions of each unit may be implemented in the same software or software and/or hardware when implementing the present application.
通过以上的实施方式的描述可知,本领域的技术人员可以清楚地了解到本申请可借助软件加必需的通用硬件平台的方式来实现。基于这样的理解,本申请的技术方案本质上或者说对现有技术做出贡献的部分可以以软件产品的形式体现出来,该计算机软件产品可以存储在存储介质中,如ROM/RAM、磁碟、光盘等,包括若干指令用以使得一台计算机设备(可以是个人计算机,服务器,或者网络设备等)执行本申请各个实施例或者实施例的某些部分所述的方法。It will be apparent to those skilled in the art from the above description of the embodiments that the present application can be implemented by means of software plus a necessary general hardware platform. Based on such understanding, the technical solution of the present application may be embodied in the form of a software product in essence or in the form of a software product, which may be stored in a storage medium such as a ROM/RAM or a disk. , an optical disk, etc., includes instructions for causing a computer device (which may be a personal computer, server, or network device, etc.) to perform the methods described in various embodiments of the present application or portions of the embodiments.
以上对本申请所提供的一种语音播放方法、装置及客户端进行了详细介绍,本文中应用了具体个例对本申请的原理及实施方式进行了阐述,以上实施例的说明只是用于帮助理解本申请的方法及其核心思想;同时,对于本领域的一般技术人员,依据本申请的思想,在具体实施方式及应用范围上均会有改变之处,综上所述,本说明书内容不应理解为对本申请的限制。The voice playback method, device, and client provided by the present application are described in detail. The principles and implementation manners of the application are described in the specific examples. The description of the above embodiments is only used to help understand the present application. The method of application and its core idea; at the same time, for those of ordinary skill in the art, according to the idea of the present application, there will be changes in the specific implementation manner and application scope. In summary, the content of this specification should not be understood. To limit the application.

Claims (13)

  1. 一种语音播放方法,其特征在于,包括:A voice playing method, comprising:
    从服务器中获取语音包;Get a voice packet from the server;
    根据所述语音包中各个语音对象的语音、语音对象身份标识、语音开始时间及语音结束时间,生成语音回放进度条,并在所述语音回放进度条上标记出各个所述语音对象的语音播放段;Generating a voice playback progress bar according to the voice, voice object identity, voice start time, and voice end time of each voice object in the voice packet, and marking the voice play of each of the voice objects on the voice playback progress bar. segment;
    从所述语音回放进度条上标记出的各个所述语音对象的语音播放段中,选取至少一个语音播放段,进行语音播放。And selecting at least one voice play segment from the voice play segments of each of the voice objects marked on the voice playback progress bar to perform voice play.
  2. 根据权利要求1所述的方法,其特征在于,所述在所述语音回放进度条上标记出各个所述语音对象的语音播放段,包括:The method according to claim 1, wherein the marking a voice play segment of each of the voice objects on the voice playback progress bar comprises:
    在所述语音回放进度条上以不同颜色标记出各个所述语音对象的语音播放段,并在各个所述语音对象的语音播放段上标记出语音对象标识;Marking a voice play segment of each of the voice objects in different colors on the voice playback progress bar, and marking a voice object identifier on a voice play segment of each of the voice objects;
    或,在所述语音回放进度条上以不同底纹图案标记出各个所述语音对象的语音播放段,并在各个所述语音对象的语音播放段上标记出语音对象标识。Or, the voice play segments of each of the voice objects are marked on the voice playback progress bar in different shading patterns, and the voice object identifiers are marked on the voice play segments of the respective voice objects.
  3. 根据权利要求1所述的方法,其特征在于,所述从所述语音回放进度条上标记出的各个所述语音对象的语音播放段中,选取至少一个语音播放段,进行语音播放,包括:The method according to claim 1, wherein the at least one voice play segment is selected from the voice play segments of the voice objects marked on the voice playback progress bar to perform voice play, including:
    按照顺序对所述语音回放进度条上标记出的各个所述语音对象的语音播放段,进行语音播放。The voice play segments of the respective voice objects marked on the voice playback progress bar are sequentially played in a voice play.
  4. 根据权利要求3所述的方法,其特征在于,所述从所述语音回放进度条上标记出的各个所述语音对象的语音播放段中,选取至少一个语音播放段,进行语音播放之前,还包括:The method according to claim 3, wherein said at least one voice play segment is selected from the voice play segments of each of the voice objects marked on the voice playback progress bar, and before the voice play is performed, include:
    对所述语音回放进度条上标记出的各个所述语音对象的语音播放段中,未被选取的语音播放段,进行灰暗颜色显示。In the voice play segments of the respective voice objects marked on the voice playback progress bar, the unselected voice play segments are displayed in a gray color.
  5. 根据权利要求1所述的方法,其特征在于,所述从服务器中获取各个语音对象的语音包,包括:The method according to claim 1, wherein the obtaining a voice packet of each voice object from the server comprises:
    向所述服务器发送语音包请求;Sending a voice packet request to the server;
    接收所述服务器响应所述语音包请求返回的语音包。Receiving a voice packet returned by the server in response to the voice packet request.
  6. 根据权利要求1-5任意一项所述的方法,其特征在于,所述从服务器中获取语音包之前,还包括:The method according to any one of claims 1-5, wherein before the obtaining the voice packet from the server, the method further comprises:
    获取各个所述语音对象的语音,并记录各个所述语音对象的语音的开始时间和结束时间;Obtaining voices of each of the voice objects, and recording start and end times of voices of the respective voice objects;
    将各个所述语音对象的语音、语音对象身份标识、语音的开始时间及结束时间封装成语音包,发送至所述服务器进行保存。The voice, the voice object identity, the start time and the end time of each voice object are encapsulated into voice packets, and sent to the server for storage.
  7. 一种语音播放装置,其特征在于,包括:A voice playback device, comprising:
    第一获取模块,用于从服务器中获取语音包;a first obtaining module, configured to obtain a voice packet from a server;
    生成模块,用于根据所述语音包中各个语音对象的语音、语音对象身份标识、语音开始时间及语音结束时间,生成语音回放进度条;a generating module, configured to generate a voice playback progress bar according to voice, voice object identity, voice start time, and voice end time of each voice object in the voice packet;
    标记模块,用于在所述语音回放进度条上标记出各个所述语音对象的语音播放段;a marking module, configured to mark a voice playing segment of each of the voice objects on the voice playback progress bar;
    播放模块,用于从所述语音回放进度条上标记出的各个所述语音对象的语音播放段中,选取至少一个语音播放段,进行语音播放。The playing module is configured to select at least one voice playing segment from the voice playing segments of each of the voice objects marked on the voice playback progress bar to perform voice playback.
  8. 根据权利要求7所述的装置,其特征在于,所述标记模块包括:The apparatus according to claim 7, wherein said marking module comprises:
    第一标记单元,用于在所述语音回放进度条上以不同颜色标记出各个所述语音对象的语音播放段,并在各个所述语音对象的语音播放段上标记出语音对象标识;a first marking unit, configured to mark a voice playing segment of each of the voice objects in different colors on the voice playback progress bar, and mark a voice object identifier on a voice playing segment of each of the voice objects;
    或,第二标记单元,用于在所述语音回放进度条上以不同底纹图案标记出各个所述语音对象的语音播放段,并在各个所述语音对象的语音播放段上标记出语音对象标识。Or a second marking unit, configured to mark a voice playing segment of each of the voice objects in different shading patterns on the voice playback progress bar, and mark a voice object on a voice playing segment of each of the voice objects Logo.
  9. 根据权利要求7所述的装置,其特征在于,所述播放模块包括:The device according to claim 7, wherein the playing module comprises:
    第一播放单元,用于按照顺序对所述语音回放进度条上标记出的各个所述语音对象的语音播放段,进行语音播放。The first playing unit is configured to perform voice playback on the voice playing segments of each of the voice objects marked on the voice playback progress bar in sequence.
  10. 根据权利要求9所述的装置,其特征在于,所述标记模块还包括:The device according to claim 9, wherein the marking module further comprises:
    显示单元,用于对所述语音回放进度条上标记出的各个所述语音对象的语音播放段中,未被选取的语音播放段,进行灰暗颜色显示。And a display unit, configured to perform a gray color display on the unselected voice play segment of the voice play segments of each of the voice objects marked on the voice playback progress bar.
  11. 根据权利要求7所述的装置,其特征在于,所述第一获取模块包括:The device according to claim 7, wherein the first obtaining module comprises:
    发送单元,用于向所述服务器发送语音包请求;a sending unit, configured to send a voice packet request to the server;
    接收单元,用于接收所述服务器响应所述语音包请求返回的语音包。And a receiving unit, configured to receive a voice packet returned by the server in response to the voice packet request.
  12. 根据权利要求7-11任意一项所述的装置,其特征在于,所述装置还包括:The device of any of claims 7-11, wherein the device further comprises:
    第二获取模块,用于获取各个所述语音对象的语音,并记录各个所述语音对象的语音的开始时间和结束时间;a second acquiring module, configured to acquire voices of each of the voice objects, and record start time and end time of voices of each of the voice objects;
    发送模块,用于将各个所述语音对象的语音、语音对象身份标识、语音的开始时间及结束时间封装成语音包,发送至所述服务器进行保存。The sending module is configured to encapsulate the voice, the voice object identity, the start time and the end time of each voice object into a voice packet, and send the message to the server for storage.
  13. 一种客户端,其特征在于,包括:处理器、存储器和数据总线,所述处理器和所述存储器通过所述数据总线通信;A client, comprising: a processor, a memory, and a data bus, wherein the processor and the memory communicate via the data bus;
    所述存储器,用于存放程序;The memory is configured to store a program;
    所述处理器,用于执行所述程序;The processor, configured to execute the program;
    所述程序当由所述处理器执行时实现以下方法步骤:The program, when executed by the processor, implements the following method steps:
    从服务器中获取语音包;Get a voice packet from the server;
    根据所述语音包中各个语音对象的语音、语音对象身份标识、语音开始时间及语音结束时间,生成语音回放进度条,并在所述语音回放进度条上标记出各个所述语音对象的语音播放段;Generating a voice playback progress bar according to the voice, voice object identity, voice start time, and voice end time of each voice object in the voice packet, and marking the voice play of each of the voice objects on the voice playback progress bar. segment;
    从所述语音回放进度条上标记出的各个所述语音对象的语音播放段中,选取至少一个语音播放段,进行语音播放。And selecting at least one voice play segment from the voice play segments of each of the voice objects marked on the voice playback progress bar to perform voice play.
PCT/CN2018/085027 2018-04-28 2018-04-28 Voice playback method and device, and client WO2019205119A1 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
PCT/CN2018/085027 WO2019205119A1 (en) 2018-04-28 2018-04-28 Voice playback method and device, and client

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/CN2018/085027 WO2019205119A1 (en) 2018-04-28 2018-04-28 Voice playback method and device, and client

Publications (1)

Publication Number Publication Date
WO2019205119A1 true WO2019205119A1 (en) 2019-10-31

Family

ID=68293433

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2018/085027 WO2019205119A1 (en) 2018-04-28 2018-04-28 Voice playback method and device, and client

Country Status (1)

Country Link
WO (1) WO2019205119A1 (en)

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9147393B1 (en) * 2013-02-15 2015-09-29 Boris Fridman-Mintz Syllable based speech processing method
CN105895077A (en) * 2015-11-15 2016-08-24 乐视移动智能信息技术(北京)有限公司 Recording editing method and recording device
CN106128460A (en) * 2016-08-04 2016-11-16 周奇 A kind of record labels method and device
CN106601252A (en) * 2016-10-28 2017-04-26 努比亚技术有限公司 Voice identification device and method

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9147393B1 (en) * 2013-02-15 2015-09-29 Boris Fridman-Mintz Syllable based speech processing method
CN105895077A (en) * 2015-11-15 2016-08-24 乐视移动智能信息技术(北京)有限公司 Recording editing method and recording device
CN106128460A (en) * 2016-08-04 2016-11-16 周奇 A kind of record labels method and device
CN106601252A (en) * 2016-10-28 2017-04-26 努比亚技术有限公司 Voice identification device and method

Similar Documents

Publication Publication Date Title
US20210006404A1 (en) Systems and methods for accessing and controlling media stored remotely
KR102444777B1 (en) Routing messages by message parameter
CN104169913B (en) A kind of picture display method and device, terminal device
US20140344286A1 (en) Method and apparatus for displaying webcast roomss
US20170264973A1 (en) Video playing method and electronic device
TW201520794A (en) Data migration method and device
US20180309705A1 (en) Chat videos
TW201916678A (en) Method and apparatus for displaying conference information
JP2017519406A (en) Network video playback method and apparatus
WO2019227429A1 (en) Method, device, apparatus, terminal, server for generating multimedia content
WO2023116122A1 (en) Subtitle generation method, electronic device, and computer-readable storage medium
CN114638232A (en) Method and device for converting text into video, electronic equipment and storage medium
US20200304627A1 (en) Incoming Voice Calling Method and Terminal
WO2016161922A1 (en) Video file processing method and device
US20190109882A1 (en) System and Method for Assembling and Playing a Composite Audiovisual Program Using Single-Action Content Selection Gestures and Content Stream Generation
WO2019205119A1 (en) Voice playback method and device, and client
CN116309964A (en) Video generation method, device, equipment and storage medium
WO2016131264A1 (en) Method and device for constructing contact information
CN108241711A (en) Song recognition method and device
WO2015180305A1 (en) Content acquisition method, mobile terminal and computer storage medium
CN110493456A (en) A kind of animation playing method, device, terminal device and server
WO2019114133A1 (en) Method and apparatus for altering color of editing page content, terminal, and storage medium
JP2006344026A (en) Client terminal, management server, used program and management program
US20240073269A1 (en) Non-fungible tokens as souvenirs of multimedia communication sessions
CN113992866B (en) Video production method and device

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 18915840

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

32PN Ep: public notification in the ep bulletin as address of the adressee cannot be established

Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC (EPO FORM 1205A DATED 12.03.2021)

122 Ep: pct application non-entry in european phase

Ref document number: 18915840

Country of ref document: EP

Kind code of ref document: A1