CN104038827B - Method and apparatus for multimedia playback - Google Patents

Method and apparatus for multimedia playback Download PDF

Info

Publication number
CN104038827B
CN104038827B CN201410250800.9A CN201410250800A CN104038827B CN 104038827 B CN104038827 B CN 104038827B CN 201410250800 A CN201410250800 A CN 201410250800A CN 104038827 B CN104038827 B CN 104038827B
Authority
CN
China
Prior art keywords
time
position
statement
data
audio data
Prior art date
Application number
CN201410250800.9A
Other languages
Chinese (zh)
Other versions
CN104038827A (en
Inventor
王斌
郑志光
纪东方
Original Assignee
小米科技有限责任公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 小米科技有限责任公司 filed Critical 小米科技有限责任公司
Priority to CN201410250800.9A priority Critical patent/CN104038827B/en
Publication of CN104038827A publication Critical patent/CN104038827A/en
Application granted granted Critical
Publication of CN104038827B publication Critical patent/CN104038827B/en

Links

Classifications

    • GPHYSICS
    • G11INFORMATION STORAGE
    • G11BINFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
    • G11B27/00Editing; Indexing; Addressing; Timing or synchronising; Monitoring; Measuring tape travel
    • G11B27/10Indexing; Addressing; Timing or synchronising; Measuring tape travel
    • G11B27/19Indexing; Addressing; Timing or synchronising; Measuring tape travel by using information detectable on the record carrier
    • G11B27/22Means responsive to presence or absence of recorded information signals
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/01Input arrangements or combined input and output arrangements for interaction between user and computer
    • G06F3/048Interaction techniques based on graphical user interfaces [GUI]
    • G06F3/0484Interaction techniques based on graphical user interfaces [GUI] for the control of specific functions or operations, e.g. selecting or manipulating an object or an image, setting a parameter value or selecting a range
    • G06F3/04842Selection of a displayed object
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/01Input arrangements or combined input and output arrangements for interaction between user and computer
    • G06F3/048Interaction techniques based on graphical user interfaces [GUI]
    • G06F3/0484Interaction techniques based on graphical user interfaces [GUI] for the control of specific functions or operations, e.g. selecting or manipulating an object or an image, setting a parameter value or selecting a range
    • G06F3/04847Interaction techniques to control parameter settings, e.g. interaction with sliders, dials
    • GPHYSICS
    • G11INFORMATION STORAGE
    • G11BINFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
    • G11B19/00Driving, starting, stopping record carriers not specifically of filamentary or web form, or of supports therefor; Control thereof; Control of operating function ; Driving both disc and head
    • G11B19/02Control of operating function, e.g. switching from recording to reproducing
    • G11B19/022Control panels
    • GPHYSICS
    • G11INFORMATION STORAGE
    • G11BINFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
    • G11B27/00Editing; Indexing; Addressing; Timing or synchronising; Monitoring; Measuring tape travel
    • G11B27/005Reproducing at a different information rate from the information rate of recording
    • GPHYSICS
    • G11INFORMATION STORAGE
    • G11BINFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
    • G11B27/00Editing; Indexing; Addressing; Timing or synchronising; Monitoring; Measuring tape travel
    • G11B27/10Indexing; Addressing; Timing or synchronising; Measuring tape travel
    • G11B27/102Programmed access in sequence to addressed parts of tracks of operating record carriers
    • GPHYSICS
    • G11INFORMATION STORAGE
    • G11BINFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
    • G11B27/00Editing; Indexing; Addressing; Timing or synchronising; Monitoring; Measuring tape travel
    • G11B27/10Indexing; Addressing; Timing or synchronising; Measuring tape travel
    • G11B27/102Programmed access in sequence to addressed parts of tracks of operating record carriers
    • G11B27/105Programmed access in sequence to addressed parts of tracks of operating record carriers of operating discs
    • GPHYSICS
    • G11INFORMATION STORAGE
    • G11BINFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
    • G11B27/00Editing; Indexing; Addressing; Timing or synchronising; Monitoring; Measuring tape travel
    • G11B27/10Indexing; Addressing; Timing or synchronising; Measuring tape travel
    • G11B27/19Indexing; Addressing; Timing or synchronising; Measuring tape travel by using information detectable on the record carrier
    • G11B27/28Indexing; Addressing; Timing or synchronising; Measuring tape travel by using information detectable on the record carrier by using information signals recorded by the same method as the main recording
    • GPHYSICS
    • G11INFORMATION STORAGE
    • G11BINFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
    • G11B27/00Editing; Indexing; Addressing; Timing or synchronising; Monitoring; Measuring tape travel
    • G11B27/10Indexing; Addressing; Timing or synchronising; Measuring tape travel
    • G11B27/19Indexing; Addressing; Timing or synchronising; Measuring tape travel by using information detectable on the record carrier
    • G11B27/28Indexing; Addressing; Timing or synchronising; Measuring tape travel by using information detectable on the record carrier by using information signals recorded by the same method as the main recording
    • G11B27/30Indexing; Addressing; Timing or synchronising; Measuring tape travel by using information detectable on the record carrier by using information signals recorded by the same method as the main recording on the same track as the main recording
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N7/00Television systems
    • H04N7/08Systems for the simultaneous or sequential transmission of more than one television signal, e.g. additional information signals, the signals occupying wholly or partially the same frequency band, e.g. by time division
    • H04N7/087Systems for the simultaneous or sequential transmission of more than one television signal, e.g. additional information signals, the signals occupying wholly or partially the same frequency band, e.g. by time division with signal insertion during the vertical blanking interval only
    • H04N7/088Systems for the simultaneous or sequential transmission of more than one television signal, e.g. additional information signals, the signals occupying wholly or partially the same frequency band, e.g. by time division with signal insertion during the vertical blanking interval only the inserted signal being digital
    • H04N7/0884Systems for the simultaneous or sequential transmission of more than one television signal, e.g. additional information signals, the signals occupying wholly or partially the same frequency band, e.g. by time division with signal insertion during the vertical blanking interval only the inserted signal being digital for the transmission of additional display-information, e.g. menu for programme or channel selection
    • H04N7/0885Systems for the simultaneous or sequential transmission of more than one television signal, e.g. additional information signals, the signals occupying wholly or partially the same frequency band, e.g. by time division with signal insertion during the vertical blanking interval only the inserted signal being digital for the transmission of additional display-information, e.g. menu for programme or channel selection for the transmission of subtitles
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N5/00Details of television systems
    • H04N5/76Television signal recording

Abstract

本公开是关于一种多媒体播放方法及装置。 The present disclosure relates to an apparatus and method for playing multimedia. 所述方法包括:获取多媒体的暂停位置之前第一预设时长的音频数据和/或字幕数据;根据所述音频数据和/或字幕数据确定完整语句的语句起始位置;当检测到继续播放所述多媒体的指令或满足继续播放所述多媒体的条件时,根据所述语句起始位置继续播放所述多媒体。 The method comprising: acquiring before paused position display first preset duration of audio data and / or subtitle data; determining a starting position of a complete statement of the statement based on the audio data and / or subtitle data; continue playing when the detected when said multimedia commands or continue to play the multimedia conditions satisfied, continue to play the multimedia according to the statement starting position. 本公开用于使得在暂停后继续播放时用户可以捕捉并理解到完整的语句。 The present disclosure allows for users to capture and understand when to complete statement continued playing after being paused.

Description

多媒体播放方法及装置 Method and apparatus for multimedia playback

技术领域 FIELD

[0001] 本公开涉及多媒体处理技术领域,尤其涉及一种多媒体播放方法及装置。 [0001] The present disclosure relates to multimedia processing technology, and particularly to a method and apparatus for multimedia playback.

背景技术 Background technique

[0002] 相关技术中,在播放视频的时候,经常会出现暂停,包括因为用户主观原因的主动暂停,也包括由于网络的原因,出现短暂的卡顿暂停。 [0002] the related art, when playing video, pause often occur, including the initiative to suspend the user because of subjective reasons, but also because of the network, a brief pause Caton. 由于暂定的时刻机动性比较大,在继续播放的时候,视频里面出现的声音往往是从一个句子的中间开始,甚至从一个字或者词的一半开始,这样不便于人们连续地理解情节。 Since the tentative time mobility is relatively large, the time to continue playing, the sound of the video which appears often start from the middle of a sentence, or even start from half a word or words, so it is not easy to continuously understand the plot.

[0003] 相关技术中,一些多媒体播放软件或网页,在播放过程中关闭软件或网页后,当再次开启软件播放同一视频或重新打开上次关闭的视频网页时,也会采取在暂停位置返回固定时间量的回退播放方式。 After the [0003] the related art, some multimedia playback software, or web pages, web page or close the software during playback, when the software to play the same video or reopen the last page again turned off the video will pause to take in return a fixed position the amount of time rollback playback. 例如,返回的时间值固定设置为5秒,则当中断后重新开启软件或网页,继续播放原视频时,从中断点之前的5秒开始播放,以便用户接续到上次观看的记忆。 For example, the return value is fixed at the time to 5 seconds, then bring up the rear among reopen software or web page, continue playing the original video, before the point of interruption 5 seconds to start playing, so that users continue to remember the last viewed.

[0004] 这种回退播放方式,后退的时间值是预先设定的固定值,虽然给予用户一定回想的时间,但是切入的时间点比较生硬,不够人性化。 [0004] This reverse playback mode, backward time value is a predetermined fixed value, although the user is given a certain time to recall, but more stiff cutting point in time, not human. 因为即使后退5秒,也会出现从一个句子的中间开始继续播放的情况,不利于用户理解完整的语句。 Because even back five seconds, there will be the case from the middle of a sentence beginning to continue playing, is not conducive to the user to understand the full statement.

发明内容 SUMMARY

[0005] 为克服相关技术中存在的问题,本公开实施例提供一种多媒体播放方法及装置。 [0005] In order to overcome the problems in the related art, embodiments of the present disclosure provides a method and apparatus for multimedia playback. [0006]根据本公开实施例的第一方面,提供一种多媒体播放方法,包括: [0006] According to a first aspect of the disclosed embodiment of the present embodiment, there is provided a method for playing multimedia, comprising:

[0007] 获取多媒体的暂停位置之前第一预设时长的音频数据和/或字幕数据; [0007] pause position before obtaining the first preset duration of multimedia audio data and / or subtitle data;

[0008] 根据所述音频数据和/或字幕数据确定完整语句的语句起始位置; [0008] The statement determines the starting position of a complete sentence according to the audio data and / or subtitle data;

[0009] 当检测到继续播放所述多媒体的指令或满足继续播放所述多媒体的条件时,根据所述语句起始位置继续播放所述多媒体。 [0009] When the multimedia playback condition is detected to continue to meet the multimedia instructions or continue playing the continuing playing the multimedia according to the statement starting position.

[0010] 本实施例中,通过分析音频数据和/或字幕数据,确定一句完整语句的语句起始位置,根据确定的语句起始位置继续播放视频或音频,使得在暂停后继续播放时用户可以捕捉并理解到完整的语句,视频或音频里的对话更自然,情节更连续,提高用户对视频或音频播放的体验度。 [0010] In this embodiment, by analyzing audio data and / or subtitle data, determining a starting position of a complete statement statement, continued video or audio playback is determined according to statement starting position, such that when the resume after a user can pause capture and understand the complete statement, video or audio dialogue in a more natural, more continuous plot, improve the user experience for video or audio playback.

[0011] 可选的,所述根据所述音频数据确定完整语句的语句起始位置,包括: [0011] Alternatively, the starting position of the statement is determined based on the complete statement of audio data, comprising:

[0012] 检测所述音频数据中相邻两个音频信号之间的时间间隔; [0012] the time of detecting the audio data in an audio signal between the two adjacent intervals;

[0013] 当相邻两个音频信号之间的时间间隔大于第一预设间隔时长时,确定所述相邻两个音频信号之间的任一时间位置为所述语句起始位置。 [0013] When the length of the interval between the adjacent two time interval is greater than a first predetermined audio signal, determining whether any two audio signals between a time position adjacent to the starting position of the statement.

[0014] 可选的,所述根据所述字幕数据确定完整语句的语句起始位^,包括: [0014] Alternatively, the complete sentence is determined based on the caption sentence data ^ start bit, comprising:

[0015] 获取所述字幕数据中每条字幕的起始显示时间和/或终止显示时间; [0015] The subtitle data acquisition starting time of each subtitle display and / or termination time display;

[0016] 根据所述字幕的起始显示时间和/或终止显示时间确定所述语句起始位置。 [0016] The start time of the subtitle display and / or terminate the display time determining the starting position of the statement.

[0017] 可选的,所述根据所述音频数据和字幕数据确定完整语句的语句起始位置,包括: [0017] Alternatively, the audio data, and subtitle data to determine the starting position of a complete statement sentence, comprising the:

[0018]检测所述音频数据中每个音频信号的播放时间; [0018] The playback time of each audio signal detected in the audio data;

[0019]当相邻两个音频信号之间的时间间隔大于第一预设间隔时长时,获取所述相邻音频信号对应的字幕的起始显示时间和/或终止显示时间; [0019] When the length of the interval between the adjacent two time interval is greater than a first predetermined audio signals, acquires the audio signal corresponding to the adjacent starting subtitle display time and / or termination time display;

[0020] 根据所述相邻两个音频信号的播放时间及所述相邻音频信号对应的字幕的起始显示时间和/或终止显示时间确定所述语句起始位置。 [0020] The audio signal corresponding to the adjacent time subtitle player audio signals and the two adjacent display start time and / or terminating the statements display time determining the starting position.

[0021] 在可选方案中,通过根据相邻音频信号之间的时间间隔或相邻字幕之间的时间间隔确定语句起始位置,使得后续可以根据语句起始位置继续播放首频或视频,用户继续播放时可以捕捉并理解到完整的语句,视频或音频里的对话更自然,情节更连续,提高用户对视频或音频播放的体验度。 [0021] In an alternative embodiment, the starting position is determined by the statement based on the time interval between adjacent audio signal or the time interval between adjacent subtitles, so that subsequent playback may continue in accordance with the first statement or video frequency starting position, users can continue to play catch and understand when to complete statement, video or audio dialogue in a more natural, more continuous plot, improve the user experience for video or audio playback. 另外,同时对音频数据和字幕数据进行分析,确定两个完整语句之间的间隔位置,从而更精确地获得完整语句的起始点,不仅不会影响到用户对语句的理解,也不会影响到用户观看到字幕。 Further, while the audio data and subtitle data are analyzed to determine the position of the interval between two complete sentence, starting to more accurately obtain the complete statement, only the user will not affect the understanding of the statement, it will not affect users watch the subtitles.

[0022] 可选的,所述根据所述音频数据确定完整语句的语句起始位置,包括: [0022] Alternatively, the starting position of the statement is determined based on the complete statement of audio data, comprising:

[0023] 根据人声频率对所述音频数据进行过滤,得到人声音频数据; [0023] filtering the audio data according to frequency of the human voice, the human voice audio data obtained;

[0024]检测所述人声音频数据中相邻两个人声音频信号之间的时间间隔; [0024] The time between the two vocal tone detector of the vocal audio data adjacent interval;

[0025]当相邻两个人声音频信号之间的时间间隔大于所述第一预设间隔时长时,确定所述相邻两个人声音频信号间之间的任一时间位置为所述语句起始位置。 [0025] When the time between two adjacent vocal audio signal is greater than the spacing interval of the first predetermined time duration, determines a position adjacent any one time between the two between the audio signal is a human voice statement from starting position.

[0026]在可选方案中,按照人声通常的频率先对音频数据过滤,从而单纯对人声音频信号进行分析,根据人声音频信号之间的时间间隔确定语句起始位置,使得对语句起始位置的确定更加准确。 [0026] In an alternative embodiment, the first filtering on the audio data according to a normal human voice frequency, so simple to vocal audio signal is analyzed to determine the interval based on the time between the starting position of the statement of the human voice audio signal, such that the statement determine the starting position is more accurate.

[0027]可选的,当根据所述音频数据和/或字幕数据确定出至少两个完整语句的语句起始位置时,所述根据所述语句起始位置继续播放所述多媒体,包括: [0027] Alternatively, when it is determined that the start position of the at least two complete statement statements based on the audio data and / or subtitle data, the initial position according to the statement to continue playing the multimedia, comprising:

[0028]从距离所述暂停位置最近的语句起始位置继续播放所述多媒体;或者 [0028] The distance from the location of the nearest pause statement starting position to continue playing the multimedia; or

[0029]当预设的回退语句数量为N时,从所述暂停位置之前的第N个语句起始位置继续播放所述多媒体,所述N为大于或等于2的整数。 [0029] When the preset number of backoff statement N, continue playing the media from the previous N-th position of the pause statement starting position, wherein N is an integer greater than or equal to 2.

[0030] 在可选方案中,当确定多个语句起始位置时,可以灵活选择其中一个作为暂停后继续播放音视频的起点,使得用户继续播放时可以捕捉并理解到完整的语句,视频或音频里的对话更自然,情节更连续,提高用户对视频或音频播放的体验度。 [0030] In an alternative embodiment, when the plurality of statements determining the starting position, wherein the flexibility to choose to continue after a pause to play back as audio and video, can be captured and understood that such complete statement user continues playing, video or audio dialogue in a more natural, more continuous plot, improve the user experience for video or audio playback.

[0031]可选的,当根据所述多媒体的暂停位置之前第一预设时长内的音频数据和/或字幕数据无法确定完整语句的语句起始位置时,所述方法还包括: [0031] Alternatively, when the first predetermined data length within the audio and / or subtitle data can not be determined prior to the complete statement by statement multimedia pause position the starting position, the method further comprising:

[0032]按照时间从后往前的顺序,获取第一预设时长的音频数据和/或字幕数据,其中, 本次获取的第一预设时长的音频数据和/或字幕数据的播放时间在上一次获取的第一预设时长的音频数据和/或字幕数据的播放时间之前; [0032] in chronological order from the back, access to the first predetermined duration of audio data and / or subtitle data, wherein the playback time, acquired this time the first preset duration of audio data and / or subtitle data before a playback time duration acquired first preset audio data and / or subtitle data;

[0033]从本次获得的该第一预设时长的音频数据和/或字幕数据中确定完整语句的语句起始位置; [0033] determined from a complete sentence when the first predetermined length of audio data obtained this time and / or subtitle data sentence initial position;

[0034]若从本次获得的该第一预设时长的音频数据和/或字幕数据中无法确定完整语句的语句起始位置,则按照时间从后往前的顺序继续向前获取第一预设时长的音频数据和/ 或字幕数据并确定完整语句的语句起始位置,直到确定出至少一个完整语句的语句起始位置。 [0034] When the predetermined duration from the first audio data obtained this time and / or subtitle data can not determine the starting position of a complete statement statement, the first pre continue receiving forward chronological order from the back of the long audio data and / or subtitle data set and determining the starting position of a complete statement statements, statements starting position until it is determined that at least one complete statement.

[0035]可选的,所述获取多媒体的暂停位置之前第一预设时长内的音频数据和/或字幕数据,包括: [0035] Optionally, before obtaining the first preset multimedia pause position within the length of the audio data and / or subtitle data, comprising:

[0036]获取多媒体的暂停位置之前的、且与所述暂停位置间隔第二预设时长的时间位置; [0036] pause position before acquiring multimedia, and long second predetermined time when the pause position spaced locations;

[0037]获取所述时间位置之前第一预设时长内的音频数据和/或字幕数据; [0037] acquire the audio data within the first predetermined length of time and / or subtitle data before the time position;

[0038]所述根据所述音频数据和/或字幕数据确定完整语句的语句起始位置,包括: [0038] The complete sentence is determined based on the audio data and / or subtitle data sentence initial position, comprising:

[0039]根据所述时间位置之前第一预设时长内的音频数据和/或字幕数据,确定完整语句的语句起始位置。 [0039] audio data within a first predetermined duration and / or subtitle data according to the position prior to the time, the start position determination statement statement is complete.

[0040] 在可选方案中,可以先选取到暂停位置前一段时间的时间位置,以该时间位置作为往回寻找完整语句的语句起始位置的起点,使得用户可以获得提供更充裕的进入视频情节的时间。 [0040] In an alternative embodiment, it is possible to select the pause position to the position before the time period to the time position as a starting point for a complete statement statement back start position, so that the user can obtain a more sufficient to enter the video time plot.

[0041] 根据本公开实施例的第二方面,提供一种多媒体播放装置,包括: [0041] According to a second aspect of the disclosed embodiment of the present embodiment, there is provided a multimedia playback device, comprising:

[0042] 获取模块,用于获取多媒体的暂停位置之前第一预设时长的音频数据和/或字幕数据; [0042] acquiring module, for acquiring the pause position prior to a first predetermined duration of multimedia audio data and / or subtitle data;

[0043] 分析模块,用于根据所述获取模块获取的音频数据和/或字幕数据确定完整语句的语句起始位置; [0043] The analysis module for determining a complete sentence according to the data acquisition module acquires audio and / or subtitle data sentence initial position;

[0044]播放模块,用于当检测到继续播放所述多媒体的指令或满足继续播放所述多媒体的条件时,根据所述分析模块确定的语句起始位置继续播放所述多媒体。 [0044] a playing module, for detecting when the resume playback of the multimedia instructions or continue to play the multimedia conditions satisfied, continue playing multimedia according to the statement starting position determined by the analysis module.

[0045] 所述分析模块包括: [0045] The analysis module comprises:

[0046] 检测单元,用于检测所述获取模块获取的所述音频数据中相邻两个音频信号之间的时间间隔; [0046] detection means for detecting a time between the acquisition of two audio signals of the audio data acquisition module adjacent interval;

[0047]分析确定单元,用于当所述检测单元检测到的相邻两个音频信号之间的时间间隔大于第一预设间隔时长时,确定所述相邻两个音频信号之间的任一时间位置为所述语句起始位置。 Between any two audio signals between the audio signal when the time between two adjacent detection means detects the first predetermined time interval is greater than the interval duration, determining the neighboring [0047] Analysis determination means for a statement of the time position of the starting position.

[0048] 所述分析模块包括: [0048] The analysis module comprises:

[0049] 获取单元,用于从所述获取模块获取的所述字幕数据中获取每条字幕的起始显示时间和/或终止显示时间; [0049] acquiring unit, configured to obtain the start time of each subtitle display and / or termination of the subtitle display time from the data acquisition module is acquired;

[0050] 分析确定单元,用于根据所述获取单元获取的所述字幕的起始显示时间和/或终止显示时间确定所述语句起始位置。 [0050] Analysis determination unit configured to start the acquisition unit acquires the subtitle display time and / or terminating the statements display time determining the starting position. 所述分析模块包括: The analysis module comprises:

[0051] 检测单元,用于检测所述获取模块获取的所述音频数据中每个音频信号的播放时间; [0051] detection means for detecting the play time of each audio signal acquiring the acquired audio data module;

[0052] 获取单元,用于当所述检测单元检测的相邻两个音频信号之间的时间间隔大于第一预设间隔时长时,获取所述相邻音频信号对应的字幕的起始显示时间和/或终止显示时间;分析确定单元,用于根据所述获取单元获取的所述相邻两个音频信号的播放时间及所述相邻音频信号对应的字幕的起始显示时间和/或终止显示时间确定所述语句起始位置。 [0052] acquiring unit, the time between when the detection unit is greater than the interval between two adjacent audio signal of the first predetermined length of interval, acquires an audio signal corresponding to the adjacent starting subtitle display time and / or termination of display time; Analytical determination unit configured to start the acquisition unit acquires the playback time of the adjacent two of the audio signal and an audio signal corresponding to said adjacent subtitle display time and / or termination determining the display time statement starting position. [0053] 所述分析模块还包括: [0053] The analysis module further comprises:

[0054] 所述分析模块还包括: [0054] The analysis module further comprises:

[0055] 过滤单元,用于根据人声频率对所述获取模块获取的所述音频数据进行过滤,得到人声音频数据; [0055] The filtering unit for filtering the audio data acquiring module acquires the vocal frequency according to obtain voice audio data;

[0056] 所述检测单元,用于检测所述过滤单元过滤后的所述人声音频数据中相邻两个人声音频信号之间的时间间隔; [0056] The detection means for detecting the time interval between two filter singing voice audio signal from the audio data adjacent filtration unit;

[0057] 所述分析确定单元,用于当所述检测单元检测到的相邻两个人声音频信号之间的时间间隔大于所述第一预设间隔时长时,确定所述相邻两个人声音频信号间之间的任一时间位置为所述语句起始位置。 [0057] The analysis determination unit configured to, when the time between the vocal audio signal detecting means detects the two adjacent intervals is greater than the length of said first predetermined interval, determining the neighboring two voice position at any one time between the audio signals between the starting position of the statement.

[0058] 所述播放模块,用于当所述分析模块确定出至少两个完整语句的语句起始位置时,从距离所述暂停位置最近的语句起始位置继续播放所述多媒体;或者当预设的回退语句数量为N时,从所述暂停位置之前的第N个语句起始位置继续播放所述多媒体,所述N为大于或等于2的整数。 [0058] The playing module when the module is determined for at least two complete statement sentence analyzing the starting position, the distance from the position of the latest pause start position of the statement to continue playing the multimedia; or when the pre- when the set number of backoff statement is N, continue playing the media from the previous N-th position of the pause statement starting position, wherein N is an integer greater than or equal to 2.

[0059]所述获取模块,用于当所述分析模块根据所述多媒体的暂停位置之前第一预设时长内的音频数据和/或字幕数据无法确定完整语句的语句起始位置时,按照时间从后往前的顺序,获取第一预设时长的音频数据和/或字幕数据,其中,本次获取的第一预设时长的音频数据和/或字幕数据的播放时间在上一次获取的第一预设时长的音频数据和/或字幕数据的播放时间之前; [0059] The acquisition module, the analysis module is used when the audio data within a first predetermined duration and / or subtitle data can not be determined prior to complete statement of the multimedia pause position according to statement starting position in time obtaining a first preset length audio data and / or subtitle data in order from the back to the front, wherein this first preset duration acquired audio data and / or subtitle data playback time available on a first primary long time before the playback audio data and / or subtitle data of a predetermined;

[0060] 所述分析模块,用于从所述获取模块本次获得的该第一预设时长的音频数据和/ 或字幕数据中确定完整语句的语句起始位置;若从本次获得的该第一预设时长的音频数据和/或字幕数据中无法确定完整语句的语句起始位置,则按照时间从后往前的顺序继续向前获取第一预设时长的音频数据和/或字幕数据并确定完整语句的语句起始位置,直到确定出至少一个完整语句的语句起始位置。 [0060] The analysis module for obtaining from said first predetermined duration of the audio data obtained by this module and / or subtitle data, determines the complete statement sentence initial position; if the present time obtained from the first predetermined duration of audio data and / or subtitle data can not determine the complete statement sentence initial position, the first predetermined duration continue receiving audio data and / or subtitle data in chronological order forward from the back of the and to determine the complete statement sentence initial position, until it is determined that the starting position of at least one complete statement statement.

[0061] 所述获取模块,用于获取多媒体的暂停位置之前的、且与所述暂停位置间隔第二预设时长的时间位置;获取所述时间位置之前第一预设时长内的音频数据和/或字幕数据; [0061] The acquisition module configured to acquire multimedia before the paused position, and the second predetermined time duration and the pause position spaced locations; position prior to acquiring the first predetermined time duration and the audio data / or subtitle data;

[0062] 所述分析模块,用于根据所述时间位置之前第一预设时长内的音频数据和/或字幕数据,确定完整语句的语句起始位置。 [0062] The analysis module for audio data within a first predetermined duration and / or subtitle data according to the position prior to the time, the start position determination statement statement is complete.

[0063] 根据本公开实施例的第三方面,提供一种多媒体播放装置,包括: [0063] According to a third aspect of the disclosed embodiment of the present embodiment, there is provided a multimedia playback device, comprising:

[0064] 处理器; [0064] processor;

[0065]用于存储处理器可执行指令的存储器; [0065] processor-executable instructions for storing a memory;

[0066]其中,所述处理器被配置为: [0066] wherein the processor is configured to:

[0067] 获取多媒体的暂停位置之前第一预设时长的音频数据和/或字幕数据; [0067] pause position before obtaining the first preset duration of multimedia audio data and / or subtitle data;

[0068] 根据所述音频数据和/或字幕数据确定完整语句的语句起始位置; [0068] determining the starting position of a complete statement sentence according to the audio data and / or subtitle data;

[0069] 当检测到继续播放所述多媒体的指令或满足继续播放所述多媒体的条件时,根据所述语句起始位置继续播放所述多媒体。 [0069] When the multimedia playback condition is detected to continue to meet the multimedia instructions or continue playing the continuing playing the multimedia according to the statement starting position. _ _

[0070] 应当理解的是,以上的一般描述和后文的细节描述仅是示例性和解释性的,并不能限制本公开。 [0070] It should be understood that both the foregoing general description and the details described hereinafter are merely exemplary and explanatory and are not intended to limit the present disclosure.

附图说明_ BRIEF DESCRIPTION _

[0071] 此处的附图被并入说明书中并构成本说明书的一部分,示出了符合本发明的实施例,并与说明书一起用于解释本发明的原理。 [0071] The accompanying drawings, which are incorporated herein and constitute a part of this specification, illustrate embodiments consistent with the present invention, and together with the description serve to explain the principles of the invention.

[0072] 图1是根据一示例性实施例示出的一种多媒体播放方法的流程图; [0072] FIG. 1 is a flow chart of a method for playing multimedia according to an illustrated exemplary embodiment;

[0073] 图2是根据一示例性实施例示出的暂停位置与语句起始位置的时间轴示意图; [0073] FIG. 2 is a schematic diagram illustrating a pause position and the starting position of the time axis according to the statement to an exemplary embodiment;

[0074] 图3是根据一示例性实施例示出的一种多媒体播放方法的流程图; [0074] FIG. 3 is a flowchart illustrating a multimedia playback method shown according to an exemplary embodiment;

[0075]图4是根据一示例性实施例f出的一种多媒体播放方法的流程图; [0075] FIG. 4 is a flowchart of a method for playing multimedia according to an of f the exemplary embodiment;

[0076]图5是根据一示例性实施例f出的一种多媒体播放方法的流程图; [0076] FIG. 5 is a flow chart of a method for playing multimedia according to an of f the exemplary embodiment;

[0077]图6是根据一示例性实施例f出的一=多媒体播放方法的流程图; [0077] FIG. 6 is a flowchart illustrating a method for playing multimedia = f of an exemplary embodiment according to an exemplary embodiment;

[0078] 图7是根据一示例性实施例^出的暂停位置与语句起始位置的时间轴示意图; [0078] ^ FIG. 7 is a schematic diagram illustrating a pause position and the starting position of the time axis according to the statement to an exemplary embodiment;

[0079] 图8是根据一示例性实施例^出的暂停位置与语句起始位置的时间轴示意图; [0079] ^ FIG. 8 is a schematic diagram illustrating a pause position and the starting position of the time axis according to the statement to an exemplary embodiment;

[0080] 图9是根据一示例性实施例示出的一种多媒体播放方法的流程图; [0080] FIG. 9 is a flowchart illustrating a multimedia playback method of an embodiment according to an exemplary embodiment;

[0081] 图1〇是根据一示例性实施例不出的一种多媒体播放方法的流程图; [0081] FIG 1〇 is a flowchart of a method for playing multimedia according to an exemplary embodiment of no;

[0082] 图丨丨是根据一示例性实施例示出的一种多媒体播放装置的框图; [0082] FIG Shushu is a block diagram showing a multimedia playing apparatus shown according to an exemplary embodiment;

[0083] 图12a是根据一示例性实施例不出的分析模块的框图; [0083] FIG 12a is a block diagram of an exemplary analysis module of the exemplary embodiment not in accordance with embodiments;

[0084] 图以匕是根据另一示例性实施例示出的分析模块的框图; [0084] In FIG dagger is a block diagram illustrating the analysis module according to another exemplary embodiment;

[0085] 图12c是根据另一示例性实施例示出的分析模块的框图; [0085] FIG 12c is a block diagram illustrating the analysis module according to another exemplary embodiment;

[0086] 图I2d是根据另一示例性实施例示出的分析模块的框图; [0086] FIG I2d is a block diagram illustrating the analysis module according to another exemplary embodiment;

[0087] 图13是根据一示例性实施例示出的一种用于多媒体播放的装置1300的框图。 [0087] FIG. 13 is a block diagram of an apparatus 1300 for playing multimedia according to an exemplary embodiment illustrated in the exemplary embodiment.

具体实施方式一_ DETAILED DESCRIPTION _ a

[0088] 这里将详细地对示例性实施例进行说明,其示例表示在附图中。 [0088] The exemplary embodiments herein be described in detail embodiments of which are illustrated in the accompanying drawings. 下面的描述涉及附图时,除非另有表示,不同附图中的相同数字表示相同或相似的要素。 When the following description refers to the accompanying drawings, unless otherwise indicated, the same numbers in different drawings represent the same or similar elements. 以下示例性实施例中所描述的实施方式并不代表与本发明相一致的所有实施方式。 The following exemplary embodiments described in the exemplary embodiments do not represent consistent with all embodiments of the present invention. 相反,它们仅是与如所附权利要求书中所详述的、本发明的一些方面相一致的装置和方法的例子。 Instead, they are only in the book as detailed in the appended claims, some aspects of the present invention, examples of methods and apparatus consistent phase.

[0089] 本公开实施例中的多媒体包括视频、音频等等。 [0089] In this example a multimedia including video, audio, etc. disclosed embodiments. 多媒体播放过程中发生暂停,暂停可以是用户主动触发的,也可能是由于网络原因引起的。 Pause occurs multimedia playback, pause can be triggered by a user initiative, they may be due to network causes. 用户主动的暂停多媒体播放,可以通过操作指令获知。 User actively pause media player, it can be learned by the operation instruction. 由于网络原因的暂停多媒体播放,则可以通过检测视频缓存区中剩余未播放的数据量获知,当后续没有可供播放的视频缓存数据时,视频播放便会中止。 Since the network multimedia playback pause reason, you can not play the rest of the video buffer area by detecting a known amount of data, when there is no follow-up data for playing a video buffer, a video player will be aborted.

[0090] 本公开实施例中,在多媒体播放暂停后,通过分析多媒体中音频数据和/或字幕数据中完整语句的语句起始点,使得对多媒体的继续播放可以从一个完整语句开始,解决固定时间回退播放所导致的影响用户理解语句的问题。 [0090] Example embodiments of the present disclosure, after the multimedia playback is paused, starting by analyzing the sentence multimedia audio data and / or subtitle data in a complete sentence, making it possible to start playback from a multimedia continues complete statement, a fixed time resolved fallback play caused problems for users to understand the impact statement.

[0091] 图1是根据一示例性实施例示出的一种多媒体播放方法的流程图,如图1所示,多媒体播放方法用于终端中,包括以下步骤。 [0091] FIG. 1 is a flow chart of a method for playing multimedia according to an illustrated exemplary embodiment, shown in FIG. 1, a method for playing a multimedia terminal, comprising the following steps.

[0092]在步骤S11中,获取多媒体的暂停位置之前第一预设时长内的音频数据和/或字幕数据。 [0092] In step S11, the multimedia pause position before obtaining audio data within a first predetermined duration and / or subtitle data.

[0093]在步骤S12中,根据音频数据和/或字幕数据确定完整语句的语句起始位置。 [0093] In step S12, the starting position of the statement is determined based on a complete statement of audio data and / or subtitle data.

[0094]在步骤S13中,当检测到继续播放多媒体的指令或满足继续播放多媒体的条件时, 根据语句起始位置继续播放多媒体。 [0094] In step S13, when it is detected to continue playing multimedia instructions or conditions continue playing multimedia satisfied, continue playing multimedia according to statement starting position.

[0095]本实施例中,通过分析音频数据和/或字幕数据,确定一句完整语句的语句起始位置,根据确定的语句起始位置继续播放视频或音频,使得用户继续播放时可以捕捉并理解到完整的语句,视频或音频里的对话更自然,情节更连续,提高用户对视频或音频播放的体验度。 [0095] In this embodiment, by analyzing audio data and / or subtitle data, determining a starting position of a complete statement statement, continued video or audio playback is determined according to statement starting position, so that the user can continue to capture and playback appreciated to complete statement, video or audio dialogue in a more natural, more continuous plot, improve the user experience for video or audio playback.

[0096]按照经验,完整的一句话通常不超过16秒,实际应用时,在步骤S11中,可以设置第一预设时长为16秒。 [0096] according to experience, a complete sentence is usually not more than 16 seconds, the actual application, in step S11, may be set to 16 seconds and the first preset. 例如,图2是根据一示例性实施例示出的暂停位置与语句起始位置的时间轴示意图,如图2所示,用户播放视频时,暂停位置为3分2〇秒处,可以获取暂停位置之前16秒,即3分04秒至3分20秒的音频数据和/或字幕数据,用以在这些数据中确定完整语句的语句起始位置。 For example, FIG. 2 is a sentence pause position and the starting position of the time axis shows the diagram according to an exemplary embodiment, as shown, when the user plays the video, the pause position of 3 minutes 2〇 2 seconds, the pause position can be acquired before 16 seconds, i.e., 3 minutes 04 seconds to 3 minutes and 20 seconds of audio data and / or subtitle data, to determine the complete statement statements in these data start position.

[0097]图3是根据一示例性实施例示出的一种多媒体播放方法的流程图,如图3所示,可选的,在步骤S12中,根据所述音频数据确定完整语句的语句起始位置,包括以下步骤。 [0097] FIG. 3 is a flow chart of a method for playing multimedia according to an illustrated exemplary embodiment, shown in Figure 3, an alternative, in step S12, it is determined according to statement complete statement of the audio data starting position, comprising the following steps.

[0098]在步骤S31中,检测音频数据中相邻两个音频信号之间的时间间隔。 [0098] In step S31, the time interval between the two audio signals detected audio data adjacent.

[0099]在步骤S32中,当相邻两个音频信号之间的时间间隔大于第一预设间隔时长时,确定相邻两个音频信号之间的任一时间位置为语句起始位置。 [0099] In step S32, when the time duration of the interval between two adjacent intervals is greater than a first predetermined audio signal, determining a time position between any adjacent two of the audio signal is a statement starting position.

[0100]图4是根据一示例性实施例示出的一种多媒体播放方法的流程图,如图4所示,可选的,在步骤S12中,根据所述字幕数据确定完整语句的语句起始位置,包括以下步骤。 [0100] FIG. 4 is a flowchart illustrating a multimedia playback method of an embodiment of an exemplary embodiment shown in Figure 4, an alternative, in step S12, a complete sentence is determined based on the caption sentence data starting position, comprising the following steps.

[0101]在步骤S41中,获取字幕数据中每条字幕的起始显示时间和/或终止显示时间; [0102] 在步骤S42中,根据字蒂的起始显不时间和/或终止显不时间确定语句起始位置。 [0101] In step S41, the subtitle data acquisition starting time of each subtitle display and / or termination time display; [0102] In step S42, according to the starting time of the word is not significant pedicle and / or termination is not significant time to determine the starting position statement. 由于两句话之间应该有一定的时间间隔,如〇. 1秒,因此,可以根据音频信号之间的间隔时长确定完整句子。 Since between two words should have a certain time interval, such as square. 1 second, therefore, can determine the full length of the interval between sentences in accordance with the audio signal. 同理,当用户观播放的音视频文件有字幕时,还可以根据相邻字幕之间的时间间隔确定完整句子。 Similarly, when users view audio and video files to play with subtitles, you can determine the full sentence based on the time interval between adjacent subtitle. 例如,获取到前一条字幕的终止显示时间为3分04秒160毫秒,后一条字幕的起始显示时间为3分〇4秒290毫秒,两条字幕之间的间隔为130毫秒,g卩0.13秒,超过了0.1秒,可以判断这两条字幕之间存在语句起始位置。 Example, acquired before the termination of a subtitle display time of 3 minutes 04 seconds 160 ms, after a subtitle display start time of 3 minutes 〇4 seconds 290 milliseconds, the interval between two subtitles 130 ms, g 0.13 Jie seconds, more than 0.1 seconds, based on the starting position of the statement is present between these two subtitles.

[0103]或者,在有些音视频文件中,一条字幕本身就对应一句完整的语句,这样可以根据本条字幕的起始显示时间或上一条字幕的终止显示时间确定语句起始位置。 [0103] Alternatively, in some video and audio files, corresponding to a caption in itself a complete statement, so that the starting section according to the subtitle display time or a termination of the subtitle display time determining the starting position of the statement.

[0104] 在可选方案中,通过根据相邻音频信号之间的时间间隔或相邻字幕之间的时间间隔确定语句起始位置,使得后续可以根据语句起始位置继续播放音频或视频,用户继续播放时可以捕捉并理解到完整的语句,视频或音频里的对话更自然,情节更连续,提高用户对视频或音频播放的体验度。 [0104] In an alternative embodiment, the starting position is determined by the statement based on the time interval between adjacent audio signal or the time interval between adjacent subtitles, so that the subsequent audio or video continues to play according to the sentence start position, the user capture and playback can continue to understand the complete statement, video or audio dialogue in a more natural, more continuous plot, improve the user experience for video or audio playback.

[0105] 图5是根据一示例性实施例示出的一种多媒体播放方法的流程图,如图5所示,可选的,在步骤S12中,根据音频数据和字幕数据确定完整语句的语句起始位置,包括: [0105] FIG. 5 is a flow chart of a method for playing multimedia according to an illustrated exemplary embodiment, shown in Figure 5, an alternative, in step S12, it is determined according to statement complete sentence caption data and audio data from starting position, comprising:

[0106] 在步骤S51中,检测音频数据中每个音频信号的播放时间。 [0106] In step S51, the detection time of each audio data playback of the audio signal.

[0107] 在步骤S52中,当相邻两个音频信号之间的时间间隔大于第一预设间隔时长时,获取相邻音频信号对应的字幕的起始显不时间和/或终止显不时间。 [0107] In step S52, when the time duration of the interval between two adjacent intervals is greater than a first predetermined audio signal, an audio signal corresponding to the adjacent acquisition start time of subtitles is not significant and / or termination time without significant .

[0108] 在步骤S52中,根据相邻两个音频信号的播放时间及相邻音频信号对应的字幕的起始显示时间和/或终止显示时间确定语句起始位置。 [0108] In step S52, according to the start time of playing the audio signal and the adjacent two adjacent audio signal corresponding to the subtitle display time and / or terminate the display time determining the starting position of the statement.

[0109] 例如,通过对音频数据分析,获得相邻两个音频信号的播放时间3分09秒和3分12 秒,这两个相邻音频信号的时间间隔未3秒,大于预先设定的〇• 1秒;而这两个相邻音频信号对应的相邻两条字幕的显示时间为3分〇8秒和3分11秒,也大于预先设定的(^丨秒。因此,可以确定至少在3分10秒处同时出现音频信号和字幕的空白,可以将3分10秒作为继续播放多媒体的起点。 [0109] For example, by analyzing the audio data to obtain audio signals of two adjacent playing time of 3 minutes 09 seconds and 3 minutes and 12 seconds, the two adjacent time interval of the audio signal is not three seconds, is greater than a predetermined • 1 second square; and the display time of the two adjacent adjacent two audio signals corresponding to the caption for the second and 3 minutes 〇8 3 minutes and 11 seconds, is also greater than a predetermined (second Shu ^ Thus, it is possible to determine at least an audio signal and subtitles appear blank while at 3 minutes 10 seconds, 3 minutes and 10 seconds may be used as starting points for further playing multimedia.

[0110] 在可选方案中,同时对音频数据和字幕数据进行分析,确定两个完整语句之间的间隔位置,从而更精确地获得完整语句的起始点,不仅不会影响到用户对语句的理解,也不会影响到用户观看到字幕。 [0110] In an alternative embodiment, while the audio data and subtitle data are analyzed to determine the position of the interval between two complete statements to more accurately obtain a complete statement of the starting point, not only does not affect the user's statement understanding, it will not affect the user to view subtitles. _ _

[0111] 图6是根据一示例性实施例示出的一种多媒体播放方法的流程图,如图6所示,可选的,在步骤S12中,根据所述音频数据确定完整语句的语句起始位置,包括以下步骤。 [0111] FIG. 6 is a flowchart illustrating a multimedia playback method of an embodiment according to an exemplary embodiment, shown in Figure 6, an alternative, in step S12, a complete sentence is determined based on the audio data initial statement position, comprising the following steps.

[0112]在步骤S61中,根据人声频率对音频数据进行过滤,得到人声音频数据。 [0112] In step S61, according to the frequency of human voice on the audio data filtering, to obtain voice audio data.

[0113]在步骤S62中,检测人声音频数据中相邻两个人声音频信号之间的时间间隔。 [0113] In step S62, the time interval between the two vocal tone detector vocal audio data adjacent.

[0114]在步骤S63中,当相邻两个人声音频信号之间的时间间隔大于所述第一预设间隔时长时,确定相邻两个人声音频信号间之间的任一时间位置为语句起始位置。 [0114] In step S63, when the time interval between adjacent two vocal audio signal interval is greater than the first preset time interval duration, at any one time between the two positions between vocal audio signal is determined for the statement adjacent starting point.

[0115]在可选方案中,当音频数据中除了人声外,还存在背景声音(音乐、环境音等)的干扰,因此,无法根据音频信号之间的时间间隔确定语句起始位置。 [0115] In an alternative embodiment, when the audio data in addition to voice, but also the presence of the background sound (music, environmental sound, etc.) of the interference, therefore, can not determine the interval of time between the sentence in accordance with the start position of the audio signal. 那么,可以按照人声通常的频率先对音频数据过滤,从而单纯对人声音频信号进行分析,根据人声音频信号之间的时间间隔确定语句起始位置,使得对语句起始位置的确定更加准确。 Then, the audio data may be first filtered by a normal human voice frequency, so simple to vocal audio signal is analyzed to determine the interval based on the time between the starting position of the statement of the human voice audio signal, such that the determination of the starting position of the statement is more accurate.

[0116]可选的,当根据所述音频数据和/或字幕数据确定出至少两个完整语句的语句起始位置时,在步骤S13中,包括:从距离所述暂停位置最近的语句起始位置继续播放所述多媒体;或者当预设的回退语句数量为N时,从所述暂停位置之前的第N个语句起始位置继续播放所述多媒体,所述N为大于或等于2的整数。 [0116] Alternatively, when it is determined that the start position of the at least two complete statement statements based on the audio data and / or subtitle data, in step S13, comprising: a distance from the pause start position of the latest statement resume playing of the multimedia; or when a preset number of backoff statement N, continue to play from the position before the pause start position of the N-th sentence of the multimedia, wherein N is an integer greater than or equal to 2, .

[0117]例如,如图2所示,分析音频数据后,得到两个语句起始位置:3分10秒和3分18秒, 暂停位置为3分20秒。 [0117] For example, as shown in FIG. 2, the analysis of audio data, the start position to give two statements: 3 minutes 10 seconds and 3 minutes and 18 seconds, the pause position of 3 minutes and 20 seconds. 可以选择距离暂停位置最近的3分18秒继续播放视频,或者,如果预先设定回退语句数量为2,即回退2句话继续播放视频,则可以选择从3分10秒继续播放视频。 You can select a distance nearest pause position 3 minutes 18 seconds to continue to play the video, or, if the predetermined number of backoff statement 2, i.e. 2 backoff otherwise continue to play the video, it can choose to continue to play the video from 3 minutes and 10 seconds. [0118]在可选方案中,当确定多个语句起始位置时,可以灵活选择其中一个作为暂停后继续播放音视频的起点,使得用户继续播放时可以捕捉并理解到完整的语句,视频或音频里的对话更自然,情节更连续,提高用户对视频或音频播放的体验度。 [0118] In an alternative embodiment, when the plurality of statements determining the starting position, wherein the flexibility to choose to continue after a pause to play back as audio and video, can be captured and understood that such complete statement user continues playing, video or audio dialogue in a more natural, more continuous plot, improve the user experience for video or audio playback.

[0119]可选的,在步骤S11和步骤S12中,当根据所述多媒体的暂停位置之前第一预设时长内的音频数据和/或字幕数据无法确定完整语句的语句起始位置时,该方法还包括: [0119] Alternatively, in step S11 and step S12, when the audio data within a first predetermined duration and / or subtitle data can not determine the starting position of a complete statement statement before the pause position based multimedia, the the method further comprises:

[0120]按照时间从后往前的顺序,获取第一预设时长的音频数据和/或字幕数据,其中, 本次获取的第一预设时长的音频数据和/或字幕数据的播放时间在上一次获取的第一预设时长的音频数据和/或字幕数据的播放时间之前; [0120] in chronological order from the back, access to the first predetermined duration of audio data and / or subtitle data, wherein the playback time, acquired this time the first preset duration of audio data and / or subtitle data before a playback time duration acquired first preset audio data and / or subtitle data;

[0121]从本次获得的该第一预设时长的音频数据和/或字幕数据中确定完整语句的语句起始位置; [0121] determined from a complete sentence when the first predetermined length of audio data obtained this time and / or subtitle data sentence initial position;

[0122]若从本次获得的该第一预设时长的音频数据和/或字幕数据中无法确定完整语句的语句起始位置,则按照时间从后往前的顺序继续向前获取第一预设时长的音频数据和/ 或字幕数据并确定完整语句的语句起始位置,直到确定出至少一个完整语句的语句起始位置。 [0122] When the predetermined duration from the first audio data obtained this time and / or subtitle data can not determine the starting position of a complete statement statement, the first pre continue receiving forward chronological order from the back of the long audio data and / or subtitle data set and determining the starting position of a complete statement statements, statements starting position until it is determined that at least one complete statement.

[0123 ] 例如,图7是根据一;^:例性实施例示出的暂停位置与语句起始位置的时间轴不意图,如图7所示,用户播放视频时,暂停位置为3分20秒处,根据获取到的暂停位置之前16秒, 即3分04秒至3分20秒之间的音频数据和/或字幕数据,没有得到一个完整语句的语句起始位置,可以在3分04秒之前再获取16秒,S卩2分48秒至3分04秒之间的音频数据和/或字幕数据进行语句起始位置的分析,直到确定出至少一个完整语句的语句起始位置。 [0123] For example, FIG. 7 is a; ^: when the pause position illustrated embodiment the time axis statement embodiments are not intended starting position, shown in Figure 7, the user to play the video, the pause position of 3 minutes 20 seconds at 16 seconds according to the previously acquired pause position, i.e., 3 minutes 04 seconds to 3 minutes and 20 seconds between the audio data and / or subtitle data, not a complete statement statement starting position, can be 3 minutes 04 seconds 16 seconds before reacquisition, S Jie 2 minutes and 48 seconds to 3 minutes and 04 seconds between the audio data and / or subtitle data are analyzed sentence initial position, until it is determined that the starting position of at least one complete statement statement.

[0124]在可选方案中,在暂停位置之前按照时间顺序获取一段时间的数据进行语句起始位置的分析,如果没有得到一个完整语句的语句起始位置,则在此之前再获取一段时间的数据进行分析,直到确定出一个语句起始位置,作为暂停后继续播放音视频的起点,使得用户继续播放时可以捕捉并理解到完整的语句,视频或音频里的对话更自然,情节更连续,提高用户对视频或音频播放的体验度。 [0124] In an alternative embodiment, the data acquisition period prior to the pause position chronologically analyzed sentence initial position, if not a complete statement sentence initial position, then re-acquired before this period of time analysis of the data, until it is determined that a sentence starting position, after a pause as the starting point to continue playing audio and video, making it possible to capture and understand the complete statement the user continues to play, video or audio dialogue in a more natural, more continuous plot, improve the user experience for video or audio playback.

[0125]可选的,在步骤S11中,还可以获取多媒体的暂停位置之前的、且与所述暂停位置间隔第二预设时长的时间位置;获取所述时间位置之前第一预设时长内的音频数据和/或字幕数据。 Within a first predetermined length of time prior to acquiring the position; [0125] Alternatively, in step S11, it may also be acquired before the multimedia pause position, and the length of the pause and the second predetermined time position spaced locations audio data and / or subtitle data.

[0126]在步骤S12中,根据所述时间位置之前第一预设时长内的音频数据和/或字幕数据,确定完整语句的语句起始位置。 [0126] In step S12, prior to the time position in the audio data according to a first predetermined duration and / or subtitle data, to determine the complete statement sentence initial position.

[0127]例如,图8是根据一示例性实施例示出的暂停位置与语句起始位置的时间轴示意图,如图8所示,暂停位置为3分20秒处,第一预设时长为16秒,第二预设时长为5秒,获取3分15秒之前16秒,S卩2分59秒至3分15秒之间的音频数据和/或字幕数据。 [0127] For example, FIG. 8 is an exemplary embodiment in accordance with an exemplary embodiment the pause position and the starting position of the statement of the time axis shows the diagram shown in Figure 8, the pause position of 3 minutes and 20 seconds, a first predetermined length 16 second, the second preset duration is 5 seconds, 16 seconds before acquiring 3 minutes and 15 seconds, S Jie 2 minutes and 59 seconds to 3 minutes and 15 seconds between the audio data and / or subtitle data. 经分析后得到语句起始位置为3分18秒。 Statement obtained by analysis start position is 3 minutes and 18 seconds.

[0128] 在可选方案中,可以先选取到暂停位置之前一段时间例如5秒前的时间位置,以该时间位置作为往回寻找完整语句的语句起始位置的起点,使得用户可以获得提供更充裕的进入视频情节的时间。 [0128] In an alternative embodiment, it is possible to select the pause position before the first period of time, for example, at position 5 seconds ago, at which time back position as a starting point for a complete statement sentence initial position, so that the user can obtain a more plenty of time to enter the video episodes.

[0129]例如,如图8所示,经分析后得到两个语句起始位置,3分10秒和3分18秒,分别在上述时间位置(3分15秒)之前和在上述时间位置和暂停位置(3分20秒)之间,这两个语句起始位置均可以用于作为暂停后继续播放音视频的起点。 [0129] For example, as shown in FIG. 8, after the analysis start position to give two statements, 3 minutes 10 seconds and 3 minutes and 18 seconds, respectively, said time position at said time position (3 minutes and 15 seconds) prior to and at and between pause positions (3 minutes and 20 seconds), these two statements may be used in the starting position as a starting point after a pause resume playing audio and video.

[0130] 在可选方案中,可以按照时间从后往前的顺序依序获取上述时间位置(3分15秒) 之前至少一个ie秒内的音频数据和/或字幕数据,并在每获得一个16秒内的音频数据和/或字幕数据时,从获得的该16秒内的音频数据和/或字幕数据中确定完整语句的语句起始位置,直到确定出至少一个完整语句的语句起始位置。 [0130] In an alternative embodiment, may be sequentially acquires the time positions in order from the back of the time (3 minutes and 15 seconds) before the audio data in at least one of ie second and / or subtitle data, and each obtaining a and / or subtitle data of the audio data for 16 seconds from the audio data obtained within 16 seconds, and / or subtitle data, determines the complete statement sentence initial position, until it is determined that at least one starting position of a complete statement statements .

[0131]在可选方案中,当以暂停位置之前一段时间的时间位置作为往回寻找完整语句的语句起始位置的起点时,确定的语句起始位置可以在该时间位置之前,也可以在该时间位置与暂停位置之间,对于暂停后继续播放音视频的起点的选择更加灵活,使得用户继续播放时可以捕捉并理解到完整的语句,视频或音频里的对话更自然,情节更连续,提高用户对视频或音频播放的体验度。 [0131] In an alternative embodiment, the time when the pause position to a position before a period of time as a starting point for a complete statement statement back start position, the starting position can be determined by the statement before the time position, it may be the pause between the time position with the position, after a pause for continuing to play back audio and video options more flexible, so that the user can continue to capture and understand the complete statement, video or audio dialogue in a more natural when you play, the plot is more continuous, improve the user experience for video or audio playback.

[0132]下面分别以两个具体示例对本公开的多媒体播放方法进行具体说明。 [0132] The following are two specific examples of the method for playing multimedia according to the present disclosure will be specifically described.

[0133]示例一 [0133] Example a

[0134]图9是根据一示例性实施例示出的一种多媒体播放方法的流程图,如图9所示,该方法包括以下步骤。 [0134] FIG. 9 is a flowchart illustrating a multimedia playback method of an embodiment according to an exemplary embodiment, shown in FIG. 9, the method comprising the following steps.

[0135]在步骤S91中,在视频播放过程中发生暂停,暂停位置为5分36秒。 [0135] In step S91, the pause occurs during video playback, pause at 5 minutes and 36 seconds.

[0136]在步骤S92中,读取暂停位置之前I6秒的音频数据,g卩5分20秒至5分36秒的音频数据。 [0136] In step S92, the reading position of the audio data before the pause I6 seconds, g Jie audio data to 5 minutes 20 seconds 5 minutes 36 seconds.

[0137]在步骤S93中,根据人声频率对读取到的音频数据进行过滤,得到人声音频数据。 [0137] In step S93, according to the human voice frequency filtering the read audio data to obtain voice audio data. [01¾]在步骤S94中,检测人声音频数据中相邻两个人声音频信号之间的时间间隔。 [01¾] In step S94, the time interval between the two vocal tone detector vocal audio data adjacent.

[0139]在步骤S%中,判断相邻两个人声音频信号之间的时间间隔是否大于〇.丨秒,如果是,则执行步骤S%;如果否,则该相邻人声音频信号之间不是语句起始位置。 [0139] In step S%, the time is determined between two adjacent vocal audio signal is greater than a billion Shu second interval, if so, executing step S%;. If not, the audio signal of the vocal adjacent inter are not statements starting position.

[0140]在步骤S%中,确定相邻人声音频信号之间的任一时间位置为语句起始位置,得到的语句起始位置有2个,5分29秒和5分33秒。 [0140] In step S%, determining a time position between any adjacent the vocal audio signal is a start position statements, statements are obtained starting position 2, and 5 minutes 29 seconds 5 minutes 33 seconds. 在步骤S97中,选择距离暂停位置最近的5分33秒继续播放视频。 In step S97, select the pause position from the nearest 5 minutes 33 seconds to resume the video.

[0142]示例二 [0142] Example Two

[0143]图1〇是根据一示例性实施例示出的一种多媒体播放方法的流程图,如图10所示, 该方法包括以下步骤。 [0143] FIG 1〇 is a flowchart of a method for playing multimedia according to an illustrated exemplary embodiment, shown in Figure 10, the method comprising the following steps.

[0144] 在步骤S101中,在视频播放过程中发生暂停,暂停位置为5分36秒。 [0144] In step S101, the pause occurs during video playback, pause at 5 minutes and 36 seconds.

[0145] 在步骤S102中,按照时间从后往前的顺序依序读取暂停位置之前5秒,g卩5分31秒的时间位置之前16秒的字幕数据。 [0145] In step S102, the chronological order are sequentially read from the back 5 seconds before the pause position, g Jie 5 minutes before the time position of 31 seconds 16 seconds subtitle data.

[0146]在步骤S103中,根据每次读取到的16秒的字幕数据中判断是否存在语句起始位置,如果是,执行步骤SM,如果否,返回步骤Sl〇2,读取5分15秒之前16秒的字幕数据; [0146] In step S103, according to each read subtitle data in 16 seconds is determined whether there is a sentence initial position, if yes, step SM, if not, returns to step Sl〇2, 15 minutes to read 5 16 seconds before the second subtitle data;

[0147] 在步骤S104中,得到语句起始位置有3个:5分02秒,5分09秒和5分13秒。 [0147] In step S104, the statement obtained starting Location 3: 5 minutes 02 seconds 5 minutes 09 seconds and 5 minutes 13 seconds.

[0148] 在步骤S105中,预先设定回退语句数量为2,则回退到5分31秒之前的2句,g卩5分09 秒的位置继续播放视频。 [0148] In step S105, a predetermined backoff statement number is 2, then fall back to 2 minutes and 31 seconds before 5, g Jie 5 minutes 09 seconds to resume playing the video.

[0149]在上述两个具体示例中,通过分析音频数据和/或字幕数据,可以灵活地根据确定的语句起始位置继续播放视频或音频,使得用户继续播放时可以捕捉并理解到完整的语句,视频或音频里的对话更自然,情节更连续,提高用户对视频或音频播放的体验度。 [0149] In the above-described two specific examples, by analyzing audio data and / or subtitle data, the flexibility to continue to play the video or audio statement according to the determined starting position, so that the user continues to capture and playback understood complete statement , video or audio dialogue in a more natural, more continuous plot, improve the user experience for video or audio playback.

[0150] 图11是根据一示例性实施例示出的一种多媒体播放装置的框图。 [0150] FIG. 11 is a block diagram of an embodiment of a multimedia playing apparatus shown according to an exemplary embodiment. 参照图11,该装置包括获取模块111、分析模块112和播放模块113。 Referring to FIG. 11, the apparatus includes an obtaining module 111, analysis module 112 and the playback module 113.

[0151] 获取模块111被配置为获取多媒体的暂停位置之前第一预设时长的音频数据和/ 或字幕数据。 [0151] 111 acquisition module configured to acquire before the pause position of the first predetermined duration of multimedia audio data and / or subtitle data.

[0152] 分析模块112被配置为根据所述获取模块获取的音频数据和/或字幕数据确定完整语句的语句起始位置。 [0152] Analysis module 112 is configured to determine a complete sentence according to the data acquisition module acquires audio and / or subtitle data statements starting position.

[0153] 播放模块113被配置为用于当检测到继续播放所述多媒体的指令或满足继续播放所述多媒体的条件时,根据所述分析模块确定的语句起始位置继续播放所述多媒体。 [0153] The playing module 113 is configured for detecting when the resume playback of the multimedia instructions or continue to play the multimedia conditions satisfied, continue playing multimedia according to the statement starting position determined by the analysis module.

[0154] 图12a是根据一示例性实施例示出的分析模块的框图。 [0154] FIG 12a is a block diagram illustrating the analysis module according to an exemplary embodiment. 如图12a所示,可选的,所述分析模块112包括:检测单元1121和分析确定单元1122。 As shown in FIG 12a, Alternatively, the analysis module 112 comprises: a detection and analysis unit 1121 determination unit 1122.

[0155] 检测单元1121被配置为检测所述获取模块111获取的所述音频数据中相邻两个音频信号之间的时间间隔; [0155] The detection unit 1121 is configured to detect the audio data acquired by the acquiring module 111 adjacent time interval between two audio signals;

[0156] 分析确定单元1122被配置为当所述检测单元1121检测到的相邻两个音频信号之间的时间间隔大于第一预设间隔时长时,确定所述相邻两个音频信号之间的任一时间位置为所述语句起始位置; Between [0156] Analysis determination unit 1122 is greater than the time interval between when the detection unit configured to detect 1121 adjacent two predetermined audio signals of the first duration, the audio signal is determined two adjacent time interval any one time position is the start position of the statement;

[0157] 图12b是根据一示例性实施例示出的分析模块的框图。 [0157] FIG 12b is a block diagram illustrating the analysis module according to an exemplary embodiment. 如图12b所示,可选的,分析模块112包括:获取单元1123和分析确定单元1122。 12b, an optional analysis module 112 includes: an obtaining unit 1123 and the analysis unit 1122 determination.

[0158] 获取单元1123被配置为从所述获取模块111获取的所述字幕数据中获取每条字幕的起始显示时间和/或终止显示时间。 [0158] acquiring unit 1123 is configured to acquire the start of each subtitle caption data from the acquisition module 111 acquires the display time and / or terminating the display time.

[0159] 分析确定单元1122被配置为根据所述获取单元1123获取的所述字幕的起始显示时间和/或终止显示时间确定所述语句起始位置。 [0159] Analysis determination unit 1122 is configured to initiate the subtitle according to the acquisition unit 1123 acquires display time and / or terminating the statements display time determining the starting position.

[0160] 图12c是根据一示例性实施例示出的分析模块的框图。 [0160] FIG 12c is a block diagram illustrating the analysis module according to an exemplary embodiment. 如图12c所示,可选的,分析模块112包括:检测单元1121、获取单元1123和分析确定单元1122。 Shown in FIG. 12c, an optional analysis module 112 comprises: a detection unit 1121, obtaining unit 1123 and the analysis unit 1122 determination.

[0161] 检测单元1121被配置为检测所述获取模块111获取的所述音频数据中每个音频信号的播放时间。 Play time of each audio signal into the audio data [0161] detecting unit 1121 is configured to detect the acquisition module 111 acquired.

[0162] 获取单元1123被配置为当所述检测单元1121检测的相邻两个音频信号之间的时间间隔大于第一预设间隔时长时,获取所述相邻音频信号对应的字幕的起始显示时间和/ 或终止显示时间。 [0162] configured to obtain the time between when the detecting unit 1121 detects adjacent two audio signals starting period is greater than a first predetermined duration, acquires an audio signal corresponding to the adjacent unit interval subtitle 1123 display time and / or terminate the display time.

[0163] 分析确定单元1122被配置为根据所述获取单元1123获取的所述相邻两个音频信号的播放时间及所述相邻音频信号对应的字幕的起始显示时间和/或终止显示时间确定所述语句起始位置。 [0163] Analysis determination unit 1122 is configured to be adjacent to two audio playback time based on said signals acquired by the acquisition unit 1123 and an audio signal corresponding to the adjacent starting subtitle display time and / or terminate the display time determining a sentence initial position.

[0164] 图12d是根据一示例性实施例示出的分析模块的框图。 [0164] Figure 12d is a block diagram illustrating the analysis module according to an exemplary embodiment. 如图12d所示,可选的,所述分析模块112还包括:过滤单元1124。 FIG. 12d, Optionally, the analysis module 112 further comprises: a filter unit 1124.

[0165] 过滤单元1124被配置为根据人声频率对所述获取模块111获取的所述音频数据进行过滤,得到人声音频数据; [0165] Filter unit 1124 is configured to filter the audio data according to the vocal frequency acquisition module 111 acquires obtain voice audio data;

[0166] 所述检测单元1121被配置为检测所述过滤单元1124过滤后的所述人声音频数据中相邻两个人声音频信号之间的时间间隔; [0166] The detection unit 1121 is configured to detect the time between two filter vocal audio signal after the vocal audio data filtering unit 1124 adjacent interval;

[0167] 所述分析确定单元1122被配置为当所述检测单元1121检测到的相邻两个人声音频信号之间的时间间隔大于所述第一预设间隔时长时,确定所述相邻两个人声音频信号间之间的任一时间位置为所述语句起始位置。 [0167] The analysis time between determination unit 1122 is configured to, when the detecting unit 1121 detects the voice of the adjacent two of said audio signal interval is greater than a first predetermined time interval length determining the two adjacent at any one time between the individual positions between the acoustic audio signal into the starting position statement.

[0168] 可选的,所述播放模块113被配置为当所述分析模块112确定出至少两个完整语句的语句起始位置时,从距离所述暂停位置最近的语句起始位置继续播放所述多媒体;或者当预设的回退语句数量为N时,从所述暂停位置之前的第N个语句起始位置继续播放所述多媒体,所述N为大于或等于2的整数。 [0168] Optionally, the playback module 113 is configured to continue the play when the analysis module 112 determines the start position of the at least two complete statement statements, the distance from the position of the latest pause start position statements said multimedia; or when a preset number of backoff statement N, continue playing the media from the previous N-th position of the pause statement starting position, wherein N is an integer greater than or equal to 2.

[0169] 可选的,所述获取模块111被配置为当所述分析模块112根据所述多媒体的暂停位置之前第一预设时长内的音频数据和/或字幕数据无法确定完整语句的语句起始位置时, 按照时间从后往前的顺序,获取第一预设时长的音频数据和/或字幕数据,其中,本次获取的第一预设时长的音频数据和/或字幕数据的播放时间在上一次获取的第一预设时长的音频数据和/或字幕数据的播放时间之前。 [0169] Optionally, the acquisition module 111 is configured to the analysis module 112 when the audio data within the first predetermined duration and / or subtitle data can not be determined prior to complete statement of the multimedia pause position according to statement from when the initial position, in chronological order from the back, access to the first predetermined duration of audio data and / or subtitle data, wherein the playback time, acquired this time the first preset duration of audio data and / or subtitle data before playing time of the last acquired a first predetermined duration of the audio data and / or subtitle data.

[0170]所述分析模块112被配置为从所述获取模块111本次获得的该第一预设时长的音频数据和/或字幕数据中确定完整语句的语句起始位置;若从本次获得的该第一预设时长的音频数据和/或字幕数据中无法确定完整语句的语句起始位置,则按照时间从后往前的顺序继续向前获取第一预设时长的音频数据和/或字幕数据并确定完整语句的语句起始位置,直到确定出至少一个完整语句的语句起始位置。 [0170] The analysis module 112 is configured to determine a complete sentence from the first predetermined duration of the audio data acquisition module 111. The acquired and / or subtitle data in a sentence initial position; if obtained from this the first predetermined duration of audio data and / or subtitle data can not determine the complete statement sentence initial position, then continue receiving forward chronological order from the back of a first predetermined duration of audio data and / or subtitle data and determine the starting position of a complete statement statements, statements starting position until it is determined that at least one complete statement.

[0171] 可选的,所述获取模块111被配置为获取多媒体的暂停位置之前的、且与所述暂停位置间隔第二预设时长的时间位置;获取所述时间位置之前第一预设时长内的音频数据和/或字幕数据。 [0171] Optionally, the acquisition module 111 is configured to acquire before the pause position of the multimedia, and long when the second predetermined time interval position and the inactive position; the position before acquiring the first predetermined length of time in the audio data and / or subtitle data.

[0172] 可选的,所述分析模块112被配置为根据所述时间位置之前第一预设时长内的音频数据和/或字幕数据,确定完整语句的语句起始位置。 [0172] Alternatively, the analysis module 112 is configured to predetermined audio data in the first duration and / or subtitle data according to the position prior to the time, the start position determination statement statement is complete.

[0173] 关于上述实施例中的装置,其中各个模块执行操作的具体方式已经在有关该方法的实施例中进行了详细描述,此处将不做详细阐述说明。 [0173] For the above-described embodiment apparatus, wherein each module performs a specific operation of the embodiment has been described in detail in an embodiment relating to the method, and will not be here described in detail.

[0174] 图13是根据一示例性实施例示出的一种用于多媒体播放的装置1300的框图。 [0174] FIG. 13 is a block diagram of an apparatus 1300 for playing multimedia according to an exemplary embodiment illustrated in the exemplary embodiment. 例如,装置1300可以是移动电话,计算机,数字广播终端,消息收发设备,游戏控制台,平板设备,医疗设备,健身设备,个人数字助理等。 For example, device 1300 may be a mobile phone, a computer, a digital broadcast terminal, a messaging device, a game console, a tablet device, medical equipment, fitness equipment, personal digital assistant.

[0175]参照图13,装置1300可以包括以下一个或多个组件:处理组件1302,存储器1304, 电源组件1306,多媒体组件1加8,音频组件UlO,输入/输出(I/O)的接口1312,传感器组件1314,以及通信组件1316。 [0175] Referring to Figure 13, apparatus 1300 may include one or more components: a processing component 1302, a memory 1304, a power assembly 1306, plus 1 8 multimedia components, audio components ULO, input / output (I / O) interface 1312 , the sensor assembly 1314, 1316 and a communication component.

[0176]处理组件1302通常控制装置1300的整体操作,诸如与显示,电话呼叫,数据通信, 相机操作和记录操作相关联的操作。 [0176] The processing component 1302 generally controls the overall operation of the device 1300, such as a display, a telephone call, data communication, camera operations and recording operations associated with the operation. 处理组件1302可以包括一个或多个处理器1320来执行指令,以完成上述的方法的全部或部分步骤。 Processing component 1302 may include one or more processor 1320 to execute instructions, to perform all or part of the steps of the method described above. 此外,处理组件1302可以包括一个或多个模块,便于处理组件13〇2和其他组件之间的交互。 Moreover, processing component 1302 may include one or more modules, facilitates the interaction between a component and other components 13〇2. 例如,处理部件1302可以包括多媒体模块, 以方便多媒体组件1308和处理组件1302之间的交互。 For example, processing component 1302 may include a multimedia module, multimedia components to facilitate interaction between a processing component 1308 and 1302.

[0177]存储器1304被配置为存储各种类型的数据以支持在设备1300的操作。 [0177] The memory 1304 is configured to store various types of data to support the operation of the device 1300. 这些数据的示例包括用于在装置13〇0上操作的任何应用程序或方法的指令,联系人数据,电话簿数据, 消息,图片,视频等。 These examples of the data include instructions or any application method on a device for operating 13〇0, contact data, phonebook data, messages, pictures, videos and the like. 存储器1304可以由任何类型的易失性或非易失性存储设备或者它们的组合实现,如静态随机存取存储器(SRAM),电可擦除可编程只读存储器(EEPROM),可擦除可编程只读存储器(EPROM),可编程只读存储器(PROM),只读存储器(ROM),磁存储器,快闪存储器,磁盘或光盘。 The memory 1304 may be implemented by any type of volatile or non-volatile storage devices, or combinations thereof, such as static random access memory (SRAM), electrically erasable programmable read only memory (EEPROM), erasable programmable Read Only memory (EPROM), programmable read-only memory (PROM), read-only memory (ROM), magnetic memory, flash memory, magnetic or optical disk.

[0178]电力组件13〇6为装置1300的各种组件提供电力。 [0178] Power assembly 13〇6 apparatus 1300 provides power to the various components. 电力组件1306可以包括电源管理系统,一个或多个电源,及其他与为装置1300生成、管理和分配电力相关联的组件。 The power assembly 1306 may include a power management system, one or more power sources, and the other is generating device 1300, a power management and distribution assembly is associated.

[0179]多媒体组件13〇8包括在所述装置1300和用户之间的提供一个输出接口的屏幕。 [0179] Display assembly 13〇8 comprises providing between the user device 1300 and a screen output interface. 在一些实施例中,屏幕可以包括液晶显示器(IXD)和触摸面板(TP)。 In some embodiments, the screen may include a liquid crystal display (IXD) and the touch panel (TP). 如果屏幕包括触摸面板, 屏幕可以被实现为触摸屏,以接收来自用户的输入信号。 If the screen includes a touch panel, the screen may be implemented as a touch screen to receive an input signal from a user. 触摸面板包括一个或多个触摸传感器以感测触摸、滑动和触摸面板上的手势。 The touch panel includes one or more touch sensors to sense touch, a gesture on the touch panel and sliding. 所述触摸传感器可以不仅感测触摸或滑动动作的边界,而且还检测与所述触摸或滑动操作相关的持续时间和压力。 The touch sensor may sense not only a touch or sliding motion of the boundary, but also detecting the touch or sliding correlation operation duration and pressure. 在一些实施例中,多媒体组件13〇8包括一个前置摄像头和/或后置摄像头。 In some embodiments, the multimedia 13〇8 assembly comprising a front camera and / or the rear camera. 当设备1300处于操作模式,如拍摄模式或视频模式时,前置摄像头和/或后置摄像头可以接收外部的多媒体数据。 When the apparatus 1300 is in operation mode, such as a shooting mode or video mode, front camera and / or the rear camera may receive an external multimedia data. 每个前置摄像头和后置摄像头可以是一个固定的光学透镜系统或具有焦距和光学变焦能力。 Each of the front camera and the rear camera may be a fixed optical system or a lens having a focal length and optical zoom capability.

[0180]音频组件1310被配置为输出和/或输入音频信号。 [0180] Audio component 1310 is configured to output and / or input audio signal. 例如,音频组件1310包括一个麦克风(MIC),当装置1:300处于操作模式,如呼叫模式、记录模式和语音识别模式时,麦克风被配置为接收外部音频信号。 For example, an audio assembly 1310 includes a microphone (the MIC), when the apparatus 1: 300 in the operation mode, such as a call mode, recording mode and voice recognition mode, the microphone configured to receive an external audio signal. 所接收的音频信号可以被进一步存储在存储器1304或经由通信组件1316发送。 The received audio signal may be transmitted further or stored in the memory 1304 via the communications component 1316. 在一些实施例中,音频组件1310还包括一个扬声器,用于输出音频信号。 In some embodiments, an audio assembly 1310 further includes a speaker for outputting an audio signal. [0181] I/O接口1:312为处理组件1加2和外围接口模块之间提供接口,上述外围接口模块可以是键盘,点击轮,按钮等。 [0181] I / O interfaces 1: 312 provides an interface between the processing unit 1 plus 2 and a peripheral interface module, said peripheral interface module may be a keyboard, a click wheel, buttons and the like. 这些按钮可包括但不限于:主页按钮、音量按钮、启动按钮和锁定按钮。 These buttons may include, but are not limited to: home button, volume button, start button and the lock button.

[0182]传感器组件1:314包括一个或多个传感器,用于为装置1300提供各个方面的状态评估。 [0182] Sensor assembly 1: 314 comprises one or more sensors, for providing the state evaluation of various aspects of the apparatus 1300. 例如,传感器组件1314可以检测到设备1300的打开/关闭状态,组件的相对定位,例如所述组件为装置1300的显示器和小键盘,传感器组件1314还可以检测装置1300或装置1300— 个组件的位置改变,用户与装置1300接触的存在或不存在,装置1300方位或加速/减速和装置1300的温度变化。 For example, the sensor assembly 1314 to the device 1300 may detect an open / closed state, the relative positioning of components, such as the assembly of a display device 1300 and a keypad, the sensor assembly 1314 may also detect the position of components 1300 or 1300 means the presence of the contact changes, the user device 1300 and the presence or absence, position 1300 or temperature change acceleration / deceleration device 1300 and the device. 传感器组件1:314可以包括接近传感器,被配置用来在没有任何的物理接触时检测附近物体的存在。 The sensor assembly 1: 314 may include a proximity sensor, configured to in the absence of any physical contact to detect the presence of nearby objects. 传感器组件1314还可以包括光传感器,如CMOS或CCD图像传感器,用于在成像应用中使用。 The sensor assembly may further include a light sensor 1314, such as CMOS or CCD image sensors, for use in imaging applications. 在一些实施例中,该传感器组件1314还可以包括加速度传感器,陀螺仪传感器,磁传感器,压力传感器或温度传感器。 In some embodiments, the sensor assembly 1314 may further include an acceleration sensor, a gyro sensor, a magnetic sensor, a pressure sensor or a temperature sensor. 10183 J通伝组件1316被配置为便于装置1300和其他设备之间有线或无线方式的通信。 10183 J-pass vale assembly 1316 is configured to communicate between the device 1300 and other devices to facilitate wired or wireless. 装置1300可以接入基于通信标准的无线网络,如WiFi,2G或3G,或它们的组合。 Apparatus 1300 may access the wireless network-based communications standards, such as WiFi, 2G or 3G, or combinations thereof. 在一个示例性实施例中,通信部件1316经由广播信道接收来自外部广播管理系统的广播信号或广播相关信息。 In one exemplary embodiment, the communication section 1316 receives a broadcast signal or broadcast associated information from an external broadcast management system via a broadcast channel. 在一个示例性实施例中,所述通信部件1316还包括近场通信(NFC)模块,以促进短程通信。 In one exemplary embodiment, the communication means 1316 further comprises a near field communication (NFC) module to facilitate short-range communications. 例如,在NFC模块可基于射频识别(RFID)技术,红外数据协会(IrDA)技术,超宽带(UWB)技术,蓝牙(BT)技术和其他技术来实现。 For example, the NFC module can be based on radio frequency identification (RFID) technology, infrared data association (IrDA), ultra wideband (UWB) technology, Bluetooth (BT) technology and other technologies.

[0184]在示例性实施例中,装置1300可以被一个或多个应用专用集成电路(ASIC)、数字信号处理器(DSP)、数字信号处理设备(DSPD)、可编程逻辑器件(PLD)、现场可编程门阵列(FPGA)、控制器、微控制器、微处理器或其他电子元件实现,用于执行上述方法。 [0184] In an exemplary embodiment, the device 1300 may be one or more application specific integrated circuits (ASIC), a digital signal processor (DSP), digital signal processing devices (DSPDs), programmable logic devices (PLD), a field programmable gate array (the FPGA), a controller, a microcontroller, a microprocessor, or other electronic components to achieve, for performing the above method.

[0185]在示例性实施例中,还提供了一种包括指令的非临时性计算机可读存储介质,例如包括指令的存储器1304,上述指令可由装置1300的处理器1320执行以完成上述方法。 [0185] In an exemplary embodiment, further comprising instructions provided a non-transitory computer-readable storage medium such as a memory including instructions 1304, the command executed by the processor 1320 means 1300 to perform the method described above. 例如,所述非临时性计算机可读存储介质可以是ROM、随机存取存储器(RAM)、CD-R0M、磁带、软盘和光数据存储设备等。 For example, the non-transitory computer-readable storage medium may be a ROM, a random access memory (RAM), CD-R0M, magnetic tapes, floppy disks, and optical data storage devices.

[0186] —种非临时性计算机可读存储介质,当所述存储介质中的指令由移动终端的处理器执行时,使得移动终端能够执行一种多媒体播放方法,包括: [0186] - species of non-transitory computer-readable storage medium, when the storage medium is executed by a processor of the mobile terminal, cause the mobile terminal to perform a multimedia playback method, comprising:

[0187]获取多媒体的暂停位置之前第一预设时长的音频数据和/或字幕数据; [0187] pause position before obtaining the first preset duration of multimedia audio data and / or subtitle data;

[0188]根据所述音频数据和/或字幕数据确定完整语句的语句起始位置; [0188] determining the starting position of a complete statement sentence according to the audio data and / or subtitle data;

[0189] 当检测到继续播放所述多媒体的指令或满足继续播放所述多媒体的条件时,根据所述语句起始位置继续播放所述多媒体。 [0189] When the multimedia playback condition is detected to continue to meet the multimedia instructions or continue playing the continuing playing the multimedia according to the statement starting position.

[0190] 可选的,所述根据所述音频数据确定完整语句的语句起始位置,包括: [0190] Alternatively, the starting position of the statement is determined based on the complete statement of audio data, comprising:

[0191] 检测所述音频数据中相邻两个音频信号之间的时间间隔; [0191] the time of detecting the audio data in an audio signal between the two adjacent intervals;

[0192]当相邻两个音频信号之间的时间间隔大于第一预设间隔时长时,确定所述相邻两个音频信号之间的任一时间位置为所述语句起始位置。 [0192] When the length of the interval between the adjacent two time interval is greater than a first predetermined audio signal, determining whether any two audio signals between a time position adjacent to the starting position of the statement.

[0193] 可选的,所述根据所述字幕数据确定完整语句的语句起始位置,包括: [0193] Alternatively, the complete sentence is determined based on the caption sentence data start position, comprising:

[0194] 获取所述字幕数据中每条字幕的起始显示时间和/或终止显示时间; [0194] The subtitle data acquisition starting time of each subtitle display and / or termination time display;

[0195] 根据所述字幕的起始显示时间和/或终止显示时间确定所述语句起始位置。 [0195] The start time of the subtitle display and / or terminate the display time determining the starting position of the statement.

[0196] 可选的,所述根据所述音频数据和字幕数据确定完整语句的语句起始位置,包括: [0196] Alternatively, the audio data, and subtitle data to determine the starting position of a complete statement sentence, comprising the:

[0197] 检测所述音频数据中每个音频信号的播放时间; [0197] the audio playback time of each signal detected in the audio data;

[0198] 当相邻两个音频信号之间的时间间隔大于第一预设间隔时长时,获取所述相邻音频信号对应的字幕的起始显示时间和/或终止显示时间; [0198] When the length of the interval between the adjacent two time interval is greater than a first predetermined audio signals, acquires the audio signal corresponding to the adjacent starting subtitle display time and / or termination time display;

[0199] 根据所述相邻两个音频信号的播放时间及所述相邻音频信号对应的字幕的起始显示时间和/或终止显示时间确定所述语句起始位置。 [0199] The audio signal corresponding to the adjacent time subtitle player audio signals and the two adjacent display start time and / or terminating the statements display time determining the starting position.

[0200] 可选的,所述根据所述音频数据确定完整语句的语句起始位置,包括: [0200] Alternatively, the starting position of the statement is determined based on the complete statement of audio data, comprising:

[0201] 根据人声频率对所述音频数据进行过滤,得到人声音频数据; [0201] filtering the audio data according to frequency of the human voice, the human voice audio data obtained;

[0202] 检测所述人声音频数据中相邻两个人声音频信号之间的时间间隔; [0202] The time between the two vocal tone detector of the vocal audio data adjacent interval;

[0203] 当相邻两个人声音频信号之间的时间间隔大于所述第一预设间隔时长时,确定所述相邻两个人声音频信号间之间的任一时间位置为所述语句起始位置。 [0203] When the time between two adjacent vocal audio signal is greater than the spacing interval of the first predetermined time duration, determines a position adjacent any one time between the two between the audio signal is a human voice statement from starting position.

[0204] 可选的,当根据所述音频数据和/或字幕数据确定出至少两个完整语句的语句起始位置时,所述根据所述语句起始位置继续播放所述多媒体,包括: [0204] Alternatively, when it is determined that the start position of the at least two complete statement statements based on the audio data and / or subtitle data, the initial position according to the statement to continue playing the multimedia, comprising:

[0205] 从距离所述暂停位置最近的语句起始位置继续播放所述多媒体;或者 [0205] The distance from the location of the nearest pause statement starting position to continue playing the multimedia; or

[0206] 当预设的回退语句数量为N时,从所述暂停位置之前的第N个语句起始位置继续播放所述多媒体,所述N为大于或等于2的整数。 [0206] When the preset number of backoff statement N, continue playing the media from the previous N-th position of the pause statement starting position, wherein N is an integer greater than or equal to 2.

[0207]可选的,当根据所述多媒体的暂停位置之前第一预设时长内的音频数据和/或字幕数据无法确定完整语句的语句起始位置时,所述方法还包括: [0207] Alternatively, when the first predetermined data length within the audio and / or subtitle data can not be determined prior to the complete statement by statement multimedia pause position the starting position, the method further comprising:

[0208]按照时间从后往前的顺序,获取第一预设时长的音频数据和/或字幕数据,其中, 本次获取的第一预设时长的音频数据和/或字幕数据的播放时间在上一次获取的第一预设时长的音频数据和/或字幕数据的播放时间之前; [0208] in chronological order from the back, access to the first predetermined duration of audio data and / or subtitle data, wherein the playback time, acquired this time the first preset duration of audio data and / or subtitle data before a playback time duration acquired first preset audio data and / or subtitle data;

[0209]从本次获得的该第一预设时长的音频数据和/或字幕数据中确定完整语句的语句起始位置; [0209] determined from a complete sentence when the first predetermined length of audio data obtained this time and / or subtitle data sentence initial position;

[0210]若从本次获得的该第一预设时长的音频数据和/或字幕数据中无法确定完整语句的语句起始位置,则按照时间从后往前的顺序继续向前获取第一预设时长的音频数据和/ 或字幕数据并确定完整语句的语句起始位置,直到确定出至少一个完整语句的语句起始位置。 [0210] When the predetermined duration from the first audio data obtained this time and / or subtitle data can not determine the starting position of a complete statement statement, the first pre continue receiving forward chronological order from the back of the long audio data and / or subtitle data set and determining the starting position of a complete statement statements, statements starting position until it is determined that at least one complete statement.

[0211]可选的,所述获取多媒体的暂停位置之前第一预设时长内的音频数据和/或字幕数据,包括: [0211] Optionally, before obtaining the first preset multimedia pause position within the length of the audio data and / or subtitle data, comprising:

[0212]获取多媒体的暂停位置之前的、且与所述暂停位置间隔第二预设时长的时间位置; [0212] Before acquiring multimedia pause position, and long time position and the second predetermined time interval pause position;

[0213]获取所述时间位置之前第一预设时长内的音频数据和/或字幕数据; [0213] acquire the audio data within the first predetermined length of time and / or subtitle data before the time position;

[0214]所述根据所述音频数据和/或字幕数据确定完整语句的语句起始位置,包括: [0214] The complete sentence is determined based on the audio data and / or subtitle data sentence initial position, comprising:

[0215]根据所述时间位置之前第一预设时长内的音频数据和/或字幕数据,确定完整语句的语句起始位置。 [0215] audio data within a first predetermined duration and / or subtitle data according to the position prior to the time, the start position determination statement statement is complete.

[0216] 本领域技术人员在考虑说明书及实践这里公开的发明后,将容易想到本发明的其它实施方案。 [0216] Those skilled in the art upon consideration of the specification and practice of the invention disclosed herein, will readily appreciate other embodiments of the present invention. 本申请旨在涵盖本发明的任何变型、用途或者适应性变化,这些变型、用途或者适应性变化遵循本发明的一般性原理并包括本公开未公开的本技术领域中的公知常识或惯用技术手段。 This application is intended to cover any variations, uses, or adaptations of the present invention encompasses these variations, uses, or adaptations of the invention following the general principles of the common general knowledge and comprises in the art of the present disclosure is not disclosed in the conventional techniques or . 说明书和实施例仅被视为示例性的,本发明的真正范围和精神由下面的权利要求指出。 The specification and examples be considered as exemplary only, with a true scope and spirit of the invention indicated by the following claims claim.

[0217] 应当理解的是,本发明并不局限于上面已经描述并在附图中示出的精确结构,并且可以在不脱离其范围进行各种修改和改变。 [0217] It should be appreciated that the present invention is not limited to the above has been described and illustrated in the drawings precise structure, and may be carried out without departing from the scope of the various modifications and changes. 本发明的范围仅由所附的权利要求来限制。 Scope of the invention be limited only by the appended claims.

Claims (9)

1. 一种多媒体播放方法,其特征在于,包括: 获取多媒体的暂停位置之前第一预设时长的音频数据;或者,获取多媒体的暂停位置之前第一预设时长的音频数据和字幕数据; 根据所述音频数据确定完整语句的语句起始位置;或者,根据所述首频数据和字幕数据确定完整语句的语句起始位置; 当检测到继续播放所述多媒体的指令或满足继续播放所述多媒体的条件时,根据所述语句起始位置继续播放所述多媒体; 所述根据所述音频数据确定完整语句的语句起始位置,包括: 检测所述音频数据中相邻两个音频信号之间的时间间隔; 当相邻两个音频信号之间的时间间隔大于第一预设间隔时长时,确定所述相邻两个音频信号之间的任一时间位置为所述语句起始位置,或者; 所述根据所述音频数据和字幕数据确定完整语句的语句起始位置,包括: 检测所述音 1. A multimedia playback method, characterized by comprising: display position before obtaining the first preset pause duration of audio data; or, before obtaining a first predetermined pause position display duration of subtitle data and audio data; The the audio data to determine the complete statement sentence initial position; or, a complete sentence is determined based on said first video data and subtitle data sentence initial position; continue playing the multimedia player when it is detected to continue to meet the multimedia instructions or when the condition continues to play according to the statement of the multimedia initial position; determining the starting position of a complete statement sentence based on the audio data, comprising: detecting the audio signal of the audio data between two adjacent time interval; time duration when the interval between two adjacent intervals is greater than a first predetermined audio signal, determining a position adjacent any one time between the two audio signals to the statement starting position, or; the complete sentence is determined based on the subtitle data and audio data sentence initial position, comprising: detecting a tone 频数据中每个音频信号的播放时间; 当相邻两个音频信号之间的时间间隔大于第一预设间隔时长时,获取所述相邻音频信号对应的字幕的起始显不时间和/或终止显不时间; 根据所述相邻两个音频信号的播放时间及所述相邻音频信号对应的字幕的起始显示时间和/或终止显示时间确定所述语句起始位置; 所述根据所述音频数据确定完整语句的语句起始位置,还包括: 根据人声频率对所述音频数据进行过滤,得到人声音频数据; 检测所述人声音频数据中相邻两个人声音频信号之间的时间间隔; 当相邻两个人声音频信号之间的时间间隔大于所述第一预设间隔时长时,确定所述相邻两个人声音频信号间之间的任一时间位置为所述语句起始位置。 Play time of each audio data in the audio signal; duration when the time interval between two adjacent intervals is greater than a first predetermined audio signals, acquires the audio signal corresponding to the adjacent starting time and does not significantly subtitle / Stopping of time or not; in accordance with the audio signal corresponding to two adjacent playing time of the audio signal and the adjacent subtitle display starting time and / or terminate the display time determining sentence initial position; according to the the complete statement of the audio data to determine the starting position of the statement, further comprising: filtering the audio data according to frequency of the human voice, the human voice audio data obtained; two singing voice audio signal of the audio data adjacent to the detected the time interval between; when two adjacent time interval between the sound of the audio signal is greater than said first predetermined length of interval, determining that the adjacent position between any one time between the two audio signals to the vocal statement to the starting position.
2. 根据权利要求1所述的方法,其特征在于, 当根据所述音频数据和/或字幕数据确定出至少两个完整语句的语句起始位置时,所述根据所述语句起始位置继续播放所述多媒体,包括: 从距离所述暂停位置最近的语句起始位置继续播放所述多媒体;或者当预设的回退语句数量为N时,从所述暂停位置之前的第N个语句起始位置继续播放所述多媒体,所述N为大于或等于2的整数。 2. The method according to claim 1, wherein, when it is determined that the start position of the at least two complete statement statements based on the audio data and / or subtitle data, the initial position according to the statement continued the multimedia player, comprising: a distance from the location of the nearest pause statement starting position to continue playing the multimedia; or when a preset number of backoff statement N, prior to the pause position from the N-th sentence from continue playing the multimedia starting position, wherein N is an integer greater than or equal to 2.
3.根据权利要求1所述的方法,其特征在于,当根据所述多媒体的暂停位置之前第一预设时长内的音频数据和/或字幕数据无法确定完整语句的语句起始位置时,所述方法还包括: 按照时间从后往前的顺序,获取第一预设时长的音频数据和/或字幕数据,其中,本次获取的第一预设时长的音频数据和/或字幕数据的播放时间在上一次获取的第一预设时长的音频数据和/或字幕数据的播放时间之前; 从本次获得的该第一预设时长的音频数据和/或字幕数据中确定完整语句的语句起始位置; 若从本次获得的该第一预设时长的音频数据和/或字幕数据中无法确定完整语句的语句起始位置,则按照时间从后往前的顺序继续向前获取第一预设时长的音频数据和/或字幕数据并确定完整语句的语句起始位置,直到确定出至少一个完整语句的语句起始位置。 3. The method according to claim 1, wherein, when the audio data within a first predetermined duration and / or subtitle data can not determine the starting position of a complete statement statement before the pause position of the multimedia according to the said method further comprising: in order from the back of the time, acquiring the first predetermined length of audio data and / or subtitle data, wherein the play, this first preset duration acquired audio data and / or subtitle data before playing time at a time of obtaining the first preset duration of audio data and / or subtitle data; determining complete statement statement from the first predetermined duration of the audio data currently obtained and / or subtitle data from start position; if a complete statement can not be determined from the first predetermined duration of the audio data currently obtained and / or subtitle data sentence initial position, the first pre continue receiving forward chronological order from the back of the long audio data and / or subtitle data set and determining the starting position of a complete statement statements, statements starting position until it is determined that at least one complete statement.
4. 根据权利要求1所述的方法,其特征在于,所述获取多媒体的暂停位置乙則弟一预取时长内的音频数据和/或字幕数据,包括: 获取多媒体的暂停位置之前的、且与所述暂停位置间隔第二预设时长的时间位置; 获取所述时间位置之前第一预设时长内的音频数据和/或字幕数据; 所述根据所述音频数据和/或字幕数据确定完整语句的语句起始位置,包括: 根据所述时间位置之前第一预设时长内的音频数据和/或字幕数据,确定完整语句的语句起始位置。 4. The method according to claim 1, wherein said obtaining multimedia audio data in the pause position B brother a prefetch length and / or subtitle data, comprising: obtaining prior to the pause position of the multimedia, and long pause when the second predetermined time position spaced locations; obtaining audio data prior to the time position within the first predetermined duration and / or subtitle data; said data according to the audio and / or subtitle data to determine the complete statement sentence initial position, comprising: an audio data within a first predetermined time duration before the position and / or subtitle data, determining if a statement complete sentence starting position.
5. —种多媒体播放装置,其特征在于,包括: 获取模块,用于获取多媒体的暂停位置之前第一预设时长的音频数据;或者,获取多媒体的暂停位置之前第一预设时长的音频数据和字幕数据; 、 分析模块,用于根据所述获取模块获取的音频数据确定完整语句的语句起始位置;或者,根据所述音频数据和字幕数据确定完整语句的语句起始位置; 播放模块,用于当检测到继续播放所述多媒体的指令或满足继续播放所述多媒体的条件时,根据所述分析模块确定的语句起始位置继续播放所述多媒体; 过滤模块,用于根据人声频率对所述获取模块获取的所述音频数据进行过滤,得到人声音频数据; I 检测模块,用于检测所述过滤单元过滤后的所述人声音频数据中相邻两个人声音频信号之间的时间间隔; 所述分析模块包括: 检测单元,用于检测所述获取模块获 5. - kind of multimedia playing apparatus, characterized by comprising: obtaining means for obtaining prior to the pause position of the first predetermined duration of multimedia audio data; Alternatively, the pause position before obtaining the first preset duration of the multimedia audio data and subtitle data; analysis module, a module for acquiring audio data is determined according to the statement for a complete sentence initial position; or, a complete sentence is determined according to statement start position of the audio data and caption data; playing module, when detecting a resume playback of the multimedia instructions or continue to play the multimedia condition is satisfied, according to the statement continues to play the analysis module determines the starting position of the multimedia; filtration module for voice frequency the acquiring module acquires the audio data is filtered to obtain voice audio data; the I detection means for detecting the filtered audio signals between the two singing voice audio data in the adjacent filtration unit time interval; the analysis module comprises: a detection means for detecting the obtaining module is eligible 的所述音频数据中相邻两个音频信号之间的时间间隔; 分析确定单元,用于当所述检测单元检测到的相邻两个音频信号之间的时间间隔大于第一预设间隔时长时,确定所述相邻两个音频信号之间的任一时间位置为所述语句起始位置; 或者,所述分析模块包括: 检测单元,用于检测所述获取模块获取的所述音频数据中每个音频信号的播放时间; 获取单元,用于当所述检测单元检测的相邻两个音频信号之间的时间间隔大于第一预设间隔时长时,获取所述相邻音频信号对应的字幕的起始显示时间和/或终止显示时间;分析确定单元,用于根据所述获取单元获取的所述相邻两个音频信号的播放时间及所述相邻音频信号对应的字幕的起始显示时间和/或终止显示时间确定所述语句起始位置。 The time between the two audio signals in the audio data adjacent to the spacer; Analysis determining means for detecting when the length of time between two adjacent unit detects the audio signal is greater than a first predetermined spacing interval when determining the position of a time between any adjacent two of said audio signal is a sentence initial position; Alternatively, the analysis module comprises: a detection means for detecting the acquisition of the audio data acquisition module play time of each audio signal; obtaining unit, the time between when the detection unit detects an interval of adjacent two of the audio signal is greater than a first predetermined time interval length, obtaining an audio signal corresponding to the adjacent subtitle display starting time and / or termination of display time; start analyzing unit determining, based on said acquiring unit acquires caption said adjacent playback time of two audio signals corresponding to the audio signal and the adjacent display time and / or terminating the statements display time determining the starting position.
6. 根据权利要求5中所述的装置,其特征在于,所述播放模块,用于当所述分析模块确定出至少两个完整语句的语句起始位置时,从距离所述暂停位置最近的语句起始位置继续播放所述多媒体;或者当预设的回退语句数量为N时,从所述暂停位置之前的第N个语句起始位置继续播放所述多媒体,所述N为大于或等于2的整数。 6. The apparatus according to claim 5, characterized in that, the playback module, the analysis module for determining, when at least two complete statement sentence initial position, the distance from the nearest pause position statement starting position to continue playing the multimedia; or when a preset number of backoff statement N, continue playing the media from the previous N-th position of the pause statement starting position, wherein N is greater than or equal to integer.
7. 根据权利要求5中所述的装置,其特征在于, 所述获取模块,用于当所述分析模块根据所述多媒体的暂停位置之前第一预设时长内的音频数据和/或字幕数据无法确定完整语句的语句起始位置时,按照时间从后往前的顺序,获取第一预设时长的音频数据和/或字幕数据,其中,本次获取的第一预设时长的音频数据和/或字幕数据的播放时间在上一次获取的第一预设时长的音频数据和/或字幕数据的播放时间之前; 所述分析模块,用于从所述获取模块本次获得的该第一预设时长的音频数据和/或字幕数据中确定完整语句的语句起始位置;若从本次获得的该第一预设时长的音频数据和/ 或字幕数据中无法确定完整语句的语句起始位置,则按照时间从后往前的顺序继续向前获取第一预设时长的音频数据和/或字幕数据并确定完整语句的语句起始位置,直到确定出 7. The apparatus according to claim 5, characterized in that the acquisition module, the analysis module configured to, when the audio data within a first predetermined duration and / or subtitle data before the pause position in accordance with the multimedia statement can not be determined in the starting position of a complete statement, in chronological order from the back, access to the first predetermined duration of audio data and / or subtitle data, wherein the first predetermined duration of the audio data acquired this time and playback time and / or subtitle data acquired at a first predetermined time before the data length of the audio and / or subtitle data, playback time; the analysis module, for acquiring from the first pre-module of the currently obtained setting a duration of audio data and / or subtitle data, determines the complete statement sentence initial position; if a complete statement can not be determined from the first predetermined duration of the audio data currently obtained and / or subtitle data sentence initial position , continue obtaining the starting position of a first predetermined length statement audio data and / or subtitle data and determine the complete statement forwardly from the back in order of time, until it is determined that 少一个完整语句的语句起始位置。 At least one complete sentence starting position statement.
8. 根据权利要求5中所述的装置,其特征在于, 所述获取模块,用于获取多媒体的暂停位置之前的、且与所述暂停位置间隔第二预设时长的时间位置;获取所述时间位置之前第一预设时长内的音频数据和/或字幕数据; 所述分析模块,用于根据所述时间位置之前第一预设时长内的音频数据和/或字幕数据,确定完整语句的语句起始位置。 8. The apparatus according to claim 5, characterized in that said acquisition module for acquiring the pause position prior to the multimedia, and long when the second predetermined time interval position and the inactive position; obtaining the a first position before the preset time duration within the audio data and / or subtitle data; the analysis module, according to a first predetermined position prior to the time length of the audio data and / or subtitle data, to determine the complete statement statement to the starting position.
9. 一种多媒体播放装置,其特征在于,包括: 处理器; 用于存储处理器可执行指令的存储器; 其中,所述处理器被配置为: 获取多媒体的暂停位置之前第一预设时长的音频数据;或者,获取多媒体的暂停位置之前第一预设时长的音频数据和字幕数据; 根据所述音频数据确定完整语句的语句起始位置;或者,根据所述音频数据和字幕数据确定完整语句的语句起始位置; 当检测到继续播放所述多媒体的指令或满足继续播放所述多媒体的条件时,根据所述语句起始位置继续播放所述多媒体; 所述根据所述音频数据确定完整语句的语句起始位置,包括: 检测所述音频数据中相邻两个音频信号之间的时间间隔; 当相邻两个音频信号之间的时间间隔大于第一预设间隔时长时,确定所述相邻两个音频信号之间的任一时间位置为所述语句起始位置,或者; 所述 A multimedia playing apparatus comprising: a processor; a memory storing processor-executable instructions; and wherein the processor is configured to: obtain a first predetermined length of time before the pause position of the multimedia audio data; or, before obtaining a first multimedia pause position preset length of audio data and subtitle data; determining a complete sentence according to the audio data sentence initial position; or, a complete sentence is determined based on the subtitle data and audio data the sentence starting position; when the multimedia playback instruction is detected to continue or to meet the conditions to continue playing the multimedia resume playback start position in accordance with the statement of the multimedia; determining the complete sentence according to the audio data the sentence starting position, comprising: detecting the audio data in the time interval between two adjacent audio signals; time duration when the interval between two adjacent intervals is greater than a first predetermined audio signal, determining the at any one time position between adjacent two of said audio signal is a sentence initial position, or; a 据所述音频数据和字幕数据确定完整语句的语句起始位置,包括: 检测所述音频数据中每个音频信号的播放时间; 当相邻两个音频信号之间的时间间隔大于第一预设间隔时长时,获取所述相邻音频信号对应的字幕的起始显示时间和/或终止显示时间; 根据所述相邻两个音频信号的播放时间及所述相邻音频信号对应的字幕的起始显示时间和/或终止显示时间确定所述语句起始位置; 所述根据所述音频数据确定完整语句的语句起始位置,还包*括: 根据人声频率对所述音频数据进行过滤,得到人声音频数据; 检测所述人声音频数据中相邻两个人声音频信号之间的时间间隔; 当相邻两个人声音频信号之间的时间间隔大于所述第一预设间隔时长时,确定所述相邻两个人声音频信号间之间的任一时间位置为所述语句起始位置。 According to the audio data and subtitle data to determine the complete statement sentence initial position, comprising: detecting in the audio data playback time of each audio signal; when the time interval between two adjacent intervals is greater than a first predetermined audio signal long, adjacent to acquire an audio signal corresponding to the interval time of starting subtitle display and / or termination of display time; according to the play time from the two audio signals and an audio signal corresponding to the caption adjacent to said adjacent display start time and / or terminate the display time determining the starting position of the statement; statement determining the starting position of a complete sentence according to the audio data, * further comprising: filtering the audio data according to the frequency of the human voice, vocal audio data obtained; detecting the vocal audio data in the time interval between two adjacent vocal audio signal; when two adjacent time interval between the sound of the audio signal is greater than said first predetermined length of interval determining the position adjacent any one time between the vocal audio signal between two statements is the starting position.
CN201410250800.9A 2014-06-06 2014-06-06 Method and apparatus for multimedia playback CN104038827B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201410250800.9A CN104038827B (en) 2014-06-06 2014-06-06 Method and apparatus for multimedia playback

Applications Claiming Priority (10)

Application Number Priority Date Filing Date Title
CN201410250800.9A CN104038827B (en) 2014-06-06 2014-06-06 Method and apparatus for multimedia playback
RU2015105625/08A RU2605361C2 (en) 2014-06-06 2014-11-20 Multimedia playing method and device
KR1020157001317A KR101657913B1 (en) 2014-06-06 2014-11-20 Method, apparatus, program, and recording medium for multimedia playing
BR112015003350A BR112015003350A2 (en) 2014-06-06 2014-11-20 method and multimedia playback device
PCT/CN2014/091757 WO2015184738A1 (en) 2014-06-06 2014-11-20 Multimedia playing method and device
JP2016524682A JP2016525765A (en) 2014-06-06 2014-11-20 Multimedia reproducing method, apparatus, program, and recording medium
MX2015002051A MX352076B (en) 2014-06-06 2014-11-20 Multimedia playing method and device.
US14/620,508 US9589596B2 (en) 2014-06-06 2015-02-12 Method and device of playing multimedia and medium
EP15170892.2A EP2953133A1 (en) 2014-06-06 2015-06-05 Method and device of playing multimedia
US15/411,765 US9786326B2 (en) 2014-06-06 2017-01-20 Method and device of playing multimedia and medium

Publications (2)

Publication Number Publication Date
CN104038827A CN104038827A (en) 2014-09-10
CN104038827B true CN104038827B (en) 2018-02-02

Family

ID=51469394

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201410250800.9A CN104038827B (en) 2014-06-06 2014-06-06 Method and apparatus for multimedia playback

Country Status (9)

Country Link
US (2) US9589596B2 (en)
EP (1) EP2953133A1 (en)
JP (1) JP2016525765A (en)
KR (1) KR101657913B1 (en)
CN (1) CN104038827B (en)
BR (1) BR112015003350A2 (en)
MX (1) MX352076B (en)
RU (1) RU2605361C2 (en)
WO (1) WO2015184738A1 (en)

Families Citing this family (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104038827B (en) 2014-06-06 2018-02-02 小米科技有限责任公司 Method and apparatus for multimedia playback
CN107181986A (en) * 2016-03-11 2017-09-19 百度在线网络技术(北京)有限公司 Video and subtitle matching method and apparatus
CN105959829A (en) * 2016-06-24 2016-09-21 封雷迅 Video playing method and tool for sentence-by-sentence rereading
CN106373598B (en) * 2016-08-23 2018-11-13 珠海市魅族科技有限公司 Audio reproduction control method and apparatus
WO2018080447A1 (en) * 2016-10-25 2018-05-03 Rovi Guides, Inc. Systems and methods for resuming a media asset
WO2018080445A1 (en) * 2016-10-25 2018-05-03 Rovi Guides, Inc. Systems and methods for resuming a media asset
WO2019084181A1 (en) * 2017-10-26 2019-05-02 Rovi Guides, Inc. Systems and methods for recommending a pause position and for resuming playback of media content

Family Cites Families (31)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH08275205A (en) * 1995-04-03 1996-10-18 Sony Corp Method and device for data coding/decoding and coded data recording medium
AU5697800A (en) * 1999-07-06 2001-01-22 At & T Laboratories Cambridge Limited A thin multimedia communication device and method
JP3754269B2 (en) * 2000-04-18 2006-03-08 三洋電機株式会社 The video signal reproducing apparatus
US20090282444A1 (en) * 2001-12-04 2009-11-12 Vixs Systems, Inc. System and method for managing the presentation of video
KR100456441B1 (en) * 2002-01-18 2004-11-09 주식회사 휴맥스 Method and Apparatus for Reproducing Past Images for Use in a Medium of Storage
JP2003307997A (en) * 2002-04-15 2003-10-31 Sony Corp Language education system, voice data processor, voice data processing method, voice data processing program, and recording medium
EP1551027A4 (en) * 2002-09-12 2009-08-05 Panasonic Corp Recording medium, reproduction device, program, reproduction method, and recording method
JP2004157457A (en) * 2002-11-08 2004-06-03 Nissan Motor Co Ltd Speech presentation system
TW200537941A (en) 2004-01-26 2005-11-16 Koninkl Philips Electronics Nv Replay of media stream from a prior change location
JP4247626B2 (en) * 2005-01-20 2009-04-02 ソニー株式会社 Playback apparatus and method
JP2006208866A (en) * 2005-01-28 2006-08-10 Sun Corp Reproducing device
JP4622728B2 (en) * 2005-08-03 2011-02-02 カシオ計算機株式会社 Sound reproducing apparatus and audio playback processing program
CN1956504A (en) * 2005-10-26 2007-05-02 其乐达科技股份有限公司 Subtitling method of video-audio playing system
US8731914B2 (en) * 2005-11-15 2014-05-20 Nokia Corporation System and method for winding audio content using a voice activity detection algorithm
US9411781B2 (en) * 2006-01-18 2016-08-09 Adobe Systems Incorporated Rule-based structural expression of text and formatting attributes in documents
JP2007235543A (en) * 2006-03-01 2007-09-13 Funai Electric Co Ltd Optical disk drive
CN101438348B (en) * 2006-05-08 2011-12-07 汤姆逊许可证公司 Across a method for content reproduction apparatus to restore
EP2095363A4 (en) * 2006-11-22 2011-07-20 Multimodal Technologies Inc Recognition of speech in editable audio streams
JP5026294B2 (en) 2008-01-29 2012-09-12 京セラ株式会社 Content reproducing apparatus
CN101588470B (en) * 2008-05-20 2013-05-29 深圳市同洲电子股份有限公司 Time shifting suspension method, time shifting suspension system and time shifting suspension equipment of IP-QAM video-on-demand system
US8737806B2 (en) 2008-11-13 2014-05-27 Mitsubishi Electric Corporation Reproduction device and reproduction method
ES2537073T3 (en) * 2008-11-18 2015-06-02 Panasonic Corporation Playback device, reproduction method and program for stereoscopic playback
CN101963968A (en) * 2009-07-24 2011-02-02 艾比尔国际多媒体有限公司 Multimedia identification system and method as well as applied multimedia customization method thereof
US8755921B2 (en) * 2010-06-03 2014-06-17 Google Inc. Continuous audio interaction with interruptive audio
JP2012004722A (en) * 2010-06-15 2012-01-05 Panasonic Corp Content reproduction device, content reproduction method, and content reproduction program
US9355683B2 (en) * 2010-07-30 2016-05-31 Samsung Electronics Co., Ltd. Audio playing method and apparatus
US20130103770A1 (en) * 2011-10-25 2013-04-25 Microsoft Corporation Distributed semi-synchronized event driven playback of multimedia
KR101830656B1 (en) * 2011-12-02 2018-02-21 엘지전자 주식회사 Mobile terminal and control method for the same
US20140253702A1 (en) * 2013-03-10 2014-09-11 OrCam Technologies, Ltd. Apparatus and method for executing system commands based on captured image data
US9462032B2 (en) * 2013-07-24 2016-10-04 Google Inc. Streaming media content
CN104038827B (en) * 2014-06-06 2018-02-02 小米科技有限责任公司 Method and apparatus for multimedia playback

Also Published As

Publication number Publication date
EP2953133A1 (en) 2015-12-09
US9786326B2 (en) 2017-10-10
JP2016525765A (en) 2016-08-25
MX352076B (en) 2017-11-08
US20150356997A1 (en) 2015-12-10
BR112015003350A2 (en) 2017-07-04
RU2605361C2 (en) 2016-12-20
US20170133060A1 (en) 2017-05-11
KR20160003619A (en) 2016-01-11
RU2015105625A (en) 2016-09-10
WO2015184738A1 (en) 2015-12-10
MX2015002051A (en) 2016-10-28
KR101657913B1 (en) 2016-09-19
CN104038827A (en) 2014-09-10
US9589596B2 (en) 2017-03-07

Similar Documents

Publication Publication Date Title
US20070260634A1 (en) Apparatus, system, method, and computer program product for synchronizing the presentation of media content
JP2015064896A (en) Image processing method for mobile terminal
CN104238759A (en) Method and device for controlling terminal through physical keys
CN103576834B (en) And power-saving control method of the power saving control method of an electronic support apparatus
CN104182173A (en) Camera switching method and device
RU2653355C2 (en) Volume adjustment method and apparatus and terminal
CN104092936B (en) AF method and apparatus
CN104050035B (en) Method and apparatus for processing an application
CN104991789B (en) Method and apparatus for the application opening
CN104934048A (en) Sound effect regulation method and device
CN103916711A (en) Method and device for playing video signals
CN103885588B (en) Method and apparatus for automatic switching
US20170125035A1 (en) Controlling smart device by voice
CN104699248A (en) Electronic equipment, device and method for control of audio play
CN104598111B (en) A method and apparatus for switching the display mode of
CN104469437B (en) Method and apparatus for advertising push
CN105791958A (en) Method and device for live broadcasting game
CN105898364A (en) Video playing processing method, device, terminal and system
CN105895115A (en) Squeal determining method and squeal determining device
CN104021148A (en) Method and device for adjusting sound effect
US20170344192A1 (en) Method and device for playing live videos
WO2017092247A1 (en) Method, apparatus and system for playing multimedia data
CN104867506B (en) Method and apparatus for automatic control of the music
CN104318934A (en) Method, terminal, wearable device and play device for closing multimedia file
CN105224349A (en) Application deletion prompting method and apparatus

Legal Events

Date Code Title Description
C06 Publication
C10 Entry into substantive examination
GR01