WO2014161282A1 - 视频文件播放进度的调整方法及装置 - Google Patents

视频文件播放进度的调整方法及装置 Download PDF

Info

Publication number
WO2014161282A1
WO2014161282A1 PCT/CN2013/084520 CN2013084520W WO2014161282A1 WO 2014161282 A1 WO2014161282 A1 WO 2014161282A1 CN 2013084520 W CN2013084520 W CN 2013084520W WO 2014161282 A1 WO2014161282 A1 WO 2014161282A1
Authority
WO
WIPO (PCT)
Prior art keywords
subtitle
file
text information
video file
content
Prior art date
Application number
PCT/CN2013/084520
Other languages
English (en)
French (fr)
Inventor
周鹏
Original Assignee
中兴通讯股份有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 中兴通讯股份有限公司 filed Critical 中兴通讯股份有限公司
Priority to EP13880926.4A priority Critical patent/EP2978232A4/en
Priority to US14/890,186 priority patent/US9799375B2/en
Publication of WO2014161282A1 publication Critical patent/WO2014161282A1/zh

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/439Processing of audio elementary streams
    • H04N21/4394Processing of audio elementary streams involving operations for analysing the audio stream, e.g. detecting features or characteristics in audio streams
    • GPHYSICS
    • G11INFORMATION STORAGE
    • G11BINFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
    • G11B27/00Editing; Indexing; Addressing; Timing or synchronising; Monitoring; Measuring tape travel
    • G11B27/10Indexing; Addressing; Timing or synchronising; Measuring tape travel
    • G11B27/19Indexing; Addressing; Timing or synchronising; Measuring tape travel by using information detectable on the record carrier
    • G11B27/28Indexing; Addressing; Timing or synchronising; Measuring tape travel by using information detectable on the record carrier by using information signals recorded by the same method as the main recording
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/70Information retrieval; Database structures therefor; File system structures therefor of video data
    • G06F16/73Querying
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/70Information retrieval; Database structures therefor; File system structures therefor of video data
    • G06F16/78Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
    • G06F16/783Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content
    • G06F16/7844Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content using original textual content or text extracted from visual content or transcript of audio data
    • GPHYSICS
    • G11INFORMATION STORAGE
    • G11BINFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
    • G11B27/00Editing; Indexing; Addressing; Timing or synchronising; Monitoring; Measuring tape travel
    • G11B27/10Indexing; Addressing; Timing or synchronising; Measuring tape travel
    • G11B27/102Programmed access in sequence to addressed parts of tracks of operating record carriers
    • G11B27/105Programmed access in sequence to addressed parts of tracks of operating record carriers of operating discs
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/44Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream, rendering scenes according to MPEG-4 scene graphs
    • H04N21/4402Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream, rendering scenes according to MPEG-4 scene graphs involving reformatting operations of video signals for household redistribution, storage or real-time display
    • H04N21/440236Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream, rendering scenes according to MPEG-4 scene graphs involving reformatting operations of video signals for household redistribution, storage or real-time display by media transcoding, e.g. video is transformed into a slideshow of still pictures, audio is converted into text
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/47End-user applications
    • H04N21/472End-user interface for requesting content, additional data or services; End-user interface for interacting with content, e.g. for content reservation or setting reminders, for requesting event notification, for manipulating displayed content
    • H04N21/47217End-user interface for requesting content, additional data or services; End-user interface for interacting with content, e.g. for content reservation or setting reminders, for requesting event notification, for manipulating displayed content for controlling playback functions for recorded or on-demand content, e.g. using progress bars, mode or play-point indicators or bookmarks
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/47End-user applications
    • H04N21/482End-user interface for program selection
    • H04N21/4828End-user interface for program selection for searching program descriptors
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/47End-user applications
    • H04N21/488Data services, e.g. news ticker
    • H04N21/4884Data services, e.g. news ticker for displaying subtitles
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/26Speech to text systems

Definitions

  • the present invention relates to the field of communications, and in particular to a method and apparatus for adjusting a video file playback schedule.
  • the progress bar is usually provided with a slider, which will advance during the video playback, and the position of the slider in the progress bar corresponds to the progress of the video playback.
  • the user can also use the finger to drag the slider back and forth.
  • the video content displayed on the screen and the current elapsed time will be updated accordingly, and the user can view the video content or currently play during the process of dragging the slider.
  • Time to locate the playback position of interest is not convenient. This positioning method is only suitable for the user to locate according to the playing time, and is not suitable in other scenes. For example: The user has seen the currently playing video before, and has an impression on a certain picture or a certain sentence spoken by a certain character.
  • the present invention provides a method and apparatus for adjusting a video file playback progress, so as to at least solve the problem that a mobile terminal user searches for a specific segment in a video that has been viewed in the related art, and the accuracy of the playback progress of the specific segment is determined. Poor question.
  • a method of adjusting a progress of playing a video file comprises: receiving text information to be searched; searching for subtitle content matching the text information in the subtitle file of the video file, wherein the subtitle file is from the video file The obtained or generated according to the video file; determining the playing time corresponding to the subtitle content according to the found subtitle content, and adjusting the playing progress of the video file according to the playing time.
  • an apparatus for adjusting a progress of a video file playback is provided.
  • the apparatus for adjusting the playback progress of the video file of the present invention comprises: a receiving module configured to receive text information to be searched; a searching module configured to search for subtitle content matching the text information in the subtitle file of the video file, wherein the subtitle file Is obtained from the video file or generated according to the video file; the adjustment module is configured to determine a play time corresponding to the subtitle content according to the found subtitle content, and adjust the play progress of the video file according to the play time.
  • the foregoing technical solution has the following beneficial effects: receiving text information to be searched; searching for subtitle content matching the text information in the subtitle file of the video file, the subtitle file being obtained from the video file or generated according to the video file; Determining a play time corresponding to the subtitle content according to the found subtitle content, and adjusting a play progress of the video file according to the play time, whereby the mobile terminal can determine the text information to be searched by the user, and then the text information and the video file The subtitle file in the matching is matched. If the matching subtitle content can be found, the playing time corresponding to the matching subtitle content is obtained, thereby accurately adjusting the playing progress of the video file according to the playing time, and solving the related art that the mobile terminal user has watched the video.
  • FIG. 2 is a flowchart of a method for adjusting a progress of playing a video file according to a preferred embodiment of the present invention
  • FIG. 4 is a structural block diagram of an apparatus for adjusting a playback progress of a video file according to a preferred embodiment of the present invention
  • FIG. 4 is a block diagram of an intelligent mobile terminal according to a preferred embodiment of the present invention
  • Step S102 Receive text information to be searched
  • Step S104 Search for subtitle content matching the text information in a subtitle file of the video file, where the subtitle file is The video file is generated or generated according to the video file.
  • Step S106 determining a play time corresponding to the subtitle content according to the found subtitle content, and adjusting a play progress of the video file according to the play time.
  • Step S1 determining whether a subtitle file exists in the video file
  • Step S2 If not, according to the audio data in the video file Generate a subtitle file.
  • the video file has been loaded with a subtitle file in a preset format (for example, srt format)
  • the subtitle file corresponding to the video file may be directly obtained from the local video file, and may also be from the network.
  • the website that provides the subtitles downloads the corresponding subtitle file; if the corresponding subtitle file is missing in the video file, the corresponding subtitle file can be generated by collecting the audio data in the video file.
  • the subtitle file may be a text file, wherein each subtitle information in the video file is described, and each subtitle information may include: a subtitle number, a start time, and a subtitle content.
  • Subtitle files can be in multiple formats, for example: One of the subtitle files is in the srt format, and the file names of such subtitle files are usually suffixed with .srt.
  • each subtitle is as follows - subtitle serial number start time ⁇ end time subtitle text (one or more lines) blank line subtitle numbers are generally numbered starting from 1, and the time format used is "hour: minute: second, millisecond" .
  • step S2 generating the subtitle file according to the audio data may include the following operations: Step S21: performing decoding processing on the audio data in the video file; Step S22: converting the decoded audio data into a subtitle file.
  • the audio data in the video file can be decoded using an audio and video decoder of the mobile terminal and then converted to text in a particular language (e.g., Chinese or English).
  • Step S3 determining whether the language used by the subtitle file is consistent with the language used by the text information
  • Step S4 If not, The language used for the subtitle file is translated into the language used for the text information, or the subtitle file is regenerated according to the language used for the text information.
  • the mobile terminal has determined the file information to be searched by the user and has acquired or has generated a subtitle file corresponding to the video file, if it is desired to match the two, it is necessary to ensure that both are used. The voice is consistent.
  • the video player can allow the user to specify the subtitle file.
  • the subtitle file is not loaded in the video file or the subtitle file it loads is used in a different language than the language used by the user. At this time, it is necessary to translate the language used by the subtitle file into the language used for the text information or regenerate the subtitle file in the language used for the text information.
  • determining a play time corresponding to the subtitle content according to the found subtitle content, and adjusting the play progress according to the play time may include the following processing steps: Step S5: determining the subtitle content according to the found subtitle content Step S6: acquiring a playing time period corresponding to the found subtitle content according to the subtitle serial number, and determining a starting playing time corresponding to the found subtitle content in the playing time period; Step S7: according to the playing time Adjust the playback progress.
  • the subtitle file may have multiple formats
  • the subtitle files of the various formats include multi-segment subtitle information
  • each subtitle information may include: a subtitle serial number, a play time period, and subtitle content.
  • the subtitle serial number of the subtitle content may be further determined, and the playing time period corresponding to the subtitle content may be further determined according to the subtitle serial number (including : Start time and end time), thereby determining the start play time of the subtitle content of the segment, and adjusting the play progress according to the play time.
  • the text information to be searched is "At the left we can see"
  • the subtitle information matching the text information is actually found in the subtitle file, specifically as follows:
  • determining text information may include, but is not limited to, one of the following modes: mode one, receiving input text information; mode two, receiving voice data, and converting the voice data into text information.
  • the mobile terminal user can input the text information to be searched in the search dialog box of the video player through the keyboard or the touch screen, or input the text information to be searched into the mobile terminal by using a microphone.
  • the above preferred implementation process will be further described below in conjunction with the preferred embodiment shown in FIG. 2.
  • 2 is a flow chart of a method of adjusting a playback progress of a video file in accordance with a preferred embodiment of the present invention. As shown in FIG. 2, the process may include the following processing steps: Step S202: The user opens the video player software on the mobile terminal and selects to play a specific video file. Step S204: The user searches for a subtitle file corresponding to the video file.
  • the subtitle file can be specified in the video player; if yes, go to step S208; if no, continue to step S206; step S206: if not, the audio and video decoder of the mobile terminal can be used in the video file
  • the audio data is decoded, and then converted into a text format of a specific language (for example: Chinese or English) and simultaneously recorded time information, that is, a new subtitle file is generated;
  • FIG. 3 is a structural block diagram of an apparatus for adjusting a progress of a video file playback according to an embodiment of the present invention.
  • the apparatus for adjusting the playback progress of the video file may include: a receiving module 10 configured to receive text information to be searched; and a searching module 20 configured to search for a text information in the subtitle file of the video file.
  • a subtitle content wherein the subtitle file is obtained from the video file or generated according to the video file;
  • the adjustment module 30 is configured to determine a play time corresponding to the subtitle content according to the found subtitle content, and adjust the video file according to the play time. Play progress.
  • the apparatus may further include: a first determining module 40, configured to determine whether a subtitle file exists in the video file; and the first processing module 50 is configured to: when the output of the first determining module is negative, A subtitle file is generated based on the audio data in the video file.
  • a first determining module 40 configured to determine whether a subtitle file exists in the video file
  • the first processing module 50 is configured to: when the output of the first determining module is negative, A subtitle file is generated based on the audio data in the video file.
  • the first processing module 50 may include: a decoding unit 500 configured to perform decoding processing on audio data in a video file; and a converting unit 502 configured to convert the decoded audio data into a subtitle file .
  • the apparatus may further include: a second determining module 60 configured to determine whether a language used by the subtitle file is consistent with a language used by the text information; and the second processing module 70 is configured to be When the output of the second judgment module is NO, the language used by the subtitle file is translated into the language used for the text information, or the subtitle file is regenerated according to the language used in the text information.
  • a decoding unit 500 configured to perform decoding processing on audio data in a video file
  • a converting unit 502 configured to convert the decoded audio data into a subtitle file .
  • the apparatus may further include: a second determining module 60 configured to determine whether a language used by the subtitle file is consistent with a language used by the text information; and the second processing module 70 is configured to be When the output of the second judgment module is
  • the adjustment module 30 may include: a first determining unit 300, configured to determine a subtitle number of the subtitle content according to the found subtitle content; and second determining unit 302, configured to acquire and according to the subtitle serial number The playback time period corresponding to the found subtitle content, and determining the initial playing time corresponding to the found subtitle content in the playing time period; the adjusting unit 304 is configured to adjust the playing progress according to the playing time.
  • the determining module 10 may include: a first receiving unit 100 configured to receive input text information; and a second receiving unit 102 configured to receive voice data and convert the voice data into text information.
  • FIG. 5 is a schematic diagram of a software and hardware architecture of an intelligent mobile terminal according to a preferred embodiment of the present invention.
  • the architecture can be divided into three levels, from bottom to top, the hardware layer, the operating system layer, and the application layer.
  • Hardware layer To include: processor, memory, microphone, speaker, and touch display.
  • the operating system layer is responsible for managing hardware devices, providing file systems and function libraries, and the function library may include: a voice recognition module (corresponding to the first processing module and the determining module described above).
  • the function of the speech recognition module is to convert the speech data into corresponding text content.
  • the application layer can include: Multiple applications, such as: video player, calculator.
  • An audio and video decoder can be included in the video player to decode the video file, then play the image on the display and play the sound through the speaker.
  • the technical solution provided by the present invention can add a function module to the video player, and the function can be named as a voice search (corresponding to the above-mentioned search module and adjustment module) in the video image displayed on the display screen.
  • a voice search corresponding to the above-mentioned search module and adjustment module
  • users can choose to use the voice search feature.
  • the user can speak the statement of the desired retrieval in the mind to the intelligent mobile terminal, and then the voice search module can search the corresponding audio data in the video file according to the subtitle content of the statement spoken by the user, thereby finding a matching item. . If a match is found, position the video player's progress bar slider to the appropriate location.
  • the voice search module needs to have the following preconditions to complete the foregoing work: Condition 1.
  • the software system of the smart mobile terminal includes a voice recognition module, which can be provided by an operating system or other application, and the video The player software can use it.
  • the speech recognition module can receive audio input in a preset format (for example: Pulse Code Modulation (PCM)) and convert it into text in a specific language (for example: Chinese or English).
  • PCM Pulse Code Modulation
  • Condition 2 The audio and video decoder in the video player can recognize the format of the video file (for example: MP4, AVI), and decode the audio data in the video file into a format that the speech recognition module can receive.
  • Condition 3 The voice search module can collect the voice data spoken by the user from the microphone through the operating system, and the voice data can be received by the voice recognition module.
  • the subtitle file in the srt format can be used as an example to further describe the positioning of the text information that the user wants to search in the video file.
  • the subtitle file does not constitute a limitation on the present invention.
  • the whole process specifically includes the following steps: Step 1: The audio decoder in the video player is used to decode the audio data in the video file into a format recognizable by the speech recognition module (for example: PCM format) and saved to the audio file. .
  • the second step is to analyze the above audio file and generate a subtitle file. Cycling the voice data in the audio file, you can read the voice data for 1 second each time, and then input the audio data of the 1 second into the voice recognition module.
  • the corresponding subtitle text is generated, and then a subtitle information is generated according to the Srt format and saved to the subtitle file.
  • N-segment subtitles are generated in the subtitle file.
  • the third step is the search and location of the video file.
  • the voice search module collects the voice data spoken by the user from the microphone, and then inputs it into the voice recognition module to generate the corresponding text.
  • the voice search module searches for the generated text in the subtitle file, and if the matched text is searched, the play time corresponding to the matched text can be obtained according to the format of the subtitle file.
  • the technical solution provided by the embodiment of the present invention is used for the user.
  • a new method and device for adjusting the progress of playing a video file is provided.
  • the mobile terminal can determine the text information to be searched by the user, and then match the text information with the subtitle file in the video file, if the matching subtitle can be found.
  • the content is obtained, and the playing time corresponding to the matching subtitle content is obtained, so that the playing progress of the video file is accurately adjusted according to the playing time, and the manner in which the mobile terminal user searches for a specific segment in the already viewed video is more complicated and the specific segment is The accuracy of the playback progress positioning is poor, and thus the accurate positioning of the playback segment desired by the user is realized, and the operation is simple and convenient.
  • the above modules or steps of the embodiments of the present invention can be implemented by a general computing device, which can be concentrated on a single computing device or distributed in multiple computing devices.
  • the computing device may be implemented by program code executable by the computing device, such that they may be stored in the storage device by the computing device and, in some cases, may be different from The steps shown or described are performed sequentially, or they are separately fabricated into individual integrated circuit modules, or a plurality of modules or steps thereof are fabricated into a single integrated circuit module.
  • the invention is not limited to any specific combination of hardware and software.
  • the above is only the preferred embodiment of the present invention, and is not intended to limit the present invention, and various modifications and changes can be made to the present invention. Any modifications, equivalent substitutions, improvements, etc. made within the spirit and scope of the present invention are intended to be included within the scope of the present invention.

Abstract

本发明公开了一种视频文件播放进度的调整方法及装置,在上述方法中,接收待搜索的文本信息;在视频文件的字幕文件中查找与文本信息相匹配的字幕内容,其中,字幕文件是从视频文件中获取的或者根据视频文件生成的;根据查找到的字幕内容确定与该字幕内容对应的播放时间,并按照播放时间调整视频文件的播放进度。根据本发明提供的技术方案,实现了对用户期望回放片段进行准确定位,操作简单方便。

Description

视频文件播放进度的调整方法及装置 技术领域 本发明涉及通信领域, 具体而言,涉及一种视频文件播放进度的调整方法及装置。 背景技术 随着智能移动终端处理能力的不断提高和显示屏幕的不断增大, 视频播放器已经 逐渐成为智能移动终端中普遍安装的应用程序。 目前, 大多数的智能移动终端均配置 有触摸屏, 用户可以通过手指触摸屏幕来操作和控制应用程序。 这种智能移动终端上 的视频播放器在播放视频时, 通常都会在屏幕上显示进度条、 视频的总时间长度以及 当前已播放的时间。 而进度条上又通常设置有滑块, 该滑块会在视频播放的过程中前 进, 并且该滑块在进度条中的位置与视频播放的进度相对应。 当然, 用户也可以使用 手指来回拖动滑块, 此时屏幕上显示的视频内容和当前已播放时间将会进行相应地更 新, 用户在拖动滑块的过程中通过查看视频内容或者当前已播放时间来定位感兴趣的 播放位置。 但上述定位播放内容的方式并不方便, 这种定位方式只适合于用户根据播放时间 进行定位, 而在其它一些场景下并不太适用。 例如: 用户以前看过当前播放的视频, 对某个画面或者某个人物说的某句话有印象, 其可以是在观看教学视频的时候, 记得 老师讲过的某个知识点, 也可以是在观看电影的时候, 记得某句经典台词。 如果用户 想根据上述线索对视频进行搜索定位, 则需要来回反复地拖动进度条上的滑块, 并查 看对应的视频内容或者视频中人物的讲话, 直至能够查找到用户脑海中存有印象的视 频播放位置。 发明内容 本发明提供了一种视频文件播放进度的调整方法及装置, 以至少解决相关技术中 移动终端用户在已经观看视频中查找特定片段的方式较为复杂且对该特定片段的播放 进度定位的准确性较差的问题。 根据本发明的一个方面, 提供了一种视频文件播放进度的调整方法。 本发明的视频文件播放进度的调整方法包括: 接收待搜索的文本信息; 在视频文 件的字幕文件中查找与文本信息相匹配的字幕内容, 其中, 字幕文件是从视频文件中 获取的或者根据视频文件生成的; 根据查找到的字幕内容确定与该字幕内容对应的播 放时间, 并按照播放时间调整视频文件的播放进度。 根据本发明的另一方面, 提供了一种视频文件播放进度的调整装置。 本发明的视频文件播放进度的调整装置包括: 接收模块, 设置为接收待搜索的文 本信息; 查找模块, 设置为在视频文件的字幕文件中查找与文本信息相匹配的字幕内 容, 其中, 字幕文件是从视频文件中获取的或者根据视频文件生成的; 调整模块, 设 置为根据查找到的字幕内容确定与该字幕内容对应的播放时间, 并按照播放时间调整 视频文件的播放进度。 上述技术方案具有如下有益效果: 采用接收待搜索的文本信息; 在视频文件的字 幕文件中查找与文本信息相匹配的字幕内容, 该字幕文件是从视频文件中获取的或者 根据视频文件生成的; 根据查找到的字幕内容确定与该字幕内容对应的播放时间, 并 按照播放时间调整视频文件的播放进度, 由此, 移动终端可以通过确定用户待搜索的 文本信息, 然后将该文本信息与视频文件中的字幕文件进行匹配, 如果能够查找到匹 配的字幕内容, 则获取与匹配字幕内容对应的播放时间, 从而根据播放时间准确调整 视频文件的播放进度, 解决了相关技术中移动终端用户在已经观看视频中查找特定片 段的方式较为复杂且对该特定片段的播放进度定位的准确性较差的问题, 进而实现了 对用户期望回放片段进行准确定位, 操作简单方便。 附图说明 此处所说明的附图用来提供对本发明的进一步理解, 构成本申请的一部分, 本发 明的示意性实施例及其说明用于解释本发明, 并不构成对本发明的不当限定。 在附图 中- 图 1是根据本发明实施例的视频文件播放进度的调整方法的流程图; 图 2是根据本发明优选实施例的视频文件播放进度的调整方法的流程图; 图 3是根据本发明实施例的视频文件播放进度的调整装置的结构框图; 图 4是根据本发明优选实施例的视频文件播放进度的调整装置的结构框图; 图 5是根据本发明优选实施例的智能移动终端的软硬件架构示意图。 具体实施方式 下文中将参考附图并结合实施例来详细说明本发明。 需要说明的是, 在不冲突的 情况下, 本申请中的实施例及实施例中的特征可以相互组合。 图 1是根据本发明实施例的视频文件播放进度的调整方法的流程图。如图 1所示, 该方法可以包括以下处理步骤: 步骤 S102: 接收待搜索的文本信息; 步骤 S104: 在视频文件的字幕文件中查找与文本信息相匹配的字幕内容, 其中, 字幕文件是从视频文件中获取的或者根据视频文件生成的; 步骤 S106: 根据查找到的字幕内容确定与该字幕内容对应的播放时间, 并按照播 放时间调整视频文件的播放进度。 相关技术中, 移动终端用户在已经观看视频中查找特定片段的方式较为复杂且对 该特定片段的播放进度定位的准确性较差。 采用如图 1所示的方法, 接收待搜索的文 本信息; 在视频文件的字幕文件中查找与文本信息相匹配的字幕内容, 该字幕文件是 从视频文件中获取的或者根据视频文件生成的; 根据查找到的字幕内容确定与该字幕 内容对应的播放时间, 并按照播放时间调整视频文件的播放进度, 由此, 移动终端可 以通过确定用户待搜索的文本信息, 然后将该文本信息与视频文件中的字幕文件进行 匹配, 如果能够查找到匹配的字幕内容, 则获取与匹配字幕内容对应的播放时间, 从 而根据播放时间准确调整视频文件的播放进度, 解决了相关技术中移动终端用户在已 经观看视频中查找特定片段的方式较为复杂且对该特定片段的播放进度定位的准确性 较差的问题, 进而实现了对用户期望回放片段进行准确定位, 操作简单方便。 优选地, 在步骤 S104, 查找与文本信息相匹配的字幕内容之前, 还可以包括以下 操作- 步骤 S1 : 判断视频文件中是否存在字幕文件; 步骤 S2: 如果否, 则根据视频文件中的音频数据生成字幕文件。 在优选实施例中, 如果视频文件已经加载了预设格式 (例如: srt格式) 的字幕文 件, 则可以直接从本地的视频文件中获取与该视频文件对应的字幕文件, 当然还可以 从网络中专门提供字幕的网站下载相应的字幕文件; 如果视频文件中缺少对应的字幕 文件, 则可以通过采集视频文件中的音频数据生成相应的字幕文件。 字幕文件可以是一个文本文件, 其中, 描述了视频文件中的各段字幕信息, 而每 一段字幕信息可以包括: 字幕序号、 起始时间以及字幕内容。 字幕文件可以有多种格 式, 例如: 其中一种字幕文件格式为 srt格式, 此类字幕文件的文件名通常以 .srt为后 缀。 每段字幕的格式如下- 字幕序列号 起始时间→终止时间 字幕文本 (一行或多行) 空白行 字幕序号一般从 1开始编号, 其所采用的时间格式是"小时: 分钟: 秒, 毫秒"。 下面是一个格式为 srt的字幕文件的示例:
1
00:00:10,500→00:00:13,000
Elephant's Dream
2 00:00:15,000→00:00:18,000 At the left we can see... 上述字幕文件中包含有两段字幕, 第一段字幕的起始时间是从 10.5秒至 13秒, 其字幕内容为 Elephant's Dream, 第二段字幕的起始时间是从 15秒至 18秒, 其字幕内 容为 At the left we can see...。 优选地, 在步骤 S2中, 根据音频数据生成字幕文件可以包括以下操作: 步骤 S21 : 对视频文件中的音频数据进行解码处理; 步骤 S22: 将解码后的音频数据转换成字幕文件。 在优选实施例中, 可以采用移动终端的音视频解码器对视频文件中的音频数据进 行解码, 然后将其转换成特定语言 (例如: 中文或者英文) 的文本。 优选地, 在步骤 S104, 查找与文本信息相匹配的字幕内容之前, 还可以包括以下 步骤: 步骤 S3 : 判断字幕文件所使用的语言与文本信息所使用的语言是否一致; 步骤 S4:如果否,则将字幕文件所使用的语言译成与文本信息所使用的语言一致, 或者, 按照文本信息所使用的语言重新生成字幕文件。 在优选实施例中, 在移动终端已经确定用户待搜索的文件信息并且已经获取到或 者已经生成与视频文件对应的字幕文件的情况下, 如果希望将两者进行匹配, 就需要 确保两者所使用的语音保持一致。 因此, 如果字幕文件中的所使用语言与用户进行语 音搜索时所使用的语言相同, 此时, 视频播放器可以允许用户指定字幕文件。 但是, 如果视频文件中没有加载字幕文件或者其加载的字幕文件所使用的语言与用户所使用 的语言不同。 此时, 需要将字幕文件所使用的语言译成与文本信息所使用的语言一致 或者按照文本信息所使用的语言重新生成字幕文件。 优选地,在步骤 S106中,根据查找到的字幕内容确定与该字幕内容对应的播放时 间, 并按照播放时间调整播放进度可以包括以下处理步骤: 步骤 S5 : 根据查找到的字幕内容确定该字幕内容的字幕序号; 步骤 S6: 根据字幕序号获取与查找到的字幕内容对应的播放时间段, 并在播放时 间段中确定与查找到的字幕内容对应的起始的播放时间; 步骤 S7: 按照播放时间调整播放进度。 在优选实施例中, 尽管字幕文件可以有多种格式, 但是在各种格式的字幕文件中 均包含有多段字幕信息, 而每一段字幕信息又可以包括: 字幕序号、 播放时间段以及 字幕内容。 当在字幕文件中查找到与用户待搜索的文本信息匹配的字幕内容后, 便可 以进一步确定该段字幕内容的字幕序号, 并且可以根据字幕序号进一步确定该段字幕 内容对应的播放时间段(包括: 起始时间与终止时间), 由此可以确定该段字幕内容的 起始的播放时间,进而按照播放时间调整播放进度。 以上述格式为 srt的字幕文件示例 为例,假设待搜索的文本信息为" At the left we can see... ", 而在字幕文件中确实查找到 与该文本信息相匹配的字幕信息, 具体如下:
2
00:00: 15,000→00:00: 18,000 At the left we can see. 由此可以确定该段字幕内容的字幕序号为 2, 而与字幕序号为 2对应的播放时间 段为 00:00: 15,000→00:00: 18,000, 即起始的播放时间为 00:00: 15,000, 因此, 可以按照 播放时间调整视频文件的播放进度。 优选地, 在步骤 S102中, 确定文本信息可以包括但不限于以下方式之一: 方式一、 接收输入的文本信息; 方式二、 接收语音数据, 并将语音数据转换成文本信息。 在优选实施例中, 移动终端用户既可以通过键盘或者触摸屏在视频播放器的搜索 对话框中输入待搜索的文本信息, 也可以通过麦克风将待搜索的文本信息通过语音的 方式输入到移动终端中。 下面结合图 2所示的优选实施方式对上述优选实施过程做进一步的描述。 图 2是根据本发明优选实施例的视频文件播放进度的调整方法的流程图。 如图 2 所示, 该流程可以包括以下处理步骤: 步骤 S202:用户在移动终端上打开视频播放器软件,并选择播放特定的视频文件; 步骤 S204: 用户查找是否存在与视频文件对应的字幕文件, 并且可以在视频播放 器中指定字幕文件; 如果是, 则转到步骤 S208; 如果否, 则继续执行步骤 S206; 步骤 S206: 如果没有, 可以采用移动终端的音视频解码器对视频文件中的音频数 据进行解码, 然后将其转换成特定语言 (例如: 中文或者英文) 的文本格式并同时记 录时间信息, 即新生成一个字幕文件; 步骤 S208: 用户选择使用语音搜索功能来搜索视频内容; 步骤 S210:视频播放器软件将用户通过麦克风输入的语音数据转换成特定格式的 文本; 步骤 S212: 视频播放器软件使用转换后的用户语音文本在字幕文件中进行搜索, 如果查找到相匹配的字幕内容, 则可以得到对应字幕的播放时间; 步骤 S214: 视频播放器软件使用播放时间调整视频文件的播放进度。 图 3是根据本发明实施例的视频文件播放进度的调整装置的结构框图。 如图 3所 示, 该视频文件播放进度的调整装置可以包括: 接收模块 10, 设置为接收待搜索的文 本信息; 查找模块 20, 设置为在视频文件的字幕文件中查找与文本信息相匹配的字幕 内容,其中,字幕文件是从视频文件中获取的或者根据视频文件生成的;调整模块 30, 设置为根据查找到的字幕内容确定与该字幕内容对应的播放时间, 并按照播放时间调 整视频文件的播放进度。 采用如图 3所示的装置, 解决了相关技术中移动终端用户在已经观看视频中查找 特定片段的方式较为复杂且对该特定片段的播放进度定位的准确性较差的问题, 进而 实现了对用户期望回放片段进行准确定位, 操作简单方便。 优选地, 如图 4所示, 上述装置还可以包括: 第一判断模块 40, 设置为判断视频 文件中是否存在字幕文件; 第一处理模块 50, 设置为在第一判断模块输出为否时, 根 据视频文件中的音频数据生成字幕文件。 优选地, 如图 4所示, 第一处理模块 50可以包括: 解码单元 500, 设置为对视频 文件中的音频数据进行解码处理; 转换单元 502, 设置为将解码后的音频数据转换成 字幕文件。 优选地, 如图 4所示, 上述装置还可以包括: 第二判断模块 60, 设置为判断字幕 文件所使用的语言与文本信息所使用的语言是否一致; 第二处理模块 70, 设置为在第 二判断模块输出为否时,将字幕文件所使用的语言译成与文本信息所使用的语言一致, 或者, 按照文本信息所使用的语言重新生成字幕文件。 优选地, 如图 4所示, 调整模块 30可以包括: 第一确定单元 300, 设置为根据查 找到的字幕内容确定该字幕内容的字幕序号; 第二确定单元 302, 设置为根据字幕序 号获取与查找到的字幕内容对应的播放时间段, 并在播放时间段中确定与查找到的字 幕内容对应的起始的播放时间; 调整单元 304, 设置为按照播放时间调整播放进度。 优选地, 如图 4所示, 确定模块 10可以包括: 第一接收单元 100, 设置为接收输 入的文本信息; 第二接收单元 102, 设置为接收语音数据, 并将语音数据转换成文本 信息。 下面结合图 5所示的优选实施方式对上述优选实施过程做进一步的描述。 图 5是根据本发明优选实施例的智能移动终端的软硬件架构示意图。 该架构可以 分为三个层次, 从下至上依次分别为硬件层、 操作系统层以及应用程序层。 硬件层可 以包括: 处理器、 存储器、 麦克风、 扬声器和触控显示屏。 操作系统层负责管理硬件 设备、 提供文件系统和功能程序库, 而功能程序库中可以包括: 语音识别模块 (相当 于上述第一处理模块和确定模块)。语音识别模块的作用在于将语音数据转换成对应的 文本内容。 应用程序层可以包括: 多个应用程序, 例如: 视频播放器、 计算器。 视频 播放器中可以包括音视频解码器, 能够对视频文件进行解码, 然后在显示屏上播放图 像, 并通过扬声器播放声音。 本发明所提供的技术方案可以在视频播放器中新增一个 功能模块, 在显示屏上显示的视频图像中可以将该功能命名为语音搜索 (相当于上述 查找模块和调整模块)。 当用户使用视频播放器观看视频时, 可以选择使用语音搜索功能。 此时, 用户可 以对着智能移动终端说出脑海中记忆的期望检索的语句, 然后, 语音搜索模块即可根 据用户说出的语句的字幕内容搜索视频文件中相应的音频数据, 进而查找匹配项。 如 果能够查找到匹配项, 则将视频播放器的进度条滑块定位到相应的位置。 在该优选实施例中, 语音搜索模块完成上述工作需要具备以下前提条件: 条件一、 智能移动终端的软件系统中包含语音识别模块, 该语音识别模块可以由 操作系统或者其它应用程序提供, 而且视频播放器软件可以对其进行使用。 例如: 语 音识别模块可以接收预设格式(例如: 脉冲编码调制(Pulse Code Modulation, 简称为 PCM)) 的音频输入, 并将其转换成特定语言 (例如: 中文或者英文) 的文本。 条件二、 视频播放器中的音视频解码器能够识别视频文件的格式 (例如: MP4、 AVI), 并将视频文件中的音频数据解码成语音识别模块能够接收的格式。 条件三、 语音搜索模块能够通过操作系统从麦克风采集用户说出的语音数据, 而 且这些语音数据能够被语音识别模块所接收。 作为本发明的一个优选实施例,可以采用 srt格式的字幕文件为例进一步对用户希 望搜索的文本信息在视频文件中的定位进行详细的描述, 当然, 在具体实施过程中还 可以采用其它格式的字幕文件, 此处并不构成对本发明的限定。 整个过程具体包括以 下几个步骤: 第一步、 使用视频播放器中的音频解码器将视频文件中的音频数据解码成语音识 别模块能够识别的格式 (例如: PCM格式) 并保存至音频文件中。 第二步、 分析上述音频文件并生成字幕文件。 循环读取音频文件中的语音数据, 可以每次读取 1秒的语音数据, 其次将这 1秒的音频数据输入至语音识别模块中, 生 成对应的字幕文本,然后按照 Srt格式生成一段字幕信息保存至字幕文件中。按照上述 方式, 如果视频文件的长度为 N秒, 那么在字幕文件中就会生成 N段字幕。 第三步、 视频文件的搜索定位。 在用户启用语音搜索功能时, 用户说出在视频文 件中期望检索到的语句, 语音搜索模块从麦克风采集到用户说出的语音数据, 然后将 其输入到语音识别模块中, 以生成对应的文本。 随后, 语音搜索模块在字幕文件中搜 索上述生成的文本, 如果搜索到与之匹配的文本, 按照字幕文件的格式可以获得与匹 配文本对应的播放时间。 最后, 视频播放器根据播放时间进行定位。 从以上的描述中, 可以看出, 上述实施例实现了如下技术效果 (需要说明的是这 些效果是某些优选实施例可以达到的效果): 采用本发明实施例所提供的技术方案, 为 用户提供了一种新的视频文件播放进度的调整方法及装置, 移动终端可以通过确定用 户待搜索的文本信息, 然后将该文本信息与视频文件中的字幕文件进行匹配, 如果能 够查找到匹配的字幕内容, 则获取与匹配字幕内容对应的播放时间, 从而根据播放时 间准确调整视频文件的播放进度, 解决了相关技术中移动终端用户在已经观看视频中 查找特定片段的方式较为复杂且对该特定片段的播放进度定位的准确性较差的问题, 进而实现了对用户期望回放片段进行准确定位, 操作简单方便。 显然, 本领域的技术人员应该明白, 上述的本发明实施例的各模块或各步骤可以 用通用的计算装置来实现, 它们可以集中在单个的计算装置上, 或者分布在多个计算 装置所组成的网络上, 可选地, 它们可以用计算装置可执行的程序代码来实现, 从而, 可以将它们存储在存储装置中由计算装置来执行, 并且在某些情况下, 可以以不同于 此处的顺序执行所示出或描述的步骤, 或者将它们分别制作成各个集成电路模块, 或 者将它们中的多个模块或步骤制作成单个集成电路模块来实现。 这样, 本发明不限制 于任何特定的硬件和软件结合。 以上所述仅为本发明的优选实施例而已, 并不用于限制本发明, 对于本领域的技 术人员来说, 本发明可以有各种更改和变化。 凡在本发明的精神和原则之内, 所作的 任何修改、 等同替换、 改进等, 均应包含在本发明的保护范围之内。

Claims

权 利 要 求 书
1. 一种视频文件播放进度的调整方法, 包括:
接收待搜索的文本信息;
在视频文件的字幕文件中查找与所述文本信息相匹配的字幕内容, 其中, 所述字幕文件是从所述视频文件中获取的或者根据所述视频文件生成的;
根据查找到的字幕内容确定与该字幕内容对应的播放时间, 并按照所述播 放时间调整所述视频文件的播放进度。
2. 根据权利要求 1所述的方法, 其中, 在查找与所述文本信息相匹配的字幕内容 之前, 还包括:
判断所述视频文件中是否存在所述字幕文件;
如果否, 则根据所述视频文件中的音频数据生成所述字幕文件。
3. 根据权利要求 2所述的方法,其中,根据所述音频数据生成所述字幕文件包括:
对所述视频文件中的音频数据进行解码处理;
将解码后的音频数据转换成所述字幕文件。
4. 根据权利要求 1所述的方法, 其中, 在查找与所述文本信息相匹配的字幕内容 之前还包括:
判断所述字幕文件所使用的语言与所述文本信息所使用的语言是否一致; 如果否, 则将所述字幕文件所使用的语言译成与所述文本信息所使用的语 言一致, 或者, 按照所述文本信息所使用的语言重新生成所述字幕文件。
5. 根据权利要求 1所述的方法, 其中, 根据所述查找到的字幕内容确定与该字幕 内容对应的播放时间, 并按照所述播放时间调整所述播放进度包括:
根据所述查找到的字幕内容确定该字幕内容的字幕序号;
根据所述字幕序号获取与所述查找到的字幕内容对应的播放时间段, 并在 所述播放时间段中确定与所述查找到的字幕内容对应的起始的播放时间;
按照所述播放时间调整所述播放进度。
6. 根据权利要求 1至 5中任一项所述的方法, 其中, 接收所述文本信息包括以下 之一:
接收输入的所述文本信息;
接收语音数据, 并将所述语音数据转换成所述文本信息。
7. 一种视频文件播放进度的调整装置, 包括:
接收模块, 设置为接收待搜索的文本信息;
查找模块, 设置为在视频文件的字幕文件中查找与所述文本信息相匹配的 字幕内容, 其中, 所述字幕文件是从所述视频文件中获取的或者根据所述视频 文件生成的;
调整模块, 设置为根据查找到的字幕内容确定与该字幕内容对应的播放时 间, 并按照所述播放时间调整所述视频文件的播放进度。
8. 根据权利要求 7所述的装置, 其中, 所述装置还包括:
第一判断模块, 设置为判断所述视频文件中是否存在所述字幕文件; 第一处理模块, 设置为在所述第一判断模块输出为否时, 根据所述视频文 件中的音频数据生成所述字幕文件。
9. 根据权利要求 8所述的装置, 其中, 所述第一处理模块包括:
解码单元, 设置为对所述视频文件中的音频数据进行解码处理; 转换单元, 设置为将解码后的音频数据转换成所述字幕文件。
10. 根据权利要求 7所述的装置, 其中, 所述装置还包括:
第二判断模块, 设置为判断所述字幕文件所使用的语言与所述文本信息所 使用的语言是否一致;
第二处理模块, 设置为在所述第二判断模块输出为否时, 将所述字幕文件 所使用的语言译成与所述文本信息所使用的语言一致, 或者, 按照所述文本信 息所使用的语言重新生成所述字幕文件。
11. 根据权利要求 7所述的装置, 其中, 所述调整模块包括:
第一确定单元, 设置为根据所述查找到的字幕内容确定该字幕内容的字幕 序号; 第二确定单元, 设置为根据所述字幕序号获取与所述查找到的字幕内容对 应的播放时间段, 并在所述播放时间段中确定与所述查找到的字幕内容对应的 起始的播放时间;
调整单元, 设置为按照所述播放时间调整所述播放进度。
12. 根据权利要求 7至 11中任一项所述的装置, 其中, 所述接收模块包括:
第一接收单元, 设置为接收输入的所述文本信息;
第二接收单元, 设置为接收语音数据, 并将所述语音数据转换成所述文本 信息。
PCT/CN2013/084520 2013-07-15 2013-09-27 视频文件播放进度的调整方法及装置 WO2014161282A1 (zh)

Priority Applications (2)

Application Number Priority Date Filing Date Title
EP13880926.4A EP2978232A4 (en) 2013-07-15 2013-09-27 METHOD AND DEVICE FOR ADJUSTING THE PLAYING PROGRESS OF A VIDEO FILE
US14/890,186 US9799375B2 (en) 2013-07-15 2013-09-27 Method and device for adjusting playback progress of video file

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201310295470.0 2013-07-15
CN201310295470.0A CN104301771A (zh) 2013-07-15 2013-07-15 视频文件播放进度的调整方法及装置

Publications (1)

Publication Number Publication Date
WO2014161282A1 true WO2014161282A1 (zh) 2014-10-09

Family

ID=51657475

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2013/084520 WO2014161282A1 (zh) 2013-07-15 2013-09-27 视频文件播放进度的调整方法及装置

Country Status (4)

Country Link
US (1) US9799375B2 (zh)
EP (1) EP2978232A4 (zh)
CN (1) CN104301771A (zh)
WO (1) WO2014161282A1 (zh)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104883607A (zh) * 2015-06-05 2015-09-02 广东欧珀移动通信有限公司 一种视频截图或剪切的方法、装置及移动设备

Families Citing this family (32)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20130291019A1 (en) * 2012-04-27 2013-10-31 Mixaroo, Inc. Self-learning methods, entity relations, remote control, and other features for real-time processing, storage, indexing, and delivery of segmented video
CN103686352A (zh) * 2013-11-15 2014-03-26 乐视致新电子科技(天津)有限公司 智能电视媒体播放器及其字幕处理方法、智能电视
CN105163178B (zh) * 2015-08-28 2018-08-07 北京奇艺世纪科技有限公司 一种视频播放位置定位方法和装置
WO2018027730A1 (zh) * 2016-08-11 2018-02-15 张婧 钢琴视频教学中的同步方法及系统
CN106210845A (zh) * 2016-08-11 2016-12-07 张婧 音乐课程中教学视频同步的方法及系统
WO2018027729A1 (zh) * 2016-08-11 2018-02-15 张婧 音乐课程中教学视频同步的方法及系统
WO2018027731A1 (zh) * 2016-08-11 2018-02-15 张婧 英文学习中的视频同步方法及系统
CN106297846A (zh) * 2016-08-11 2017-01-04 张婧 钢琴视频教学中的同步方法及系统
CN109271532A (zh) * 2017-07-18 2019-01-25 北京国双科技有限公司 一种多媒体文件回放的方法及装置
CN107506385A (zh) * 2017-07-25 2017-12-22 努比亚技术有限公司 一种视频文件检索方法、设备及计算机可读存储介质
CN107396203A (zh) * 2017-09-06 2017-11-24 深圳市视维科技股份有限公司 一种基于IJKPlayer外挂字幕的方法
CN107767871B (zh) * 2017-10-12 2021-02-02 安徽听见科技有限公司 文本显示方法、终端及服务器
CN107820123A (zh) * 2017-10-25 2018-03-20 深圳天珑无线科技有限公司 移动终端截取屏幕画面的方法、移动终端以及存储装置
CN107809679A (zh) * 2017-10-26 2018-03-16 费非 调节字幕的方法和装置
CN107908674A (zh) * 2017-10-26 2018-04-13 费非 语音判断方法及装置、存储介质和处理器
US10459620B2 (en) * 2018-02-09 2019-10-29 Nedelco, Inc. Caption rate control
CN108282678B (zh) * 2018-02-11 2021-01-05 孙新峰 一种多媒体数据的播放方法、装置及系统
CN108806692A (zh) * 2018-05-29 2018-11-13 深圳市云凌泰泽网络科技有限公司 一种音频内容查找及可视化播放方法
CN109005445A (zh) * 2018-06-26 2018-12-14 卫军征 多媒体播放方法、系统、存储介质及播放设备
CN109246472A (zh) * 2018-08-01 2019-01-18 平安科技(深圳)有限公司 视频播放方法、装置、终端设备及存储介质
CN109657094A (zh) * 2018-11-27 2019-04-19 平安科技(深圳)有限公司 音频处理方法及终端设备
CN110162668B (zh) * 2019-03-07 2023-11-14 腾讯科技(深圳)有限公司 交互方法、装置、计算机可读存储介质和计算机设备
CN109905772B (zh) * 2019-03-12 2022-07-22 腾讯科技(深圳)有限公司 视频片段查询方法、装置、计算机设备及存储介质
CN110248245B (zh) * 2019-06-21 2022-05-06 维沃移动通信有限公司 一种视频定位方法、装置、移动终端及存储介质
US10965888B1 (en) * 2019-07-08 2021-03-30 Snap Inc. Subtitle presentation based on volume control
CN110401879A (zh) * 2019-08-13 2019-11-01 宇龙计算机通信科技(深圳)有限公司 一种视频播放的控制方法、装置、终端及存储介质
JP7447422B2 (ja) * 2019-10-07 2024-03-12 富士フイルムビジネスイノベーション株式会社 情報処理装置およびプログラム
CN113051985A (zh) * 2019-12-26 2021-06-29 深圳云天励飞技术有限公司 信息提示方法、装置、电子设备及存储介质
CN113382291A (zh) * 2020-03-09 2021-09-10 海信视像科技股份有限公司 一种显示设备及流媒体播放方法
CN113099312A (zh) * 2021-03-30 2021-07-09 深圳市多科特文化传媒有限公司 教学视频播放系统
CN113378001B (zh) * 2021-06-28 2024-02-27 北京百度网讯科技有限公司 视频播放进度的调整方法及装置、电子设备和介质
CN114501159B (zh) * 2022-01-24 2023-12-22 传神联合(北京)信息技术有限公司 一种字幕编辑方法、装置、电子设备及存储介质

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101102419A (zh) * 2007-07-10 2008-01-09 北京大学 一种定位视频字幕区域的方法
CN101382937A (zh) * 2008-07-01 2009-03-11 深圳先进技术研究院 基于语音识别的多媒体资源处理方法及其在线教学系统
CN101739450A (zh) * 2009-11-26 2010-06-16 北京网梯科技发展有限公司 对视频中出现的信息进行检索的方法及系统
CN101908053A (zh) * 2009-11-27 2010-12-08 新奥特(北京)视频技术有限公司 一种语音检索的方法及装置

Family Cites Families (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5794249A (en) 1995-12-21 1998-08-11 Hewlett-Packard Company Audio/video retrieval system that uses keyword indexing of digital recordings to display a list of the recorded text files, keywords and time stamps associated with the system
US6370543B2 (en) * 1996-05-24 2002-04-09 Magnifi, Inc. Display of media previews
US20030046075A1 (en) * 2001-08-30 2003-03-06 General Instrument Corporation Apparatus and methods for providing television speech in a selected language
KR100700814B1 (ko) * 2005-07-07 2007-03-27 엘지전자 주식회사 디지털 비디오 기기에서의 텍스트 파일 재생장치 및 방법
US20070154176A1 (en) * 2006-01-04 2007-07-05 Elcock Albert F Navigating recorded video using captioning, dialogue and sound effects
US7680853B2 (en) * 2006-04-10 2010-03-16 Microsoft Corporation Clickable snippets in audio/video search results
US8891938B2 (en) 2007-09-06 2014-11-18 Kt Corporation Methods of playing/recording moving picture using caption search and image processing apparatuses employing the method
US20100106482A1 (en) * 2008-10-23 2010-04-29 Sony Corporation Additional language support for televisions
US8620139B2 (en) * 2011-04-29 2013-12-31 Microsoft Corporation Utilizing subtitles in multiple languages to facilitate second-language learning
US8914276B2 (en) * 2011-06-08 2014-12-16 Microsoft Corporation Dynamic video caption translation player
TW201421994A (zh) 2012-11-21 2014-06-01 Hon Hai Prec Ind Co Ltd 視頻內容搜索系統及方法
CN103067775A (zh) * 2013-01-28 2013-04-24 Tcl集团股份有限公司 一种音视频终端的字幕显示方法、音视频终端及服务器

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101102419A (zh) * 2007-07-10 2008-01-09 北京大学 一种定位视频字幕区域的方法
CN101382937A (zh) * 2008-07-01 2009-03-11 深圳先进技术研究院 基于语音识别的多媒体资源处理方法及其在线教学系统
CN101739450A (zh) * 2009-11-26 2010-06-16 北京网梯科技发展有限公司 对视频中出现的信息进行检索的方法及系统
CN101908053A (zh) * 2009-11-27 2010-12-08 新奥特(北京)视频技术有限公司 一种语音检索的方法及装置

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104883607A (zh) * 2015-06-05 2015-09-02 广东欧珀移动通信有限公司 一种视频截图或剪切的方法、装置及移动设备
CN104883607B (zh) * 2015-06-05 2017-12-19 广东欧珀移动通信有限公司 一种视频截图或剪切的方法、装置及移动设备

Also Published As

Publication number Publication date
EP2978232A4 (en) 2016-05-04
EP2978232A1 (en) 2016-01-27
US9799375B2 (en) 2017-10-24
US20160133298A1 (en) 2016-05-12
CN104301771A (zh) 2015-01-21

Similar Documents

Publication Publication Date Title
WO2014161282A1 (zh) 视频文件播放进度的调整方法及装置
US9330720B2 (en) Methods and apparatus for altering audio output signals
EP1865426B1 (en) Information processing apparatus, information processing method, and computer program
CN108391149B (zh) 显示设备、控制显示设备的方法、服务器以及控制服务器的方法
CN110675886B (zh) 音频信号处理方法、装置、电子设备及存储介质
US20140006022A1 (en) Display apparatus, method for controlling display apparatus, and interactive system
CN109754783B (zh) 用于确定音频语句的边界的方法和装置
WO2021083071A1 (zh) 语音转换、文件生成、播音、语音处理方法、设备及介质
US20150098018A1 (en) Techniques for live-writing and editing closed captions
US9767825B2 (en) Automatic rate control based on user identities
WO2019047878A1 (zh) 语音操控终端的方法、终端、服务器和存储介质
KR101100191B1 (ko) 멀티미디어 재생장치와 이를 이용한 멀티미디어 자료검색방법
CN101753915A (zh) 数据处理设备、数据处理方法及程序
WO2016202176A1 (zh) 一种媒体文件合成方法、装置和设备
CN102592628A (zh) 一种音视频播放文件的播放控制方法
JP7280328B2 (ja) 情報処理装置、情報処理方法、プログラム
US20110035223A1 (en) Audio clips for announcing remotely accessed media items
US20140078331A1 (en) Method and system for associating sound data with an image
EP3203468A1 (en) Acoustic system, communication device, and program
JP2006189799A (ja) 選択可能な音声パターンの音声入力方法及び装置
CN111627417B (zh) 播放语音的方法、装置及电子设备
WO2021017302A1 (zh) 一种数据提取方法、装置、计算机系统及可读存储介质
KR100944958B1 (ko) 특정 구간의 멀티미디어 데이터 및 캡션 데이터를 제공하는장치 및 서버
CN109977239B (zh) 一种信息处理方法和电子设备
KR102636708B1 (ko) 프레젠테이션 문서에 대한 수어 발표 영상을 제작할 수 있는 전자 단말 장치 및 그 동작 방법

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 13880926

Country of ref document: EP

Kind code of ref document: A1

REEP Request for entry into the european phase

Ref document number: 2013880926

Country of ref document: EP

WWE Wipo information: entry into national phase

Ref document number: 2013880926

Country of ref document: EP

WWE Wipo information: entry into national phase

Ref document number: 14890186

Country of ref document: US

NENP Non-entry into the national phase

Ref country code: DE