WO2011110063A1 - Method and system for generating video scene library, method and system for retrieving video scenes - Google Patents

Method and system for generating video scene library, method and system for retrieving video scenes Download PDF

Info

Publication number
WO2011110063A1
WO2011110063A1 PCT/CN2011/071072 CN2011071072W WO2011110063A1 WO 2011110063 A1 WO2011110063 A1 WO 2011110063A1 CN 2011071072 W CN2011071072 W CN 2011071072W WO 2011110063 A1 WO2011110063 A1 WO 2011110063A1
Authority
WO
WIPO (PCT)
Prior art keywords
video
subtitle
video scene
library
segment
Prior art date
Application number
PCT/CN2011/071072
Other languages
French (fr)
Chinese (zh)
Inventor
李平辉
Original Assignee
Li Pinghui
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Li Pinghui filed Critical Li Pinghui
Publication of WO2011110063A1 publication Critical patent/WO2011110063A1/en

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/70Information retrieval; Database structures therefor; File system structures therefor of video data
    • G06F16/73Querying
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/70Information retrieval; Database structures therefor; File system structures therefor of video data

Definitions

  • the present invention relates to the field of video search technology, and in particular, to a method for generating a video scene library and a search method and system for a video scene based on the library.
  • the present invention also relates to a method and system for directly searching for a video scene. Background technique
  • Video search technology on the Internet has been widely used today. Users can easily get the video information they want by using a video search engine.
  • Today's video search technology is generally based on keyword search.
  • the video file that meets the search criteria is returned to the user by matching the search of the video file name or the related tag in the video database. For example, if the user enters the keyword "crazy" to perform a video search, then the video files including "Crazy Stone", "Crazy Racing” and the like, including the word "crazy" are search results that meet the search criteria. Even with the more advanced frame search technique, the result is that the search results are returned to the user in units of the entire video file.
  • Today's video search technology does not provide a convenient search function for video clips.
  • a user who needs a large amount of video scenes as a material such as a photographer who wants to refer to many war scene shooting methods, needs some rain scenes as a material for video production enthusiasts, they can only First, judge which video files will appear on this type of video scene through experience or other auxiliary conditions. Then, by watching a large number of these video files, the target video scene is found, and then the video cutting software is used for cutting and collecting.
  • the problem to be solved by the present invention is to provide a method for generating a video scene library, which provides data support for the user to quickly and easily find the target video scene segment.
  • the invention also provides a generation system of a video scene library corresponding to the above method.
  • the present invention also provides a method and system for searching a video scene segment based on a video scene library generated by the above method, so that the user can find the target video scene segment conveniently and quickly.
  • the present invention also provides a method and system for directly searching for video scene segments, so that the user can quickly and easily find the target video scene segment.
  • the present invention also provides a method and system for generating a video scene, so as to quickly generate a large number of video scene segments.
  • the present invention uses the following technical solutions:
  • a method for generating a video scene library comprising the following steps:
  • the caption annotation includes a dialogue/narration in the video scene, or a synonymous explanation or generalization of the dialogue/narration, or a label describing the type of the video scene.
  • the step B further includes extracting the time anchor point and related video file information into the subtitle library.
  • the steps C are sequentially interchanged.
  • An annotation unit for performing time anchor annotation and subtitle annotation on the video scene in the video file of the data source ;
  • a subtitle extraction unit configured to extract the subtitle segments of the annotation into the subtitle library
  • a cutting unit configured to perform redundant cutting on the video file according to the marked time anchor point, intercepting the video scene segment corresponding to the character screen, and storing the video scene segment in the video scene segment library;
  • the relationship establishing unit is configured to establish a correspondence between the subtitle segment in the subtitle library and the video scene segment in the video scene library.
  • the subtitle annotation includes a dialogue/narration original in the video scene, or a synonymous explanation or generalization of the dialogue/narration, or a label describing a video scene type;
  • the subtitle extraction The unit further extracts the time anchor point and related video file information into the subtitle library.
  • the user inputs a keyword to request a search for a video scene segment
  • the step a and the step b further include: determining whether to request a search for a video scene of a dialogue or narration type, or requesting a video scene of a description type.
  • step b searches for a subtitle segment of the dialogue or narration type in the subtitle library; if it is the latter, step b searches the subtitle library for the subtitle segment of the description type.
  • the step b and the step c further comprise: determining whether to request to intercept the video scene segment in real time according to the new cutting time redundancy.
  • a system for searching a video scene segment based on a video scene library generated by the above method comprising:
  • An input unit through which the user inputs information
  • a search unit configured to: when receiving an input unit to initiate a request, search for a video scene segment in the storage unit;
  • a storage unit configured to store the generated video scene library, that is, the stored video scene segment library and the subtitle library;
  • a display unit for displaying video clips that match the search criteria.
  • the search system further includes a determining unit, configured to determine whether the search request is for a dialogue or a narration type scene or a description type of the scene, and is used for determining whether to request input of the video scene segment.
  • the cutting time redundancy is re-cut and intercepted.
  • the system further includes a cutting unit for re-cutting the video file by an input cutting time redundancy amount.
  • a method for directly searching for a video scene segment comprising the following steps:
  • ⁇ ' time anchor annotation and subtitle annotation for the video scene in the video file in the data source
  • ⁇ ' extract the time anchor point and subtitle segment of the annotation and related video file information into the subtitle library
  • C' user input key The word proposes a request to search for a video scene segment
  • a system for directly searching for video scene segments comprising:
  • An annotation unit for performing time anchor annotation and subtitle annotation on the video scene in the video file of the data source ;
  • a subtitle library extracting unit for extracting the labeled subtitle segment and time anchor point and related video file information Saved in the subtitle library
  • An input unit through which the user inputs information
  • a search unit configured to perform a keyword matching search in the subtitle library when receiving the input unit to initiate the request
  • a cutting unit configured to cut a target video scene segment by cutting a video file in the data source according to the time anchor point and the cutting redundancy amount
  • a method for generating a video scene comprising the following steps:
  • a system for generating a video scene comprising:
  • An annotation unit for performing time anchor annotation and subtitle annotation on the video scene in the video file of the data source ;
  • a caption extraction unit configured to extract the caption segment and the time anchor point into the caption library
  • the cutting unit is configured to perform redundant cutting on the video file according to the marked time anchor point, and intercept the video scene segment corresponding to the subtitle segment.
  • the invention has the beneficial effects of: the method and system for generating a video scene library and the method and system for searching a video scene segment, which can be automatically used in a video file only by spending manpower time for creating a corresponding subtitle file.
  • Video scenes are collected into the video scene library.
  • the video scene library is similar to the font library, the concept of the thesaurus, which contains various video scene segments from various video files and corresponding subtitle segments.
  • FIG. 1 is a flowchart of a method for generating a video scene library according to the present invention.
  • FIG. 2 is a flow chart of a video scene search method according to the present invention.
  • FIG. 3 is a structural diagram of a video scene library of the present invention.
  • FIG. 4 is a schematic diagram of a video scene library generation system according to the present invention.
  • FIG. 5 is a schematic diagram of a video scene search system of the present invention.
  • FIG. 6 is a schematic diagram of a video scene search example of the present invention. detailed description
  • the video file according to the solution of the present invention does not affect the implementation of the present invention in terms of its content, format, type and the like.
  • a file of a general English movie video is taken as an example, but the implementation of the solution of the present invention is not limited to the video file of an English movie.
  • the present invention is also applicable to Chinese movies, other foreign language movies, and non-film videos.
  • FIG. 1 discloses a preferred implementation example of a method for generating a video scene library according to the present invention.
  • the method includes the following steps:
  • time anchor annotation and subtitle annotation are performed on each video scene of the video file.
  • a typical framing rule is to use a complete dialogue or narration for each scene unit in the video, and a specific scene as a scene unit.
  • the subtitle content can be a dialogue/narration original text, or a synonym explanation or generalization of the dialogue/narration, corresponding to a video scene of a dialogue or narration type, or a scene description label, corresponding to a descriptive video scene. Take the video file of the film "Forrest Gump" as an example.
  • the preset framing rule is framing with each complete dialogue or narration in the video, and also framing a specific scene, such as snow scene. , seascape, rain, battle scenes, etc.
  • the time anchors marked are: start time anchor and end time anchor point.
  • the start time anchor point is the point in time at which the video scene starts playing in the video file.
  • the end time anchor point refers to the time point at which the video scene ends playing in the video file; the content of the subtitle note includes the dialogue or narration and the scene description label.
  • a video scene is a video scene of a dialogue or narration type.
  • the video scene is marked with a general video subtitle creation technology, and a subtitle file containing a time anchor point and a subtitle segment is output.
  • Subtitle production technology is a mature technology, and I will not comment here.
  • the subtitle library can use general commercial database products. Each storage element includes a complete subtitle note and corresponding start time anchor and end time anchor.
  • the main structure of the entire subtitle library is shown in the following table: Serial number start time anchor end time anchor point subtitle information type from video
  • the cutting time redundancy refers to the amount of time that extends forward and backward, centering on the time period in which the target video scene is located.
  • the purpose of setting the cutting time redundancy is to allow the user to know the context information of the target video scene.
  • y is the length of time to mark the video scene.
  • the video scene segment set obtained by cutting all the cuts is stored in the video scene segment library.
  • the entity corresponding to the concept of the video scene fragment library may be a general commercial database product or a file set in a common operating system.
  • each video scene segment has a corresponding subtitle segment.
  • the video scene database generation system includes: an annotation unit 401, a subtitle extraction unit 402, a cutting unit 403, and a relationship establishing unit 404.
  • the labeling unit 401 is configured to perform time anchor annotation and caption annotation on the video scene in the video file of the data source;
  • the caption extraction unit 402 is configured to extract the labeled caption segment and time anchor point and other video file information into the caption library;
  • the unit 403 is configured to perform redundant cutting on the video file according to the marked time anchor point, and intercept the video scene segment corresponding to the subtitle, and store the video scene segment in the video scene segment library;
  • the relationship establishing unit 404 is configured to create the subtitle segment and the video in the subtitle library. Correspondence of video scene segments in the scene library.
  • the working principle of each unit can be referred to the description of the above method, and will not be described here.
  • the video scene library can be obtained by the above method and system for generating a video scene library. The following describes a method for searching a video scene segment based on the video scene library. Referring to FIG. 2 and FIG. 6, the method for searching a video scene segment based on the video scene library of the present invention includes the following steps:
  • the user inputs a keyword in the terminal, for example, enter "how are you” in the input box 601 to issue a request to search for a video scene.
  • step 203 is performed; if it is a video scene for the description type, step 204 is performed.
  • the keyword entered is "war”
  • step 206 is performed, and if real-time cutting is required, step 207 is performed.
  • the corresponding target video scene segment is returned by searching for the matched subtitle segment. For example, “how are you?" in “Scent Of A Woman”, “how are you do ing?” in “Bad L ieutenant", “Hi. How are you?" in “Mona Li sa Smi le”, etc.
  • the target video scene segment corresponding to the matched subtitle segment is returned by searching for the matched subtitle segment.
  • the time anchor point is further obtained by searching for the matched subtitle segment. For example, “S are a you?" The “how are you?" time anchors are "00: 07: 18. 269" and "00: 07: 20. 438". If the user selects cut play 606, the time redundancy re-entered in system pop-up box 607 is 5 seconds. After clicking cut play, then the program will be ((Scent Of A Woman)) from “00: 07: 13. The 269" to "00: 07: 25. 438" video clip is cut and returned to the search user.
  • the searched target video scene segment is displayed, referring to the video play interface 608.
  • the method for searching a video scene segment based on the video scene library of the present invention is described above.
  • the present invention also discloses a system for searching a video scene segment based on a video scene library corresponding to the above method.
  • the search system includes: an input unit 501, a determination unit 502, and a search The unit 503, the cutting unit 504, the storage unit 505, and the display unit 506.
  • the input unit 501 is configured to acquire information input by the user, including keywords, cutting time redundancy.
  • the determining unit 502 is configured to determine whether to search for a video scene of a type of dialogue or narration or a video scene of a description type. If it is the former, the search unit is called to search for the video scene of the dialogue or narration type in the video scene library in the storage unit; if the latter, the search unit is called for the description type in the video scene library in the storage unit. Video scenes are searched.
  • the determining unit 502 is further configured to determine whether it is required to perform real-time cutting on the video scene segment according to the input cutting time redundancy, such as cutting time redundancy. If real-time cutting is not required, the relevant video scene segment is returned directly from the video scene library in the storage unit.
  • the searching unit 503 is configured to receive a search for the subtitle and the video scene segment in the storage unit when the determining unit initiates the request.
  • the cutting unit 504 is configured to re-cut the video file in the storage unit according to the input cutting time redundancy amount when receiving the requesting unit to initiate the request.
  • the storage unit 505 is configured to store the generated video scene library and the data source video file of the generated video scene library.
  • the display unit 506 is for displaying a video scene segment that matches the search condition.
  • a video scene library similar to a font library and a thesaurus concept can be obtained.
  • the library contains various video scene segments from various video files and corresponding subtitle segments.
  • the user can easily obtain a large number of target video scene segments from different video files by inputting keywords in the terminal, thereby eliminating the current network technology.
  • it is necessary to download or copy a large number of large-volume video files, and then search in the subtitle file, and locate, cut, etc. in the video file.
  • the difference between this embodiment and the first embodiment is that, in this embodiment, the video scene library is not constructed, and when the user requests to search for a video scene segment, the time anchor point corresponding to the searched subtitle segment is in the data source. The video file is cut in real time, and the target video scene segment is returned to the user.
  • Various steps For the technical details and the working principle of each unit, refer to the first embodiment, and no further details are provided herein.
  • a method for directly searching for a video scene segment comprising the following steps:
  • ⁇ ' time anchor annotation and subtitle annotation for the video scene in the video file in the data source
  • ⁇ ' extract the time anchor point and subtitle segment of the annotation and related video file information into the subtitle library
  • C' user input key The word proposes a request to search for a video scene segment
  • the above method corresponds to a system for directly searching for a video scene segment, the system includes: Unit, subtitle library extraction unit, input unit, search unit, cutting unit, display unit.
  • the labeling unit is configured to perform time anchor annotation and subtitle annotation on the video scene in the video file of the data source;
  • the subtitle library extracting unit is configured to extract the labeled subtitle segment and the time anchor point and the related video file information into the subtitle library;
  • the input unit user inputs information through the input unit
  • the search unit is configured to perform a keyword matching search in the subtitle library when receiving the input unit to initiate the request;
  • the cutting unit is configured to intercept the target video scene segment from the video file in the data source according to the time anchor point and the cutting redundancy amount;
  • the display unit is used to display a video clip that matches the search criteria.
  • Embodiment 3 The difference between this embodiment and the above two embodiments is that the embodiment is only a method and system for generating a video scene. For the technical details of the various steps and the working principle of each unit, reference may be made to the first embodiment, and details are not described herein.
  • the corresponding video file is redundantly cut according to the marked time anchor point, and the video scene segment corresponding to the subtitle segment is intercepted.
  • a system for generating a video scene corresponding to the above method comprising an annotation unit, a subtitle extraction unit, and a cutting unit.
  • the labeling unit is configured to perform time anchor annotation and subtitle annotation on the video scene in the video file of the data source;
  • the caption extraction unit is configured to extract the labeled caption segment and the time anchor point and store the caption library in the caption library;
  • the cutting unit is configured to perform redundant cutting on the video file according to the marked time anchor point, and intercept the video scene segment corresponding to the subtitle segment.

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Multimedia (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Computational Linguistics (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Television Signal Processing For Recording (AREA)
  • Studio Circuits (AREA)

Abstract

A method and system for generating a video scene library, a method and system for retrieving video scenes are disclosed. The method for generating the video scene library comprises the following steps: A, performing time anchor marking and caption annotating to the video scenes in video files in a data source; B, extracting the annotated captions to be stored in a caption library; C, according to the marked time anchors, performing redundancy segmenting to the corresponding video files, intercepting the video scene fragments, which correspond to the captions, to be stored in a video scene fragment library; D, establishing the relationship between caption fragments in the caption library and video scene fragments in the video scene library. The solution enables well data support for conveniently and quickly finding target video scene fragments by users.

Description

视频场景库生成方法及系统、 搜索视频场景的方法及系统 技术领域  Video scene library generation method and system, method and system for searching video scene
本发明属于视频搜索技术领域,具体涉及一种视频场景库的生成方法以及基 于这种库的视频场景的搜索方法及系统; 此外, 本发明还涉及一种直接搜索视频 场景的方法及系统。 背景技术  The present invention relates to the field of video search technology, and in particular, to a method for generating a video scene library and a search method and system for a video scene based on the library. In addition, the present invention also relates to a method and system for directly searching for a video scene. Background technique
随着互联网络的普及和网络技术的发展,现今互联网上视频搜索技术已经被 普遍使用。 用户通过使用视频搜索引擎, 可以方便的获得自己想要的视频信息。 现今视频搜索技术普遍是基于关键字搜索 ,通过在视频数据库中对视频文件名或 者相关标签的关键字匹配检索, 将符合搜索条件的视频文件返回给用户。 例如, 用户输入关键字 "疯狂" 进行视频搜索 , 那么 《疯狂的石头》《疯狂的赛车》等 等文件名包涵 "疯狂"二字的视频文件都是符合搜索条件的搜索结果。 即使是釆 用更高级的帧搜索技术,其结果也是以为整个视频文件为单位将搜索结果返回给 用户。 现今视频搜索技术并没有提供方便快捷的视频场景片段的搜索功能。  With the popularity of the Internet and the development of network technologies, video search technology on the Internet has been widely used today. Users can easily get the video information they want by using a video search engine. Today's video search technology is generally based on keyword search. The video file that meets the search criteria is returned to the user by matching the search of the video file name or the related tag in the video database. For example, if the user enters the keyword "crazy" to perform a video search, then the video files including "Crazy Stone", "Crazy Racing" and the like, including the word "crazy" are search results that meet the search criteria. Even with the more advanced frame search technique, the result is that the search results are returned to the user in units of the entire video file. Today's video search technology does not provide a convenient search function for video clips.
殳设一个学习外语的用户如果想知道一个单词或者一个句子在众多实际电 影场景中如何运用, 例如, 一个学生想知道 "how are you? " 在哪些电影场景 中可以用到, 那么在现有的网络技术条件下,他首先必须得根据经验或者其他辅 助条件判断 "how a re you? " 这个句子会出现在哪一部视频里面, 然后利用字 幕搜索引擎和视频搜索引擎搜得这部视频的字幕文件和视频文件,通过对字幕文 件的关键字匹配检索确定这部片子存在 "how are you? " 这个句子后, 再通过拖 放方式或者特定播放软件定位到 "how are you? " 这个句子所在的时间段进行 观看。 用户如果想收集含有 "how are you? " 这个句子的视频场景片段, 就需 要再用视频切割软件对视频文件进行切割收集。通过不断重复所述的过程, 用户 可以收集到一些包含 "how are you? " 这个对话内容的不同视频场景。  If you want to know how a word or a sentence can be used in many actual movie scenes, for example, a student wants to know "how are you?" in which movie scenes can be used, then in the existing Under the conditions of network technology, he must first judge the "how a re you?" sentence based on experience or other auxiliary conditions, and then use the subtitle search engine and video search engine to search for subtitles of this video. File and video files, after the keyword matching search of the subtitle file determines that the phrase "how are you?" exists in the film, and then locates the "how are you?" sentence by drag and drop or specific playback software. Watch the time period. If the user wants to collect video clips containing the phrase "how are you?", they will need to use video cutting software to cut and collect the video files. By repeating the process described above, the user can collect a number of different video scenes containing the "how are you?" conversation.
同样, 一个需要大量某一类视频场景作为素材的用户, 比如想参考许多战争 场景拍摄方法的摄影师, 需要一些雨景作为素材的视频制作爱好者,他们也只能 首先通过经验或者其他辅助条件判断这一类视频场景会出现在哪一些视频文件 上。 然后通过观看大量这些视频文件发现目标视频场景,再用视频切割软件进行 切割收集。 Similarly, a user who needs a large amount of video scenes as a material, such as a photographer who wants to refer to many war scene shooting methods, needs some rain scenes as a material for video production enthusiasts, they can only First, judge which video files will appear on this type of video scene through experience or other auxiliary conditions. Then, by watching a large number of these video files, the target video scene is found, and then the video cutting software is used for cutting and collecting.
通过以上所述, 可以看到, 在现有网络技术下, 用户必须花费大量的时间才 能获得少量的目标视频场景片段。 现今的视频搜索技术不具有通过关键字搜索, 快捷获得大量目标视频场景片段的功能。 发明内容  From the above, it can be seen that under the existing network technology, the user has to spend a large amount of time to obtain a small number of target video scene segments. Today's video search technology does not have the ability to quickly obtain a large number of target video scene segments through keyword search. Summary of the invention
本发明所要解决的问题是,提供一种视频场景库的生成方法, 为用户方便快 捷的找到目标视频场景片段做好数据支持。  The problem to be solved by the present invention is to provide a method for generating a video scene library, which provides data support for the user to quickly and easily find the target video scene segment.
本发明同时提供上述方法对应的视频场景库的生成系统。  The invention also provides a generation system of a video scene library corresponding to the above method.
此外,本发明还提供了一种基于上述方法所生成的视频场景库的搜索视频场 景片段的方法和系统, 以便用户方便快捷的找到目标视频场景片段。  In addition, the present invention also provides a method and system for searching a video scene segment based on a video scene library generated by the above method, so that the user can find the target video scene segment conveniently and quickly.
此外, 本发明还提供了一种直接搜索视频场景片段的方法和系统, 以便用户 方便快捷的找到目标视频场景片段。  In addition, the present invention also provides a method and system for directly searching for video scene segments, so that the user can quickly and easily find the target video scene segment.
另外, 本发明还提供了一种视频场景的生成方法和系统, 以便快捷的生成大 量视频场景片段。  In addition, the present invention also provides a method and system for generating a video scene, so as to quickly generate a large number of video scene segments.
为解决上述技术问题, 本发明釆用如下技术方案:  In order to solve the above technical problems, the present invention uses the following technical solutions:
一种视频场景库的生成方法, 所述方法包括如下步骤:  A method for generating a video scene library, the method comprising the following steps:
A、 对数据源中视频文件里的视频场景进行时间锚点标注和字幕附注; A. Perform time anchor annotation and subtitle annotation on the video scene in the video file in the data source;
B、 提取标注的字幕段存入字幕库; B. Extracting the subtitle segments of the annotation into the subtitle library;
C、 根据标注的时间锚点对对应视频文件进行有冗余切割, 截取该字幕对应 的视频场景片段, 存入视频场景片段库;  C. performing redundant cutting on the corresponding video file according to the marked time anchor point, intercepting the video scene segment corresponding to the subtitle, and storing the video scene segment in the video scene segment library;
D、 建立字幕库里的字幕段和视频场景库里的视频场景片段的对应关系。 作为本发明的一种优选方案, 所述步骤 A中, 字幕附注包括视频场景里的对 白 /旁白原文,或者对白 /旁白的同义解释或概括,或者描述视频场景类型的标签。  D. Establish a correspondence between the subtitle segment in the subtitle library and the video scene segment in the video scene library. As a preferred solution of the present invention, in the step A, the caption annotation includes a dialogue/narration in the video scene, or a synonymous explanation or generalization of the dialogue/narration, or a label describing the type of the video scene.
作为本发明的一种优选方案,所述步骤 B进一步包括提取时间锚点和相关视 频文件信息存入字幕库。 作为本发明的一种优选方案, 所述步骤^ 步骤 C顺序互换。 一种视频场景库的生成系统, 所述系统包括: As a preferred solution of the present invention, the step B further includes extracting the time anchor point and related video file information into the subtitle library. As a preferred embodiment of the present invention, the steps C are sequentially interchanged. A system for generating a video scene library, the system comprising:
标注单元,用以对数据源的视频文件里的视频场景进行时间锚点标注和字幕 附注;  An annotation unit for performing time anchor annotation and subtitle annotation on the video scene in the video file of the data source;
字幕提取单元, 用以提取标注的字幕段存入字幕库;  a subtitle extraction unit, configured to extract the subtitle segments of the annotation into the subtitle library;
切割单元, 用以根据标注的时间锚点对视频文件进行有冗余切割,截取该字 幕对应的视频场景片段, 存入视频场景片段库;  a cutting unit, configured to perform redundant cutting on the video file according to the marked time anchor point, intercepting the video scene segment corresponding to the character screen, and storing the video scene segment in the video scene segment library;
关系建立单元,用以建立字幕库里的字幕段和视频场景库里的视频场景片段 的对应关系。  The relationship establishing unit is configured to establish a correspondence between the subtitle segment in the subtitle library and the video scene segment in the video scene library.
作为本发明的一种优选方案, 所述标注单元中, 字幕附注包括视频场景里的 对白 /旁白原文, 或者对白 /旁白的同义解释或概括, 或者描述视频场景类型的标 签; 所述字幕提取单元进一步提取时间锚点和相关视频文件信息存入字幕库。 一种基于上述方法所生成的视频场景库的搜索视频场景片段的方法,所述方 法包括如下步骤:  As a preferred solution of the present invention, in the labeling unit, the subtitle annotation includes a dialogue/narration original in the video scene, or a synonymous explanation or generalization of the dialogue/narration, or a label describing a video scene type; the subtitle extraction The unit further extracts the time anchor point and related video file information into the subtitle library. A method for searching a video scene segment based on a video scene library generated by the above method, the method comprising the following steps:
a、 用户输入关键字提出搜索视频场景片段的请求;  a, the user inputs a keyword to request a search for a video scene segment;
b、 在字幕库里检索得到匹配的字幕段信息;  b. Searching for the matching subtitle segment information in the subtitle library;
c、 返回和匹配字幕段相关联的视频场景片段。  c. Returns and matches the video clip associated with the subtitle segment.
作为本发明的一种优选方案, 所述步骤 a和步骤 b之间进一步包括: 判断是请求搜索对白或旁白类型的视频场景, 还是请求描述类型的视频场 景。  As a preferred solution of the present invention, the step a and the step b further include: determining whether to request a search for a video scene of a dialogue or narration type, or requesting a video scene of a description type.
若为前者, 则步骤 b在字幕库里搜索对白或旁白类型的字幕段; 若为后者, 则步骤 b在字幕库里搜索描述类型的字幕段。  If it is the former, step b searches for a subtitle segment of the dialogue or narration type in the subtitle library; if it is the latter, step b searches the subtitle library for the subtitle segment of the description type.
作为本发明的一种优选方案, 所述步骤 b和步骤 c之间进一步包括: 判断是否请求按照新的切割时间冗余量实时切割截取视频场景片段。  As a preferred solution of the present invention, the step b and the step c further comprise: determining whether to request to intercept the video scene segment in real time according to the new cutting time redundancy.
若是, 则根据匹配字幕段对应的时间锚点,按照新的切割时间冗余量对相应 视频文件进行切割截取获得相应的视频场景片段; 若否, 则根据视频场景片段库 和字幕库的关联关系获得和匹配字幕段对应的视频场景片段。 一种基于上述方法所生成的视频场景库的搜索视频场景片段的系统,所述系 统包括: If yes, according to the time anchor point corresponding to the matching subtitle segment, the corresponding video file segment is cut and intercepted according to the new cutting time redundancy amount to obtain the corresponding video scene segment; if not, according to the video scene segment library The association relationship with the subtitle library obtains and matches the video scene segment corresponding to the subtitle segment. A system for searching a video scene segment based on a video scene library generated by the above method, the system comprising:
输入单元, 用户通过该输入单元输入信息;  An input unit through which the user inputs information;
搜索单元, 用于接收到输入单元发起请求时,在存储单元中对视频场景片段 的搜索;  a search unit, configured to: when receiving an input unit to initiate a request, search for a video scene segment in the storage unit;
存储单元, 用于存储生成的视频场景库, 即存储有相互关联的视频场景片段 库和字幕库;  a storage unit, configured to store the generated video scene library, that is, the stored video scene segment library and the subtitle library;
显示单元, 用于显示符合搜索条件的视频场景片段。  A display unit for displaying video clips that match the search criteria.
作为本发明的一种优选方案, 所述搜索系统进一步包括判断单元, 用于判断 搜索请求是针对对白或旁白类型的场景还是针对描述类型的场景,同时用于判断 是否要求对视频场景片段按输入的切割时间冗余量重新切割截取。  As a preferred solution of the present invention, the search system further includes a determining unit, configured to determine whether the search request is for a dialogue or a narration type scene or a description type of the scene, and is used for determining whether to request input of the video scene segment. The cutting time redundancy is re-cut and intercepted.
作为本发明的一种优选方案, 所述系统进一步包括切割单元, 用于对视频文 件按输入的切割时间冗余量重新切割截取。 一种直接搜索视频场景片段的方法, 所述方法包括如下步骤:  As a preferred aspect of the present invention, the system further includes a cutting unit for re-cutting the video file by an input cutting time redundancy amount. A method for directly searching for a video scene segment, the method comprising the following steps:
Α'、 对数据源中视频文件里的视频场景进行时间锚点标注和字幕附注; Β'、 提取标注的时间锚点和字幕段以及相关视频文件信息存入字幕库; C'、 用户输入关键字提出搜索视频场景片段的请求;  Α', time anchor annotation and subtitle annotation for the video scene in the video file in the data source; Β', extract the time anchor point and subtitle segment of the annotation and related video file information into the subtitle library; C', user input key The word proposes a request to search for a video scene segment;
D'、 通过关键字检索获得匹配的字幕段及其相应的时间锚点;  D', obtaining a matching subtitle segment and its corresponding time anchor point by keyword search;
E'、根据时间锚点和切割冗余量,对相应视频文件进行切割截取获得目标视 频场景片段, 返回给用户。 一种直接搜索视频场景片段的系统, 所述系统包括:  E', according to the time anchor point and the cutting redundancy, cut and intercept the corresponding video file to obtain the target video scene segment, and return it to the user. A system for directly searching for video scene segments, the system comprising:
标注单元,用以对数据源的视频文件里的视频场景进行时间锚点标注和字幕 附注;  An annotation unit for performing time anchor annotation and subtitle annotation on the video scene in the video file of the data source;
字幕库提取单元,用以提取标注的字幕段和时间锚点以及相关视频文件信息 存入字幕库; a subtitle library extracting unit for extracting the labeled subtitle segment and time anchor point and related video file information Saved in the subtitle library;
输入单元, 用户通过该输入单元输入信息;  An input unit through which the user inputs information;
搜索单元, 用于接收到输入单元发起请求时,在字幕库中进行关键字匹配检 索;  a search unit, configured to perform a keyword matching search in the subtitle library when receiving the input unit to initiate the request;
切割单元, 用于根据时间锚点和切割冗余量,对数据源中的视频文件切割截 取目标视频场景片段;  a cutting unit, configured to cut a target video scene segment by cutting a video file in the data source according to the time anchor point and the cutting redundancy amount;
显示单元, 用于显示符合搜索条件的视频场景片段。 一种视频场景的生成方法, 所述方法包括如下步骤:  A display unit for displaying video clips that match the search criteria. A method for generating a video scene, the method comprising the following steps:
'、 对数据源中视频文件里的视频场景进行时间锚点标注和字幕附注; ', perform time anchor annotation and subtitle annotation on the video scene in the video file in the data source;
B' '、 提取标注的字幕段和时间锚点存入字幕库; B' ', extracting the labeled subtitle segment and time anchor point into the subtitle library;
0 '、 根据标注的时间锚点对对应视频文件进行有冗余切割, 截取字幕段对 应的视频场景片段。 一种视频场景的生成系统, 所述系统包括:  0 ', the corresponding video file is redundantly cut according to the marked time anchor point, and the video scene segment corresponding to the subtitle segment is intercepted. A system for generating a video scene, the system comprising:
标注单元,用以对数据源的视频文件里的视频场景进行时间锚点标注和字幕 附注;  An annotation unit for performing time anchor annotation and subtitle annotation on the video scene in the video file of the data source;
字幕提取单元, 用以提取标注的字幕段和时间锚点存入字幕库;  a caption extraction unit, configured to extract the caption segment and the time anchor point into the caption library;
切割单元, 用以根据标注的时间锚点对视频文件进行有冗余切割,截取字幕 段对应的视频场景片段。 本发明的有益效果在于:釆用本发明提出的视频场景库的生成方法及系统以 及搜索视频场景片段的方法及系统, 只需花费制作相应字幕文件的人力时间, 就 可以自动把视频文件里的视频场景釆集进视频场景库。 视频场景库类似于字库, 词库的概念,里面包含有来自各种视频文件里的各种视频场景片段以及相应的字 幕段。用户只要在终端输入关键字进行搜索, 就可以轻易获得大量来自不同视频 文件的目标视频场景片段, 免去了现今网络技术下, 为达到同样目的, 需要下载 或者拷贝大量大体积视频文件, 进而在字幕文件里检索, 在视频文件里定位, 切 割等等的麻烦。 弥补了现今视频搜索引擎不能搜索视频场景片段的不足, 为根据 关键字搜索视频场景的实现提供了一种方案, 为用户, 尤其是外语学习者和视频 编辑工作者节省了大量的时间, 提供了巨大的方便。 附图说明 The cutting unit is configured to perform redundant cutting on the video file according to the marked time anchor point, and intercept the video scene segment corresponding to the subtitle segment. The invention has the beneficial effects of: the method and system for generating a video scene library and the method and system for searching a video scene segment, which can be automatically used in a video file only by spending manpower time for creating a corresponding subtitle file. Video scenes are collected into the video scene library. The video scene library is similar to the font library, the concept of the thesaurus, which contains various video scene segments from various video files and corresponding subtitle segments. Users only need to input keywords in the terminal to search, you can easily get a large number of target video scene clips from different video files, eliminating the need of today's network technology, in order to achieve the same purpose, you need to download or copy a large number of large-volume video files, and then Retrieve in the subtitle file, locate in the video file, cut The trouble of cutting and so on. Compensating the shortcomings of video search engines that can't search video clips, providing a solution for the search of video scenes based on keywords, saving a lot of time for users, especially foreign language learners and video editors, Great convenience. DRAWINGS
图 1为本发明视频场景库生成方法的流程图。  FIG. 1 is a flowchart of a method for generating a video scene library according to the present invention.
图 2为本发明视频场景搜索方法的流程图。  2 is a flow chart of a video scene search method according to the present invention.
图 3为本发明视频场景库的结构图。  FIG. 3 is a structural diagram of a video scene library of the present invention.
图 4 为本发明视频场景库生成系统的示意图。  FIG. 4 is a schematic diagram of a video scene library generation system according to the present invention.
图 5为本发明视频场景搜索系统的示意图。  FIG. 5 is a schematic diagram of a video scene search system of the present invention.
图 6为本发明视频场景搜索实例的示意图。 具体实施方式  FIG. 6 is a schematic diagram of a video scene search example of the present invention. detailed description
下面结合附图详细说明本发明的优选实施例。  Preferred embodiments of the present invention will be described in detail below with reference to the accompanying drawings.
实施例一  Embodiment 1
本发明方案所涉及的视频文件, 其内容、 格式、 类型等属性都不影响本发明 方案的实施。 下面例子中就以一般的英文电影视频的文件为例子,但本发明方案 的实施并不限于英文电影的视频文件。 如, 本发明也适用于华语电影、 其他外语 电影、 非电影类视频。  The video file according to the solution of the present invention does not affect the implementation of the present invention in terms of its content, format, type and the like. In the following example, a file of a general English movie video is taken as an example, but the implementation of the solution of the present invention is not limited to the video file of an English movie. For example, the present invention is also applicable to Chinese movies, other foreign language movies, and non-film videos.
参阅图 1 , 图 1揭示了本发明视频场景库的生成方法的较佳实施实例, 所述 方法包括如下步骤:  Referring to FIG. 1, FIG. 1 discloses a preferred implementation example of a method for generating a video scene library according to the present invention. The method includes the following steps:
【步骤 101】  [Step 101]
依照预设的取景规则 ,对视频文件的每一个视频场景进行时间锚点标注和字 幕附注。典型的取景规则有, 以视频中的每一句完整的对白或者旁白为一个场景 单位,还有以一个特定的场景为一个场景单位。 字幕内容可以是对白 /旁白原文, 或者对白 /旁白的同义解释或概括, 对应的是对白或旁白类型的视频场景, 也可 以是场景描述标签, 对应的是描述性视频场景。 以《阿甘正传》这部影片的视频文件为例, 假设预设的取景规则是以视频中 的每一句完整的对白或者旁白为单位进行取景, 此外还对特定的场景进行取景 , 比如雪景, 海景, 雨景, 战斗场景等。 4叚设电影里一共有 2000句对白和旁白, 50个特定的场景。 那么这部影片便被定义了 2050个视频场景。 该步骤就要对这 2050 个视频场景进行时间锚点标注和字幕附注。 标注的时间锚点有: 开始时间 锚点和结束时间锚点。开始时间锚点是指视频场景在视频文件中播放开始的时间 点。 结束时间锚点是指视频场景在视频文件中播放结束的时间点; 字幕附注的内 容包括对白或旁白和场景描述标签。 According to the preset framing rule, time anchor annotation and subtitle annotation are performed on each video scene of the video file. A typical framing rule is to use a complete dialogue or narration for each scene unit in the video, and a specific scene as a scene unit. The subtitle content can be a dialogue/narration original text, or a synonym explanation or generalization of the dialogue/narration, corresponding to a video scene of a dialogue or narration type, or a scene description label, corresponding to a descriptive video scene. Take the video file of the film "Forrest Gump" as an example. Assume that the preset framing rule is framing with each complete dialogue or narration in the video, and also framing a specific scene, such as snow scene. , seascape, rain, battle scenes, etc. 4 There are a total of 2000 dialogues and narration in the movie, 50 specific scenes. Then the film was defined with 2050 video scenes. This step requires time anchor annotation and caption annotation for these 2050 video scenes. The time anchors marked are: start time anchor and end time anchor point. The start time anchor point is the point in time at which the video scene starts playing in the video file. The end time anchor point refers to the time point at which the video scene ends playing in the video file; the content of the subtitle note includes the dialogue or narration and the scene description label.
比如, 在 "00: 32: 46. 634" 和 "00: 32: 48. 727" 的时间段之间, 有一句 "My name i s Forres t Gump" 的对白。 根据取景规则, 以这个时间段为中心, 便产生 了一个视频场景单位。 标注的开始时间锚点是 " 00: 32: 46. 634" , 结束时间锚点 是 "00: 32: 48. 727" , 附注的字幕内容是对白内容 "My name i s Forres t Gump" 0 该段视频场景属于对白或旁白类型的视频场景。 For example, between "00: 32: 46. 634" and "00: 32: 48. 727", there is a dialogue "My name is Forres t Gump". According to the framing rule, a video scene unit is generated centering on this time period. The start time anchor of the annotation is "00: 32: 46. 634", the end time anchor is "00: 32: 48. 727", and the subtitle content of the annotation is the dialogue content "My name is Forres t Gump" 0 A video scene is a video scene of a dialogue or narration type.
又如, 在 "00: 49: 10. 123" 和 "00: 51: 06. 351" 的时间段之间, 是一个较为 独立的战斗场景。 根据取景规则, 以这个时间段为中心, 便产生了一个视频场景 单位。标注的开始时间锚点是" 00: 49: 10. 123" ,结束时间锚点是" 00: 51: 06. 351" , 附注的字幕内容是场景描述标签 "战争"。 该段视频场景属于描述类型的视频场 景。  Another example is that between "00: 49: 10. 123" and "00: 51: 06. 351", it is a relatively independent battle scene. According to the framing rule, a video scene unit is generated centering on this time period. The start time anchor of the callout is "00: 49: 10.123", the end time anchor is "00: 51: 06. 351", and the subtitle of the note is the scene description tag "War". This video scene belongs to a video scene of the description type.
该步骤釆用一般影视字幕制作技术对视频场景进行标注,输出包含时间锚点 和字幕段的字幕文件。 字幕制作技术是现有成熟技术, 在此不做赞述。  In this step, the video scene is marked with a general video subtitle creation technology, and a subtitle file containing a time anchor point and a subtitle segment is output. Subtitle production technology is a mature technology, and I will not comment here.
【步骤 102】  [Step 102]
通过正则表达式匹配 ,提取所有标注的时间锚点和字幕以及相关的视频文件 信息, 存入字幕库。 字幕库可釆用一般商用数据库产品。 每个存储元素包括一个 完整的字幕附注及对应的开始时间锚点和结束时间锚点。整个字幕库的主要结构 如下表格所示: 序号 开始时间锚点 结束时间锚点 字幕信息 类型 来自视频Through regular expression matching, all labeled time anchors and subtitles and related video file information are extracted and stored in the subtitle library. The subtitle library can use general commercial database products. Each storage element includes a complete subtitle note and corresponding start time anchor and end time anchor. The main structure of the entire subtitle library is shown in the following table: Serial number start time anchor end time anchor point subtitle information type from video
… … … … … … ... ... ... ... ... ...
N 00:07:18.269 00:07:20.438 How are you? 对白 《闻香识女人》 N 00:07:18.269 00:07:20.438 How are you? Dialogue "Smell the woman"
N+l 00:32:46.634 00:32:48.727 My name is Forrest Gump. 对白 《阿甘正传》N+l 00:32:46.634 00:32:48.727 My name is Forrest Gump. Talk "Forrest Gump"
N+2 00:49:10.123 00:51:06.351 战争 描述 《阿甘正传》N+2 00:49:10.123 00:51:06.351 War Description "Forrest Gump"
… … … … … … ... ... ... ... ... ...
表 1  Table 1
【步骤 103】 [Step 103]
以所标注的时间锚点为输入参数,循环利用多媒体编程语言的切割函数, 比 如 Java媒体架构(JMF)里的 Cut类里的相关函数,对视频文件按切割时间冗余量 进行切割截取视频场景片段。切割时间冗余量是指, 以目标视频场景所在的时间 段为中心, 向前还有向后扩展的时间量。 设置切割时间冗余量的目的是, 为了用 户可能需要了解目标视频场景的上下文信息。切割时间冗余量一般是一个以字幕 段文字长度和视频场景段时间长度为自变量的函数, 即 z = f ( X, y), 其中 z 代表切割时间冗余量, X是字幕段的单词数或者字数, y是标注视频场景的时间 长度。切割时间冗余量也可以是一个人为定义的常量。视频场景片段的开始切割 时间点 = 标注的开始时间锚点 -切割时间冗余量 ;结束切割时间点 = 标注的 结束时间锚点 + 切割时间冗余量。  Taking the marked time anchor as the input parameter, recycling the cutting function of the multimedia programming language, such as the related function in the Cut class in the Java Media Architecture (JMF), cutting and capturing the video scene according to the cutting time redundancy of the video file. Fragment. The cutting time redundancy refers to the amount of time that extends forward and backward, centering on the time period in which the target video scene is located. The purpose of setting the cutting time redundancy is to allow the user to know the context information of the target video scene. The cutting time redundancy is generally a function of the subtitle segment text length and the video scene segment time length as independent variables, ie z = f (X, y), where z represents the cutting time redundancy and X is the subtitle segment word. Number or number of words, y is the length of time to mark the video scene. The cutting time redundancy can also be an artificially defined constant. Start cutting of video clips Time point = Start time anchor of the label - Cut time redundancy; End cut time point = End time of the label Anchor + Cut time redundancy.
以 "My name is Forrest Gump" 这个字幕单位为例子。 开始时间错点是 "00: 07: 18.269", 结束时间锚点是 "00: 07: 20.438"。 假设计算所得或者预定义 的切割时间冗余量是 3秒。 那么从 "00: 07: 15.269" 到 "00: 07: 23.438" 时间段 的视频场景将作为目标视频场景片段被截取出来。  Take the subtitle unit "My name is Forrest Gump" as an example. The starting time is "00: 07: 18.269" and the ending time anchor is "00: 07: 20.438". Assume that the calculated or predefined cut time redundancy is 3 seconds. Then the video scene from "00: 07: 15.269" to "00: 07: 23.438" will be taken as the target video scene segment.
将所有切割截取获得的视频场景片段集,存入视频场景片段库。 这里视频场 景片段库的概念对应的实体可以是一般商用数据库产品,也可以是普通操作系统 中的文件集。  The video scene segment set obtained by cutting all the cuts is stored in the video scene segment library. Here, the entity corresponding to the concept of the video scene fragment library may be a general commercial database product or a file set in a common operating system.
【步骤 104】  [Step 104]
利用数据库关联技术, 建立视频场景片段库和字幕库的关联关系, 综合形成 可供搜索的视频场景库。 参照图 3, 每一个视频场景片段都有对应的字幕段。  Using the database association technology, the association relationship between the video scene segment library and the subtitle library is established, and a video scene library for searching is comprehensively formed. Referring to FIG. 3, each video scene segment has a corresponding subtitle segment.
对于数据源的每个视频文件, 重复步骤 101至步骤 104, 便可将不同视频文 件里的所有视频场景及相关字幕段收录进视频场景库。 For each video file of the data source, repeat steps 101 to 104 to display different video files. All video scenes and related subtitles in the clip are included in the video scene library.
此外, 由于步骤 102、 步骤 1 03无必要顺序依赖, 步骤 102、 步骤 1 03的顺 序可互换。 以上介绍了本发明的视频场景库的生成方法, 本发明在揭示上述方法的同 时, 还揭示了上述方法对应的视频场景库的生成系统。 请参阅图 4 , 所述视频场 景库的生成系统包括: 标注单元 401、 字幕提取单元 402、 切割单元 403、 关系 建立单元 404。  In addition, since step 102 and step 103 do not need to be sequentially dependent, the order of step 102 and step 103 can be interchanged. The method for generating a video scene library of the present invention has been described above. The present invention also discloses a method for generating a video scene library corresponding to the above method while revealing the above method. Referring to FIG. 4, the video scene database generation system includes: an annotation unit 401, a subtitle extraction unit 402, a cutting unit 403, and a relationship establishing unit 404.
标注单元 401 用以对数据源的视频文件里的视频场景进行时间锚点标注和 字幕附注;字幕提取单元 402用以提取标注的字幕段和时间锚点以及其他视频文 件信息存入字幕库;切割单元 403用以根据标注的时间锚点对视频文件进行有冗 余切割, 截取该字幕对应的视频场景片段, 存入视频场景片段库; 关系建立单元 404用以建立字幕库里的字幕段和视频场景库里的视频场景片段的对应关系。 各 单元的作用原理可参考上述方法的描述, 这里不作赘述。 通过以上视频场景库的生成方法及系统, 可获得视频场景库, 以下介绍基于 视频场景库的搜索视频场景片段的方法。 参阅图 2、 图 6 , 本发明的基于视频场 景库的搜索视频场景片段的方法包括如下步骤:  The labeling unit 401 is configured to perform time anchor annotation and caption annotation on the video scene in the video file of the data source; the caption extraction unit 402 is configured to extract the labeled caption segment and time anchor point and other video file information into the caption library; The unit 403 is configured to perform redundant cutting on the video file according to the marked time anchor point, and intercept the video scene segment corresponding to the subtitle, and store the video scene segment in the video scene segment library; the relationship establishing unit 404 is configured to create the subtitle segment and the video in the subtitle library. Correspondence of video scene segments in the scene library. The working principle of each unit can be referred to the description of the above method, and will not be described here. The video scene library can be obtained by the above method and system for generating a video scene library. The following describes a method for searching a video scene segment based on the video scene library. Referring to FIG. 2 and FIG. 6, the method for searching a video scene segment based on the video scene library of the present invention includes the following steps:
【步骤 201】  [Step 201]
用户输在终端输入关键字, 如在输入框 601中输入 "how are you" , 发出搜 索视频场景的请求。  The user inputs a keyword in the terminal, for example, enter "how are you" in the input box 601 to issue a request to search for a video scene.
【步骤 202】  [Step 202]
根据选项 602 , 判断请求是针对对白或旁白类型的视频场景, 还是针对描述 类型的视频场景, 点击搜索视频场景按钮 603。 如果是针对对白或旁白类型的视 频场景,则执行步骤 203 ;如果是执行针对描述类型的视频场景,则执行步骤 204。  According to option 602, it is determined whether the request is for a video scene of a dialogue or narration type, or for a video scene of a description type, clicking the search video scene button 603. If it is a video scene for a dialogue or narration type, step 203 is performed; if it is a video scene for the description type, step 204 is performed.
【步骤 203】  [Step 203]
在字幕库里搜索匹配的对白或旁白类型的字幕段。 比如, 输入的关键字是 "how are you" , 那么来自 《Scent Of A Woman》 的 "how are you? " , 《Bad Lieutenant》 的 "how are you do ing?" , 《Mona Li sa Smi le》 的 "Hi. How are you?" 等等都是匹配的目标字幕段。 匹配结果显示在列表项 604。 Search for subtitles in the subtitles library for matching dialogues or narration types. For example, the keyword entered is "how are you", then "how are you?" from "Scent Of A Woman", "Bad Lieutenant's "how are you do ing?", "Mona Li sa Smi le"'s "Hi. How are you?" and so on are matching target subtitle segments. The matching result is displayed in list item 604.
【步骤 204】  [Step 204]
在字幕库里搜索匹配的描述性的字幕段。 比如输入的关键字是 "战争", 来 自 ((Forres t Gump» (( Independence Day» «Ava tar»等等片子的战争视频场景片 段所对应的字幕段都是匹配的目标字幕段。  Search for matching descriptive subtitle segments in the subtitles library. For example, the keyword entered is "war", from (Forres t Gump» ((Independence Day» «Ava tar» and so on, the subtitle segments corresponding to the war video scene segment are matching target subtitle segments.
【步骤 205】  [Step 205]
根据选择默认播放 605 , 还是选择切割播放 606 , 判断是请求从视频场景库 里直接返回相关的视频场景片段,还是依照新输入的切割时间冗余量实时对视频 文件切割截取获得视频场景片段。如果是要求直接从视频场景库返回, 则执行步 骤 206 , 如果要求实时切割, 则执行步骤 207。  According to the selection of the default play 605, or the cut play 606, it is determined whether the request directly returns the relevant video scene segment from the video scene library, or the video file segment is obtained by cutting the video file in real time according to the newly input cutting time redundancy. If it is required to return directly from the video scene library, step 206 is performed, and if real-time cutting is required, step 207 is performed.
【步骤 206】  [Step 206]
依照视频场景库里视频场景片段库和字幕库的关联关系,通过搜索匹配所得 的字幕段返回对应的目标视频场景片段。 比如直接返回 《Scent Of A Woman》 的 "how are you?" , 《Bad L ieutenant》 的 "how are you do ing?" , 《Mona Li sa Smi le》 的 "Hi. How are you?" 等等匹配的字幕段对应的目标视频场景片段。  According to the association relationship between the video scene segment library and the subtitle library in the video scene library, the corresponding target video scene segment is returned by searching for the matched subtitle segment. For example, "how are you?" in "Scent Of A Woman", "how are you do ing?" in "Bad L ieutenant", "Hi. How are you?" in "Mona Li sa Smi le", etc. The target video scene segment corresponding to the matched subtitle segment.
【步骤 207】  [Step 207]
通过搜索匹配所得的字幕段进一步取得时间锚点。 比如《Scent Of A Woman)) 的 "how are you?" 的时间锚点是 "00: 07: 18. 269" 和 "00: 07: 20. 438"。 假如 用户选择切割播放 606 , 在系统弹出框 607中重新输入的时间冗余量是 5秒, 点 击切割播放后, 那么程序将对 ((Scent Of A Woman)) 的从 " 00: 07: 13. 269" 到 "00: 07: 25. 438" 的视频场景片段进行切割, 返回给搜索用户。  The time anchor point is further obtained by searching for the matched subtitle segment. For example, "S are a you?" The "how are you?" time anchors are "00: 07: 18. 269" and "00: 07: 20. 438". If the user selects cut play 606, the time redundancy re-entered in system pop-up box 607 is 5 seconds. After clicking cut play, then the program will be ((Scent Of A Woman)) from "00: 07: 13. The 269" to "00: 07: 25. 438" video clip is cut and returned to the search user.
【步骤 208】  [Step 208]
显示搜得的目标视频场景片段, 参照视频播放界面 608。 以上介绍了本发明的基于视频场景库的搜索视频场景片段的方法,本发明在 揭示上述方法的同时,还揭示上述方法对应的基于视频场景库的搜索视频场景片 段的系统; 请参阅图 5 , 该搜索系统包括: 输入单元 501、 判断单元 502、 搜索 单元 503、 切割单元 504、 存储单元 505、 显示单元 506。 The searched target video scene segment is displayed, referring to the video play interface 608. The method for searching a video scene segment based on the video scene library of the present invention is described above. The present invention also discloses a system for searching a video scene segment based on a video scene library corresponding to the above method. Referring to FIG. 5 , The search system includes: an input unit 501, a determination unit 502, and a search The unit 503, the cutting unit 504, the storage unit 505, and the display unit 506.
输入单元 501用于获取用户输入的信息, 包括关键字、 切割时间冗余量。 判断单元 502 用于判断是对对白或旁白的类型的视频场景进行搜索还是对 描述类型的视频场景进行搜索。如果是前者, 则调用搜索单元对存储单元里的视 频场景库里的对白或旁白类型的视频场景进行搜索; 如果是后者, 则调用搜索单 元对存储单元里的视频场景库里的描述类型的视频场景进行搜索。 该判断单元 502还用于判断是否要求对视频场景片段按输入的切割时间冗余量实时切割, 如 割时间冗余量进行切割。如果不要求实时切割, 则直接从存储单元里的视频场景 库返回相关的视频场景片段。  The input unit 501 is configured to acquire information input by the user, including keywords, cutting time redundancy. The determining unit 502 is configured to determine whether to search for a video scene of a type of dialogue or narration or a video scene of a description type. If it is the former, the search unit is called to search for the video scene of the dialogue or narration type in the video scene library in the storage unit; if the latter, the search unit is called for the description type in the video scene library in the storage unit. Video scenes are searched. The determining unit 502 is further configured to determine whether it is required to perform real-time cutting on the video scene segment according to the input cutting time redundancy, such as cutting time redundancy. If real-time cutting is not required, the relevant video scene segment is returned directly from the video scene library in the storage unit.
搜索单元 503用于接收到判断单元发起请求时,在存储单元中对字幕和视频 场景片段的搜索。  The searching unit 503 is configured to receive a search for the subtitle and the video scene segment in the storage unit when the determining unit initiates the request.
切割单元 504用于接收到判断单元发起请求的时候,对存储单元里的视频文 件按输入的切割时间冗余量重新切割截取。  The cutting unit 504 is configured to re-cut the video file in the storage unit according to the input cutting time redundancy amount when receiving the requesting unit to initiate the request.
存储单元 505 用于存储生成的视频场景库和生成视频场景库的数据源视频 文件。  The storage unit 505 is configured to store the generated video scene library and the data source video file of the generated video scene library.
显示单元 506用于显示符合搜索条件的视频场景片段。  The display unit 506 is for displaying a video scene segment that matches the search condition.
通过本例所述的视频场景库的生成方法和系统, 可以获得类似字库,词库概 念的视频场景库,库里包含有来自各种视频文件里的各种视频场景片段以及相应 的字幕段。通过本例所述的基于视频场景库的搜索视频场景的方法, 用户只要在 终端输入关键字进行搜索,就可以轻易获得大量来自不同视频文件的目标视频场 景片段, 免去了现今网络技术下, 为达到同样目的, 需要下载或者拷贝大量大体 积视频文件, 进而在字幕文件里检索, 在视频文件里定位, 切割等等的麻烦。 实施例二  Through the method and system for generating a video scene library as described in this example, a video scene library similar to a font library and a thesaurus concept can be obtained. The library contains various video scene segments from various video files and corresponding subtitle segments. According to the method for searching a video scene based on the video scene library in this example, the user can easily obtain a large number of target video scene segments from different video files by inputting keywords in the terminal, thereby eliminating the current network technology. In order to achieve the same purpose, it is necessary to download or copy a large number of large-volume video files, and then search in the subtitle file, and locate, cut, etc. in the video file. Embodiment 2
本实施例与实施例一的区别在于, 本实施例中, 没有构建视频场景库, 而在 用户提出搜索视频场景片段的请求时,根据搜索所得的字幕段对应的时间锚点对 数据源中的视频文件进行实时切割,将目标视频场景片段返回给用户。各个步骤 的技术细节以及各个单元的作用原理, 可以参考实施例一, 在此不做赘述。 一种直接搜索视频场景片段的方法, 所述方法包括如下步骤: The difference between this embodiment and the first embodiment is that, in this embodiment, the video scene library is not constructed, and when the user requests to search for a video scene segment, the time anchor point corresponding to the searched subtitle segment is in the data source. The video file is cut in real time, and the target video scene segment is returned to the user. Various steps For the technical details and the working principle of each unit, refer to the first embodiment, and no further details are provided herein. A method for directly searching for a video scene segment, the method comprising the following steps:
Α'、 对数据源中视频文件里的视频场景进行时间锚点标注和字幕附注; Β'、 提取标注的时间锚点和字幕段以及相关视频文件信息存入字幕库; C'、 用户输入关键字提出搜索视频场景片段的请求;  Α', time anchor annotation and subtitle annotation for the video scene in the video file in the data source; Β', extract the time anchor point and subtitle segment of the annotation and related video file information into the subtitle library; C', user input key The word proposes a request to search for a video scene segment;
D'、 通过关键字检索获得匹配的字幕段及其相应的时间锚点;  D', obtaining a matching subtitle segment and its corresponding time anchor point by keyword search;
E'、根据时间锚点和切割冗余量,对相应视频文件进行切割截取获得目标视 频场景片段, 返回给用户; 上述方法对应的一种直接搜索视频场景片段的系统, 所述系统包括: 标注单 元、 字幕库提取单元、 输入单元、 搜索单元、 切割单元、 显示单元。  E', according to the time anchor point and the cutting redundancy amount, the corresponding video file is cut and intercepted to obtain the target video scene segment, and returned to the user; the above method corresponds to a system for directly searching for a video scene segment, the system includes: Unit, subtitle library extraction unit, input unit, search unit, cutting unit, display unit.
标注单元用以对数据源的视频文件里的视频场景进行时间锚点标注和字幕 附注;  The labeling unit is configured to perform time anchor annotation and subtitle annotation on the video scene in the video file of the data source;
字幕库提取单元用以提取标注的字幕段和时间锚点以及相关视频文件信息 存入字幕库;  The subtitle library extracting unit is configured to extract the labeled subtitle segment and the time anchor point and the related video file information into the subtitle library;
输入单元用户通过该输入单元输入信息;  The input unit user inputs information through the input unit;
搜索单元用于接收到输入单元发起请求时, 在字幕库中进行关键字匹配检 索;  The search unit is configured to perform a keyword matching search in the subtitle library when receiving the input unit to initiate the request;
切割单元用于根据时间锚点和切割冗余量,对数据源中的视频文件截取目标 视频场景片段;  The cutting unit is configured to intercept the target video scene segment from the video file in the data source according to the time anchor point and the cutting redundancy amount;
显示单元用于显示符合搜索条件的视频场景片段。  The display unit is used to display a video clip that matches the search criteria.
通过本例所述的直接搜索视频场景片段的方法和系统,不需要依赖视频场景 库, 用户只要在终端输入关键字进行搜索, 就可以轻易获得大量来自不同视频文 件的实时切割截取的目标视频场景片段,免去了现今网络技术下, 为达到同样目 的, 需要下载或者拷贝大量大体积视频文件, 进而在字幕文件里检索, 在视频文 件里定位, 切割等等的麻烦。 实施例三 本实施例与以上两个实施例的区别在于,本实施例只是一个生成视频场景的 方法和系统。各个步骤的技术细节以及各个单元的作用原理,可以参考实施例一, 在此不做赘述。 本实施例揭示的一种视频场景的生成方法, 所述方法包括如下步 骤: The method and system for directly searching for video scene segments described in this example do not need to rely on the video scene library. The user can easily obtain a large number of target video scenes from real-time cutting and intercepting different video files by inputting keywords in the terminal for searching. Fragments, eliminating the need for today's network technology, in order to achieve the same purpose, you need to download or copy a large number of large-volume video files, and then search in the subtitle file, positioning, cutting and so on in the video file. Embodiment 3 The difference between this embodiment and the above two embodiments is that the embodiment is only a method and system for generating a video scene. For the technical details of the various steps and the working principle of each unit, reference may be made to the first embodiment, and details are not described herein. A method for generating a video scene disclosed in this embodiment, where the method includes the following steps:
'、 对数据源中视频文件里的视频场景进行时间锚点标注和字幕附注; ', perform time anchor annotation and subtitle annotation on the video scene in the video file in the data source;
B' '、 提取标注的字幕段和时间锚点存入字幕库; B' ', extracting the labeled subtitle segment and time anchor point into the subtitle library;
0 '、 根据标注的时间锚点对对应视频文件进行有冗余切割, 截取字幕段对 应的视频场景片段。  0 ', the corresponding video file is redundantly cut according to the marked time anchor point, and the video scene segment corresponding to the subtitle segment is intercepted.
上述方法对应的一种视频场景的生成系统, 所述系统包括标注单元、字幕提 取单元、 切割单元。  A system for generating a video scene corresponding to the above method, the system comprising an annotation unit, a subtitle extraction unit, and a cutting unit.
标注单元用以对数据源的视频文件里的视频场景进行时间锚点标注和字幕 附注;  The labeling unit is configured to perform time anchor annotation and subtitle annotation on the video scene in the video file of the data source;
字幕提取单元用以提取标注的字幕段和时间锚点存入字幕库;  The caption extraction unit is configured to extract the labeled caption segment and the time anchor point and store the caption library in the caption library;
切割单元用以根据标注的时间锚点对视频文件进行有冗余切割,截取字幕段 对应的视频场景片段。  The cutting unit is configured to perform redundant cutting on the video file according to the marked time anchor point, and intercept the video scene segment corresponding to the subtitle segment.
通过本例所述的方法和系统, 可以方便快捷的获得大量的视频场景片段, 为 用户的相关工作提供大量的视频场景素材。 这里本发明的描述和应用是说明性的,并非想将本发明的范围限制在上述实 施例中。 这里所披露的实施例的变形和改变是可能的,对于那些本领域的普通技 术人员来说实施例的替换和等效的各种部件是公知的。本领域技术人员应该清楚 的是, 在不脱离本发明的精神或本质特征的情况下, 本发明可以以其它形式、 结 构、 布置、 比例, 以及用其它组件、 材料和部件来实现。 在不脱离本发明范围和 精神的情况下, 可以对这里所披露的实施例进行其它变形和改变。  Through the method and system described in this example, a large number of video scene segments can be obtained conveniently and quickly, and a large amount of video scene material is provided for the related work of the user. The description and application of the present invention are intended to be illustrative, and not intended to limit the scope of the invention. Variations and modifications of the embodiments disclosed herein are possible, and various alternative and equivalent components of the embodiments are well known to those of ordinary skill in the art. It is apparent to those skilled in the art that the present invention may be embodied in other forms, configurations, arrangements, ratios, and other components, materials and components without departing from the spirit or essential characteristics of the invention. Other variations and modifications of the embodiments disclosed herein may be made without departing from the scope and spirit of the invention.

Claims

权利要求书 、 一种视频场景库的生成方法, 其特征在于, 所述方法包括如下步骤:  The present invention provides a method for generating a video scene library, the method comprising the following steps:
A、 对数据源中视频文件里的视频场景进行时间锚点标注和字幕附注; A. Perform time anchor annotation and subtitle annotation on the video scene in the video file in the data source;
B、 提取标注的字幕段存入字幕库; B. Extracting the subtitle segments of the annotation into the subtitle library;
C、根据标注的时间锚点对对应视频文件进行有冗余切割,截取该字幕对 应的视频场景片段, 存入视频场景片段库;  C. Perform redundant cutting on the corresponding video file according to the marked time anchor point, intercept the video scene segment corresponding to the subtitle, and store it in the video scene segment library;
D、 建立字幕库里的字幕段和视频场景库里的视频场景片段的对应关系。 、 根据权利要求 1所述的视频场景库的生成方法, 其特征在于:  D. Establish a correspondence between the subtitle segment in the subtitle library and the video scene segment in the video scene library. The method for generating a video scene library according to claim 1, wherein:
所述步骤 A中, 字幕附注包括视频场景里的对白 /旁白原文, 或对白 /旁 白的同义解释或概括, 或者描述视频场景类型的标签。 、 根据权利要求 1所述的视频场景库的生成方法, 其特征在于:  In the step A, the subtitle notes include a synonym explanation or generalization of the dialogue/narration text in the video scene, or a dialogue/narration, or a label describing the type of the video scene. The method for generating a video scene library according to claim 1, wherein:
所述步骤 B进一步包括提取时间锚点和相关视频文件信息存入字幕库。 、 根据权利要求 1所述的视频场景库的生成方法, 其特征在于:  The step B further includes extracting the time anchor point and related video file information into the subtitle library. The method for generating a video scene library according to claim 1, wherein:
所述步骤^ 步骤 C顺序互换。 、 一种视频场景库的生成系统, 其特征在于, 所述系统包括:  The step ^ step C is sequentially exchanged. A system for generating a video scene library, the system comprising:
标注单元, 用以对数据源的视频文件里的视频场景进行时间锚点标注和 字幕附注;  An annotation unit, configured to perform time anchor annotation and subtitle annotation on the video scene in the video file of the data source;
字幕提取单元, 用以提取标注的字幕段存入字幕库;  a subtitle extraction unit, configured to extract the subtitle segments of the annotation into the subtitle library;
切割单元, 用以根据标注的时间锚点对视频文件进行有冗余切割, 截取 该字幕对应的视频场景片段, 存入视频场景片段库;  a cutting unit, configured to perform redundant cutting on the video file according to the marked time anchor point, and intercept the video scene segment corresponding to the subtitle, and store the video scene segment into the video scene segment library;
关系建立单元, 用以建立字幕库里的字幕段和视频场景库里的视频场景 片段的对应关系。 、 根据权利要求 5所述的视频场景库的生成系统, 其特征在于: The relationship establishing unit is configured to establish a correspondence between the subtitle segment in the subtitle library and the video scene segment in the video scene library. The system for generating a video scene library according to claim 5, characterized in that:
所述标注单元中, 字幕附注包括视频场景里的对白 /旁白原文, 或对白 / 旁白的同义解释或概括, 或者描述视频场景类型的标签;  In the labeling unit, the caption note includes a synonym explanation or generalization of the dialogue/narration text in the video scene, or a dialogue/narration, or a label describing the type of the video scene;
所述字幕提取单元进一步提取时间锚点和相关视频文件信息存入字幕 库。 、 一种基于权利要求 1所述方法所生成的视频场景库的搜索视频场景片段的方 法, 其特征在于, 所述方法包括如下步骤:  The caption extraction unit further extracts the time anchor point and related video file information into the caption library. A method for searching for a video scene segment based on a video scene library generated by the method of claim 1, wherein the method comprises the following steps:
a、 用户输入关键字提出搜索视频场景片段的请求;  a, the user inputs a keyword to request a search for a video scene segment;
b、 在字幕库里检索得到匹配的字幕段信息;  b. Searching for the matching subtitle segment information in the subtitle library;
c、 返回和匹配字幕段相关联的视频场景片段。 、 根据权利要求 7所述的搜索方法, 其特征在于:  c. Returns and matches the video clip associated with the subtitle segment. The search method according to claim 7, wherein:
所述步骤 a和步骤 b之间进一步包括:  The step a and the step b further include:
判断是请求搜索对白或旁白类型的视频场景, 还是请求描述类型的视频 场景;  Judging whether to request a search for a video scene of a dialogue or narration type, or request a video scene of a description type;
若为前者, 则步骤 b在字幕库里搜索对白或旁白类型的字幕段; 若为后 者, 则步骤 b在字幕库里搜索描述类型的字幕段。 、 根据权利要求 7所述的搜索方法, 其特征在于:  If it is the former, step b searches for a subtitle segment of the dialogue or narration type in the subtitle library; if it is the latter, step b searches the subtitle library for the subtitle segment of the description type. The search method according to claim 7, wherein:
所述步骤 b和步骤 c之间进一步包括:  The step b and the step c further include:
判断是否请求按照新的切割时间冗余量实时切割截取视频场景片段; 若是, 则根据匹配字幕段对应的时间锚点, 按照新的切割时间冗余量对 相应视频文件进行切割截取获得相应的视频场景片段; 若否, 则根据视频场 景片段库和字幕库的关联关系获得和匹配字幕段对应的视频场景片段。 0、 一种基于权利要求 1 所述方法所生成的视频场景库的搜索视频场景片段 的系统, 其特征在于, 所述系统包括: 输入单元, 用户通过该输入单元输入信息; Determining whether to request cutting the video scene segment in real time according to the new cutting time redundancy; if yes, according to the time anchor point corresponding to the matching subtitle segment, cutting and intercepting the corresponding video file according to the new cutting time redundancy to obtain the corresponding video a scene segment; if not, obtaining and matching a video scene segment corresponding to the subtitle segment according to the association relationship between the video scene segment library and the subtitle library. A system for searching a video scene segment based on a video scene library generated by the method of claim 1, wherein the system comprises: An input unit through which the user inputs information;
搜索单元, 用于接收到输入单元发起请求时, 在存储单元中对视频场景 片段的搜索;  a search unit, configured to: when receiving an input unit to initiate a request, searching for a video scene segment in the storage unit;
存储单元, 用于存储生成的视频场景库, 即存储有相互关联的视频场景 片段库和字幕库;  a storage unit, configured to store the generated video scene library, that is, the video scene fragment library and the subtitle library are stored;
显示单元, 用于显示符合搜索条件的视频场景片段。 、 根据权利要求 10所述的搜索系统, 其特征在于:  A display unit for displaying video clips that match the search criteria. The search system according to claim 10, characterized in that:
所述搜索系统进一步包括判断单元, 用于判断搜索请求是针对对白或旁 白类型的场景还是针对描述类型的场景, 同时用于判断是否要求对视频场景 片段按输入的切割时间冗余量重新切割截取。 、 根据权利要求 10所述的搜索系统, 其特征在于:  The search system further includes a determining unit, configured to determine whether the search request is for a dialogue or a narration type scene or a description type of scene, and is used for determining whether the video scene segment is required to be re-cut and intercepted according to the input cutting time redundancy amount. . The search system according to claim 10, characterized in that:
所述系统进一步包括切割单元, 用于对视频文件按输入的切割时间冗余 量重新切割。 、 一种直接搜索视频场景片段的方法,其特征在于,所述方法包括如下步骤: The system further includes a cutting unit for re-cutting the video file by the input cutting time redundancy. A method for directly searching for a video scene segment, the method comprising the steps of:
Α'、 对数据源中视频文件里的视频场景进行时间锚点标注和字幕附注; Β'、 提取标注的时间锚点和字幕段以及其他视频文件信息存入字幕库; C'、 用户输入关键字提出搜索视频场景片段的请求; Α', time anchor annotation and subtitle notes for video scenes in video files in the data source; Β', extract timed anchor points and subtitle segments and other video file information into the subtitle library; C', user input key The word proposes a request to search for a video scene segment;
D'、 通过关键字检索获得匹配的字幕段及其相应的时间锚点;  D', obtaining a matching subtitle segment and its corresponding time anchor point by keyword search;
E'、 根据时间锚点和切割冗余量, 对相应视频文件进行切割截取获得目 标视频场景片段, 返回给用户。 、 一种直接搜索视频场景片段的系统, 其特征在于, 所述系统包括:  E', according to the time anchor point and the cutting redundancy, cut and intercept the corresponding video file to obtain the target video scene segment, and return it to the user. A system for directly searching for a video scene segment, wherein the system includes:
标注单元, 用以对数据源的视频文件里的视频场景进行时间锚点标注和 字幕附注;  An annotation unit, configured to perform time anchor annotation and subtitle annotation on the video scene in the video file of the data source;
字幕库提取单元, 用以提取标注的字幕段和时间锚点以及其他视频文件 信息存入字幕库; a subtitle library extraction unit for extracting the labeled subtitle segments and time anchor points and other video files Information is stored in the subtitle library;
输入单元, 用户通过该输入单元输入信息;  An input unit through which the user inputs information;
搜索单元, 用于接收到输入单元发起请求时, 在字幕库中进行关键字匹 配检索;  a search unit, configured to perform keyword matching retrieval in the subtitle library when receiving the input unit to initiate the request;
切割单元, 用于根据时间锚点和切割冗余量, 对数据源中的视频文件切 割截取目标视频场景片段;  a cutting unit, configured to cut a target video scene segment by cutting a video file in the data source according to a time anchor point and a cutting redundancy amount;
显示单元, 用于显示符合搜索条件的视频场景片段。  A display unit for displaying video clips that match the search criteria.
15、 一种视频场景的生成方法, 其特征在于, 所述方法包括如下步骤: A method for generating a video scene, the method comprising the following steps:
'、 对数据源中视频文件里的视频场景进行时间锚点标注和字幕附注; B' '、 提取标注的字幕段和时间锚点存入字幕库;  ', perform time anchor annotation and subtitle annotation on the video scene in the video file in the data source; B' ', extract the subtitle segment and time anchor point of the annotation into the subtitle library;
0 '、根据标注的时间锚点对对应视频文件进行有冗余切割,截取字幕段 对应的视频场景片段。  0 ', the corresponding video file is redundantly cut according to the marked time anchor point, and the video scene segment corresponding to the subtitle segment is intercepted.
16、 一种视频场景的生成系统, 其特征在于, 所述系统包括: 16. A system for generating a video scene, the system comprising:
标注单元,用以对数据源的视频文件里的视频场景进行时间锚点标注和字幕 附注;  An annotation unit for performing time anchor annotation and subtitle annotation on the video scene in the video file of the data source;
字幕提取单元, 用以提取标注的字幕段和时间锚点存入字幕库;  a caption extraction unit, configured to extract the caption segment and the time anchor point into the caption library;
切割单元, 用以根据标注的时间锚点对视频文件进行有冗余切割,截取字幕 段对应的视频场景片段。  The cutting unit is configured to perform redundant cutting on the video file according to the marked time anchor point, and intercept the video scene segment corresponding to the subtitle segment.
PCT/CN2011/071072 2010-03-09 2011-02-18 Method and system for generating video scene library, method and system for retrieving video scenes WO2011110063A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201010120591.8 2010-03-09
CN2010101205918A CN102024009A (en) 2010-03-09 2010-03-09 Generating method and system of video scene database and method and system for searching video scenes

Publications (1)

Publication Number Publication Date
WO2011110063A1 true WO2011110063A1 (en) 2011-09-15

Family

ID=43865312

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2011/071072 WO2011110063A1 (en) 2010-03-09 2011-02-18 Method and system for generating video scene library, method and system for retrieving video scenes

Country Status (2)

Country Link
CN (1) CN102024009A (en)
WO (1) WO2011110063A1 (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107438204A (en) * 2017-07-26 2017-12-05 维沃移动通信有限公司 A kind of method and mobile terminal of media file loop play

Families Citing this family (20)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102595191A (en) * 2012-02-24 2012-07-18 央视国际网络有限公司 Method and device for searching sport events in sport event videos
CN102595206B (en) * 2012-02-24 2014-07-02 央视国际网络有限公司 Data synchronization method and device based on sport event video
CN102547141B (en) * 2012-02-24 2014-12-24 央视国际网络有限公司 Method and device for screening video data based on sports event video
CN102662970B (en) * 2012-03-09 2016-01-13 杭州海康威视数字技术股份有限公司 Based on video recording search and record a video collection control method and the system thereof of text message
CN102780856B (en) * 2012-04-12 2013-11-27 天脉聚源(北京)传媒科技有限公司 Method for annotating subtitles in news video
CN103838751A (en) * 2012-11-23 2014-06-04 鸿富锦精密工业(深圳)有限公司 Video content searching system and method
CN103473273B (en) 2013-08-22 2019-01-18 百度在线网络技术(北京)有限公司 Information search method, device and server
CN104053048A (en) * 2014-06-13 2014-09-17 无锡天脉聚源传媒科技有限公司 Method and device for video localization
CN104680188B (en) * 2015-03-24 2018-04-27 重庆大学 A kind of construction method of human body attitude reference image library
CN104883584A (en) * 2015-05-19 2015-09-02 福建宏天信息产业有限公司 Method and system for remote subtitle parsing
CN104915433A (en) * 2015-06-24 2015-09-16 宁波工程学院 Method for searching for film and television video
CN105430434A (en) * 2015-11-17 2016-03-23 北京奇虎科技有限公司 Method and device for downloading video
CN107273388A (en) * 2016-04-08 2017-10-20 北京国双科技有限公司 The treating method and apparatus and querying method and device of trial video
CN105956170B (en) * 2016-05-20 2019-07-19 微鲸科技有限公司 Real-time scene information embedding method, Scene realization system and implementation method
CN106952515A (en) * 2017-05-16 2017-07-14 宋宇 The interactive learning methods and system of view-based access control model equipment
CN107704525A (en) * 2017-09-04 2018-02-16 优酷网络技术(北京)有限公司 Video searching method and device
CN107785014A (en) * 2017-10-23 2018-03-09 上海百芝龙网络科技有限公司 A kind of home scenarios semantic understanding method
CN109933691B (en) * 2019-02-11 2023-06-09 北京百度网讯科技有限公司 Method, apparatus, device and storage medium for content retrieval
CN113672322B (en) * 2021-07-29 2024-05-24 浙江太美医疗科技股份有限公司 Method and device for providing interpretation information
CN115906781B (en) * 2022-12-15 2023-11-24 广州文石信息科技有限公司 Audio identification anchor adding method, device, equipment and readable storage medium

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1622609A (en) * 2003-11-28 2005-06-01 Lg电子株式会社 Method and apparatus for repetitive playback of a video section based on subtitles
CN101350904A (en) * 2007-07-19 2009-01-21 索尼株式会社 Video-recording/reproducing apparatus and video- recording/reproducing method
CN101650958A (en) * 2009-07-23 2010-02-17 中国科学院声学研究所 Extraction method and index establishment method of movie video scene clip

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1430166A (en) * 2003-01-07 2003-07-16 财团法人资讯工业策进会 Method of establishig film index database and recording medium
CN100449547C (en) * 2006-12-06 2009-01-07 华为技术有限公司 Medium contents management system and method

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1622609A (en) * 2003-11-28 2005-06-01 Lg电子株式会社 Method and apparatus for repetitive playback of a video section based on subtitles
CN101350904A (en) * 2007-07-19 2009-01-21 索尼株式会社 Video-recording/reproducing apparatus and video- recording/reproducing method
CN101650958A (en) * 2009-07-23 2010-02-17 中国科学院声学研究所 Extraction method and index establishment method of movie video scene clip

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107438204A (en) * 2017-07-26 2017-12-05 维沃移动通信有限公司 A kind of method and mobile terminal of media file loop play
CN107438204B (en) * 2017-07-26 2019-12-17 维沃移动通信有限公司 Method for circularly playing media file and mobile terminal

Also Published As

Publication number Publication date
CN102024009A (en) 2011-04-20

Similar Documents

Publication Publication Date Title
WO2011110063A1 (en) Method and system for generating video scene library, method and system for retrieving video scenes
JP6342951B2 (en) Annotate video interval
US8676835B2 (en) Annotation system for creating and retrieving media and methods relating to same
US7912827B2 (en) System and method for searching text-based media content
US20080177536A1 (en) A/v content editing
US20200126583A1 (en) Discovering highlights in transcribed source material for rapid multimedia production
EP1764712A1 (en) A system and method for searching and analyzing media content
JP5588561B2 (en) Media content providing method and apparatus
US20130007043A1 (en) Voice description of time-based media for indexing and searching
CN103593363A (en) Video content indexing structure building method and video searching method and device
JP4354441B2 (en) Video data management apparatus, method and program
CN110753269B (en) Video abstract generation method, intelligent terminal and storage medium
CN104915433A (en) Method for searching for film and television video
JP2007525900A (en) Method and apparatus for locating content in a program
CN111294660A (en) Video clip positioning method, server, client and electronic equipment
US10114891B2 (en) Method and system of audio retrieval and source separation
US7848598B2 (en) Image retrieval processing to obtain static image data from video data
WO2024109813A1 (en) Video processing method and apparatus
JP5243366B2 (en) Video summarization method and video summarization program
JP2008022292A (en) Performer information search system, performer information obtaining apparatus, performer information searcher, method thereof and program
Sack et al. Automated annotations of synchronized multimedia presentations
Morang et al. InfoLink: analysis of Dutch broadcast news and cross-media browsing
Carmichael et al. Multimodal indexing of digital audio-visual documents: A case study for cultural heritage data
Outtagarts et al. A cloud-based collaborative and automatic video editor
JP2004015748A (en) Moving image editing apparatus

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 11752806

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

32PN Ep: public notification in the ep bulletin as address of the adressee cannot be established

Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC (EPO FORM 1205A DATED 26.02.2013)

122 Ep: pct application non-entry in european phase

Ref document number: 11752806

Country of ref document: EP

Kind code of ref document: A1