KR20170088188A - Apparatus and method for dividing video contents into a plurality of shots using contents adaptive threshold - Google Patents
Apparatus and method for dividing video contents into a plurality of shots using contents adaptive threshold Download PDFInfo
- Publication number
- KR20170088188A KR20170088188A KR1020160008226A KR20160008226A KR20170088188A KR 20170088188 A KR20170088188 A KR 20170088188A KR 1020160008226 A KR1020160008226 A KR 1020160008226A KR 20160008226 A KR20160008226 A KR 20160008226A KR 20170088188 A KR20170088188 A KR 20170088188A
- Authority
- KR
- South Korea
- Prior art keywords
- shot
- image
- shots
- image information
- unit
- Prior art date
Links
Images
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/80—Generation or processing of content or additional data by content creator independently of the distribution process; Content per se
- H04N21/83—Generation or processing of protective or descriptive data associated with content; Content structuring
- H04N21/845—Structuring of content, e.g. decomposing content into time segments
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/80—Generation or processing of content or additional data by content creator independently of the distribution process; Content per se
- H04N21/83—Generation or processing of protective or descriptive data associated with content; Content structuring
- H04N21/845—Structuring of content, e.g. decomposing content into time segments
- H04N21/8455—Structuring of content, e.g. decomposing content into time segments involving pointers to the content, e.g. pointers to the I-frames of the video stream
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/80—Generation or processing of content or additional data by content creator independently of the distribution process; Content per se
- H04N21/83—Generation or processing of protective or descriptive data associated with content; Content structuring
- H04N21/845—Structuring of content, e.g. decomposing content into time segments
- H04N21/8456—Structuring of content, e.g. decomposing content into time segments by decomposing the content in the time domain, e.g. in time segments
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/80—Generation or processing of content or additional data by content creator independently of the distribution process; Content per se
- H04N21/85—Assembly of content; Generation of multimedia applications
- H04N21/854—Content authoring
- H04N21/8541—Content authoring involving branching, e.g. to different story endings
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/80—Generation or processing of content or additional data by content creator independently of the distribution process; Content per se
- H04N21/85—Assembly of content; Generation of multimedia applications
- H04N21/854—Content authoring
- H04N21/8549—Creating video summaries, e.g. movie trailer
Abstract
Description
BACKGROUND OF THE
TV contents, movies, and so on are produced and distributed, it is not easy for content users to find desired image contents. Accordingly, a service for efficiently retrieving or recommending video contents has been attracting attention to content users. In order for these services to meet the needs of individual users of the contents, there is a need for a technique for analyzing the characteristics of the image contents accurately and as specifically as possible. However, it is difficult to provide a satisfactory personalized service due to the limitations of the content characteristic analysis technique, because the search and recommendation of the content unit alone have a limitation in providing various services capable of providing high profit and utility.
In order to overcome such limitations, in addition to meta data relating to the entire content such as a genre of a content, characters and the like generally used for searching for and recommending a current content, meta data for each scene constituting a content episode Techniques for searching and recommending content and scenes based on data are being studied. Here, the content episode refers to a broadcast portion of the video content such as a drama and a movie.
An image content is composed of a plurality of frames. Normally, an image content of 1 second consists of 24 frames or 30 frames. Here, a 'frame' refers to a moving image content, that is, one of still images constituting moving image content, and a 'shot' generally refers to a scene without stopping at the time of shooting for producing an image content Indicates the unit of the image taken at one time. And 'scene' refers to a scene which is usually divided or integrated according to a scene, and God is an image unit in which a single situation, action, metabolism or event appears at the same time, same place, . In order to provide a scene-based search and recommendation service, a procedure of forming a shot composed of a plurality of frames, composition of a scene through clustering of the related shots, and metadata generation and tagging by analyzing the characteristics of each scene .
More specifically, in order to divide an image content into shots, color information analysis for each frame and similarity between the consecutive frames constituting the normal image are compared to determine whether to divide the shot. In order to determine whether or not to divide the shot, it is necessary that a threshold value serving as a reference of the shot division should be set in advance. In order to improve the accuracy of the shot division, it is important to appropriately set such a threshold value.
However, uniformly applying a predetermined threshold value without considering the types and characteristics of image contents can limit the accuracy of shot division. If the accuracy of shot segmentation is low, it is difficult to effectively provide scene-based search and recommendation services. Accordingly, when dividing each of various image contents into a plurality of shots, a method for increasing the accuracy is required.
An object of the present invention is to provide an apparatus and a method for dividing an image content into a plurality of shots in consideration of characteristics of image contents to be divided into shots.
Another object of the present invention is to provide an apparatus and a method for dividing a shot of an image content by dividing an image content into shots by applying an adaptively determined threshold considering the corresponding image content characteristic will be.
According to another aspect of the present invention, there is provided a method of dividing an image content into a plurality of shots, the method comprising: applying a predetermined threshold value to an analysis and processing result of image information on the image content, The method comprising the steps of: dividing the content into a plurality of first shots; obtaining a variation threshold through analysis and processing of image information for each of the plurality of first shots; and applying the variation threshold to at least one And dividing the first shot of the first shot into a plurality of second shots.
According to the embodiment of the present invention described above, when dividing the image content into shots, the threshold value of the inter-frame image feature value serving as a reference in dividing the image into different shots is not set as a fixed value, Adaptive decision is made considering image characteristics. Therefore, according to the embodiment of the present invention, it is possible to improve the accuracy of shot division on the image content. In addition, since the image contents are divided into shots with improved accuracy, it is possible to provide scene-based retrieval and recommendation services for image contents more efficiently and accurately in a customized manner.
1 is a block diagram illustrating a configuration of a shot division apparatus for video content according to an exemplary embodiment of the present invention.
2 is a block diagram showing an example of a detailed configuration of the threshold management unit of FIG.
3 is a flowchart illustrating an example of a method of dividing a shot of video content according to an embodiment of the present invention.
FIG. 4 is a simplified schematic diagram of an aspect of a shot division method according to an embodiment of the present invention, described above with reference to FIG.
FIG. 5 is a schematic diagram illustrating another aspect of the shot dividing method according to an embodiment of the present invention described above with reference to FIG.
BRIEF DESCRIPTION OF THE DRAWINGS Fig. However, these drawings are only an example for easily describing the content and scope of technical ideas, and thus the technical scope thereof is not limited or changed. It will be apparent to those skilled in the art that various changes and modifications can be made within the scope of the technical idea based on these examples. Also, terms and words used in the present specification are terms selected in consideration of the functions in the embodiments, and the meaning of the terms may vary depending on the user, the intention or custom of the operator, and the like. Therefore, the terms used in the following embodiments are defined according to their definitions when they are specifically defined in this specification, and in the absence of a specific definition, they should be construed in a sense generally recognized by ordinary artisans.
1 is a block diagram illustrating a configuration of a shot division apparatus for video content according to an exemplary embodiment of the present invention. 1, the
The image
The image
Among the image information extracted by the image
The
According to this embodiment, the threshold value received from the threshold
The shot
The shot
The threshold
2 is a block diagram showing an example of a detailed configuration of the threshold
The fixed threshold
3 is a flowchart illustrating an example of a method of dividing a shot of video content according to an embodiment of the present invention. The shot dividing method shown in FIG. 3 may be a procedure performed in the
Referring to FIG. 3, first, the image
Then, the image
The
In step S20, the
Subsequently, the image
FIG. 4 is a simplified schematic diagram of an aspect of a shot division method according to an embodiment of the present invention, described above with reference to FIG. Referring to FIG. 4, one unit image content (for example, one movie, one broadcast content episode, etc.) is composed of a plurality of frames. Generally, since the image content is composed of 24 frames per second or 30 frames per second, one unit image may be composed of thousands or tens of thousands of frames depending on its length. According to the embodiment of the present invention, the primary shot division is performed by applying a predetermined fixed threshold to the spiritual contents, and as a result, the image contents can be divided into a plurality of shots. Subsequently, the secondary shot division is performed on the specific shot (Shot 2 in FIG. 4) using the primary shot division result. In this case, the variation threshold is applied. As described above, the variation threshold can be derived using the image information or the image characteristic of the specific shot divided by the primary shot. For example, a second shot division is performed by applying a variation threshold to some shots (Shot 2 in FIG. 4) satisfying a specific condition among a plurality of shots derived from the first shot division result. As a result, (Shot 2) is divided into two or more shots (Shot 2-1, Shot 2-2), so that additional shots may be split.
FIG. 5 is a schematic diagram illustrating another aspect of the shot dividing method according to an embodiment of the present invention described above with reference to FIG. 5, analysis / processing of the secondary image information is performed based on the primary shot division result among the shot division methods shown in FIG. 3, and a variation threshold and a secondary shot division target shot are selected based on the result The process is schematically illustrated.
Referring to FIG. 5, when a first shot division is performed on one unit image content, a plurality of divided shots are derived as a result. For this purpose, the analysis and processing of image information is performed primarily as described above. The second image information analysis and processing are performed on each of the derived shots. The image information analysis result for each shot can be derived according to a preset algorithm stored in the shot division setting unit (
After the calculation of the variation threshold and the selection of the shot for the secondary shot division are completed, the shot division is performed again on the basis of the variation threshold value for the shot, thereby completing the shot division for the image content.
The above description is only an example and should not be construed as being limited thereto. It is to be understood that the technical spirit of the present invention should be defined only by the invention disclosed in the claims, and all technical ideas within the scope of equivalents thereof should be construed as being included in the scope of the present invention. Therefore, it is apparent to those skilled in the art that the above-described embodiments can be modified and implemented in various forms.
100: Shot splitter
110:
120: Image information analysis section
130:
140: shot division policy section
Claims (1)
Dividing the image content into a plurality of first shots by applying a preset threshold value to an analysis and processing result of the image information for the image content;
Obtaining a variation threshold value through analysis and processing of image information for each of the plurality of first shots; And
And dividing at least one first shot of the plurality of first shots into a plurality of second shots by applying the variation threshold value.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
KR1020160008226A KR20170088188A (en) | 2016-01-22 | 2016-01-22 | Apparatus and method for dividing video contents into a plurality of shots using contents adaptive threshold |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
KR1020160008226A KR20170088188A (en) | 2016-01-22 | 2016-01-22 | Apparatus and method for dividing video contents into a plurality of shots using contents adaptive threshold |
Publications (1)
Publication Number | Publication Date |
---|---|
KR20170088188A true KR20170088188A (en) | 2017-08-01 |
Family
ID=59650147
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
KR1020160008226A KR20170088188A (en) | 2016-01-22 | 2016-01-22 | Apparatus and method for dividing video contents into a plurality of shots using contents adaptive threshold |
Country Status (1)
Country | Link |
---|---|
KR (1) | KR20170088188A (en) |
-
2016
- 2016-01-22 KR KR1020160008226A patent/KR20170088188A/en unknown
Similar Documents
Publication | Publication Date | Title |
---|---|---|
KR102263898B1 (en) | Dynamic video overlays | |
CA2906199C (en) | Systems and methods for addressing a media database using distance associative hashing | |
CN105379246B (en) | It arranges the method for image filter, store the computer readable storage medium and electronic device of this method | |
US20170034542A1 (en) | Recognition data generation device, image recognition device, and recognition data generation method | |
US10200765B2 (en) | Content identification apparatus and content identification method | |
EP1494132A1 (en) | Method and apparatus for representing a group of images | |
CN106960211B (en) | Key frame acquisition method and device | |
US8270714B2 (en) | Method and apparatus for colour correction of image sequences | |
KR101485820B1 (en) | Intelligent System for Generating Metadata for Video | |
US20210319230A1 (en) | Keyframe Extractor | |
Chen et al. | Intra-and-inter-constraint-based video enhancement based on piecewise tone mapping | |
US20150085145A1 (en) | Multiple image capture and processing | |
JP3709191B2 (en) | Color temperature conversion system and method using metadata of video content | |
JP2022524651A (en) | Content Aware PQ Range Analyzer and Tone Mapping in Live Feed | |
EP3236362B1 (en) | Information pushing method and system | |
US20150085159A1 (en) | Multiple image capture and processing | |
WO2015174906A1 (en) | Segmentation based image transform | |
KR20170088188A (en) | Apparatus and method for dividing video contents into a plurality of shots using contents adaptive threshold | |
CN104754367A (en) | Multimedia information processing method and device | |
CN110100445B (en) | Information processing system, information processing apparatus, and computer readable medium | |
KR101337833B1 (en) | Method for estimating response of audience concerning content | |
CA3024183C (en) | Generating synthetic frame features for sentinel frame matching | |
KR102430756B1 (en) | Scene segmentation apparatus using object detecting and set theory, and method thereof | |
KR102310386B1 (en) | Apparatus and control method for dividing contents episode | |
EP3471100B1 (en) | Method and system for synchronising between an item of reference audiovisual content and an altered television broadcast version thereof |