KR101843017B1 - Random Editing System for Providing Compressed Video Contents and Method thereof - Google Patents

Random Editing System for Providing Compressed Video Contents and Method thereof Download PDF

Info

Publication number
KR101843017B1
KR101843017B1 KR1020160067016A KR20160067016A KR101843017B1 KR 101843017 B1 KR101843017 B1 KR 101843017B1 KR 1020160067016 A KR1020160067016 A KR 1020160067016A KR 20160067016 A KR20160067016 A KR 20160067016A KR 101843017 B1 KR101843017 B1 KR 101843017B1
Authority
KR
South Korea
Prior art keywords
gop
frame
moving picture
picture content
reference frame
Prior art date
Application number
KR1020160067016A
Other languages
Korean (ko)
Other versions
KR20170135299A (en
Inventor
이규영
천솔지
Original Assignee
(주)잼투고
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by (주)잼투고 filed Critical (주)잼투고
Priority to KR1020160067016A priority Critical patent/KR101843017B1/en
Priority to PCT/KR2017/005356 priority patent/WO2017209432A1/en
Publication of KR20170135299A publication Critical patent/KR20170135299A/en
Application granted granted Critical
Publication of KR101843017B1 publication Critical patent/KR101843017B1/en

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/169Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
    • H04N19/177Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being a group of pictures [GOP]
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/23Processing of content or additional data; Elementary server operations; Server middleware
    • H04N21/234Processing of video elementary streams, e.g. splicing of video streams, manipulating MPEG-4 scene graphs
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/23Processing of content or additional data; Elementary server operations; Server middleware
    • H04N21/234Processing of video elementary streams, e.g. splicing of video streams, manipulating MPEG-4 scene graphs
    • H04N21/23412Processing of video elementary streams, e.g. splicing of video streams, manipulating MPEG-4 scene graphs for generating or manipulating the scene composition of objects, e.g. MPEG-4 objects
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/23Processing of content or additional data; Elementary server operations; Server middleware
    • H04N21/234Processing of video elementary streams, e.g. splicing of video streams, manipulating MPEG-4 scene graphs
    • H04N21/23418Processing of video elementary streams, e.g. splicing of video streams, manipulating MPEG-4 scene graphs involving operations for analysing video streams, e.g. detecting features or characteristics
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/23Processing of content or additional data; Elementary server operations; Server middleware
    • H04N21/234Processing of video elementary streams, e.g. splicing of video streams, manipulating MPEG-4 scene graphs
    • H04N21/2343Processing of video elementary streams, e.g. splicing of video streams, manipulating MPEG-4 scene graphs involving reformatting operations of video signals for distribution or compliance with end-user requests or end-user device requirements
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/47End-user applications
    • H04N21/472End-user interface for requesting content, additional data or services; End-user interface for interacting with content, e.g. for content reservation or setting reminders, for requesting event notification, for manipulating displayed content
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/47End-user applications
    • H04N21/472End-user interface for requesting content, additional data or services; End-user interface for interacting with content, e.g. for content reservation or setting reminders, for requesting event notification, for manipulating displayed content
    • H04N21/47205End-user interface for requesting content, additional data or services; End-user interface for interacting with content, e.g. for content reservation or setting reminders, for requesting event notification, for manipulating displayed content for manipulating displayed content, e.g. interacting with MPEG-4 objects, editing locally

Abstract

The present invention relates to a system and a method for providing an arbitrary edited compressed moving picture, and a system for providing an arbitrary edited compressed moving picture content according to the present invention is characterized by comprising: A moving picture content storage unit for storing compressed moving picture contents composed of first to Nth GOPs (N is an integer of 2 or more), each of which comprises a plurality of reference frames referencing information; And providing the compressed moving picture content to a user terminal in units of GOPs through a communication network, wherein at least some GOPs are provided to the user terminal in order, and k k GOPs (k is an integer of 2 or more and N-1 or less) And providing the moving picture contents to the user terminal in any order preceding the k-1 GOP or following the k + 1 GOP.

Description

Technical Field [0001] The present invention relates to a system and a method for providing compressed video contents,

The present invention relates to a system and a method for providing arbitrary edited compressed moving picture contents, and more particularly, to a system and method for providing arbitrary edited compressed moving picture contents that can be arbitrarily edited in units of GOPs starting with a best- And a method of providing the same.

Transmission of various video contents through a communication network has become common. Uncompressed video content files, in which all frames include a complete image, have a problem in that the network resources required for video content transmission are increased and the data transmission cost is increased when the network is transmitted due to excessive file size. Accordingly, video storage format standardization organizations such as MPEG (Moving Picture Experts Group) have proposed various compression video content storage standards.

According to a video compression standard such as MPEG-2, MPEG-4, ITU-T H.263, ITU-T H.264 / MPEG-4, Part 10 and Advanced Video Coedc (AVC) The content is divided into GOP (Group of Picture) units having a length of, for example, about 0.5 seconds, and one GOP is divided into a non-reference frame (intra frame) such as an I frame having complete image information and a change And a reference frame (inter frame) such as a B frame or a P frame, which has only incomplete image information by storing only information but greatly reduces the amount of data. The video compression technique reduces the amount of video contents by the GOP structure composed of the non-reference frame and the reference frame, thereby contributing to the easy distribution of the video contents through the communication network. (References, Scalable Parallel Programming Applied to H.264 / AVC Decoding, pp. 5-15)

On the other hand, UGC (User Generated Contents) produced by general users, rather than experts, is often composed of a series of monotonous images due to limitations of general users' shooting techniques, shooting equipment, editing techniques and editing equipment, There was a difficult limit. In order to solve such a problem, it is possible to improve the monotony of the video contents by editing one video or a plurality of videos through editing technology. Korean Patent Application No. 2016-0034287 and Korean Patent Application No. 2016-0048882 filed by the present applicant disclose a video content providing system capable of increasing the unexpectedness of a moving image by automatically arbitrarily editing one moving image or a plurality of moving images .

On the other hand, in the case of compressed video contents other than uncompressed video contents, if the starting point to be arbitrarily edited is a non-reference frame, image quality deterioration due to the loss of reference data occurs at the start point of the arbitrarily edited section due to the absence of the preceding reference frame .

U.S. Patent No. 9,319,448 to Qualcomm, entitled " Trick modes for network streaming of coded multimedia data " refers to the location of a reference frame for random access to a video content file as a random access point Thereby providing an effect that the user terminal can easily access the non-reference frame when searching the section of the moving image content file. However, according to this prior art, there is a problem that the user can only provide access to a certain point of the video content and can not automatically provide the arbitrarily edited video content.

U.S. Patent No. 8,340,113 issued to Ericsson, Inc., entitled " Method and arrangement for improved media session management " refers to a method in which when a request is made to change a video transmission mode from broadcast to unicast, It is possible to view seamless video contents in spite of the broadcast mode change. However, according to this prior art, since the user watches the video contents in frame order, there is a problem that the user can not receive the arbitrary edited video contents automatically.

U.S. Patent No. 9,319,448 United States Patent No. 8,340,113

Juurlink et al., &Quot; Scalable Parallel Programming Applied to H.264 / AVC Decoding ", 2012, pp. 5-11

According to an aspect of the present invention, there is provided a system and method for providing arbitrary edited compressed moving picture contents according to the present invention. The moving picture content providing system stores compressed moving picture contents and provides them to a user. So that the user terminal reproduces the edited GOP edited from the best row non-reference frame which does not require the video information of the preceding frame to an arbitrary point, thereby preventing deterioration of the image quality despite arbitrary editing.

According to the system and method for providing arbitrary edited compressed moving picture contents according to the embodiment of the present invention, it is possible to enhance the unexpectedness of edited moving picture contents by detecting a scene change point and editing the non- Another purpose is to maintain the context of the original video content.

According to another embodiment of the present invention, there is provided an arbitrary editing compressed moving picture contents providing system and method for arbitrarily editing a GOP including an IDR frame (Instantaneous Decoder Refresh Frame) which does not affect the picture quality of a preceding frame, Another object of the present invention is to prevent deterioration of image quality of a trailing frame in a preceding frame in spite of the empty space of a GOP edited at an arbitrary point.

According to another embodiment of the present invention, there is provided a system and a method for providing arbitrary edited compressed moving picture contents, wherein consecutive GOPs other than a single GOP are defined as an editing GOP array and are arbitrarily arranged, And to maintain the context of the original moving picture contents.

According to another embodiment of the present invention, there is provided a system and a method for providing arbitrary edited compressed moving picture contents, the definition of GOPs including the same GOP consecutively in an edit GOP array, To maintain the other purpose.

According to another embodiment of the present invention, there is provided a system and a method for providing arbitrary edited compressed moving picture contents, wherein GOPs including the same consecutive persons are defined as an edit GOP array, To maintain the context of the other.

In order to achieve the above object, a system for providing arbitrary edited compressed moving picture contents according to the present invention includes a plurality of reference frames which refer to information of a best row non-reference frame not referencing information of a preceding frame and information of a preceding or succeeding frame A moving picture content storage unit for storing compressed moving picture contents composed of first to Nth GOPs (N is an integer of 2 or more), each of which is composed of a first GOP to an Nth GOP; And providing the compressed moving picture content to a user terminal in units of GOPs through a communication network, wherein at least some GOPs are provided to the user terminal in order, and k k GOPs (k is an integer of 2 or more and N-1 or less) And providing the moving picture contents to the user terminal in any order preceding the k-1 GOP or following the k + 1 GOP.

In the arbitrary editing compressed moving picture contents providing system according to the embodiment of the present invention, the best row non-reference frame is an I frame, and the reference frame is a Predicted Frame and / or a Bidirectional Frame ).

In the arbitrary editing compressed moving picture contents providing system according to an embodiment of the present invention, the moving picture content providing unit may include a GOP including an I frame located at a scene change point detected through scene change detection as the best row non- K < th > GOP.

In the arbitrary editing compressed moving picture contents providing system according to the embodiment of the present invention, the moving picture content providing unit selects the GOP including the IDR frame (Instantaneous Decoder Refresh) as the best row frame as the kth GOP. do.

In the arbitrary editing compressed moving picture contents providing system according to the embodiment of the present invention, the moving picture content providing unit may include an editing GOP array defined by the kth GOP to the k + m GOP (m is an integer of 10 or more) -1 GOP, or to the user terminal in any order so as to follow the k + m + 1 GOP.

The arbitrary editing compressed moving picture contents providing system according to the embodiment of the present invention may further include a video object analyzing unit for analyzing VOPs of the image of the best row non-reference frame and the reference frame, , And the k < th > to (k + m) GOPs in which the same VOP is consecutively included are defined as the edit GOP arrays.

The arbitrary editing compressed moving picture contents providing system according to the embodiment of the present invention may further include a face recognizing unit for recognizing a face of a person included in the image of the best row non-reference frame and the reference frame and associating the face with a face identifier, And the content providing unit defines the k < th > GOP to the k + m < th > GOP in which the same facial identifiers are consecutively associated with the editing GOP array.

The method of providing arbitrary edited compressed moving picture contents according to the present invention is characterized in that the moving picture content providing system includes a plurality of reference frames which refer to information of a best row reference frame not referring to information of a preceding frame and information of a preceding or succeeding frame A first step of storing compressed moving picture contents composed of first to Nth GOPs (N is an integer of 2 or more), each of which is composed of; A second step of the moving picture content providing system receiving an arbitrary editing request for requesting arbitrary editing of the compressed moving picture content from a user terminal through a communication network; And the moving picture contents providing system provides the compressed moving picture contents to the user terminal in units of GOPs through a communication network, and at least some GOPs are provided to the user terminal in order, and a k < (K + 1) -th GOP, and (k + 1) -th GOP, which is an integer equal to or smaller than N-1, to the user terminal.

In the first step, the moving picture content providing system may be configured such that the best row non-reference frame is an I frame (Infra Frame), and the reference frame is P And the compressed moving picture content, which is a Predicted Frame and / or a B-frame (Bidirectional Frame), is stored.

In the arbitrary editing compressed moving picture content providing method according to an embodiment of the present invention, the third step may be a step of, when the moving picture content providing system performs an I-frame located at a scene change point detected through scene change detection, And the GOP included in the frame is selected as the k < th > GOP.

In the arbitrary editing compressed moving picture content providing method according to an embodiment of the present invention, the third step may include: a GOP including an IDR frame (Instantaneous Decoder Refresh) as the best row non- K < th > GOP.

In the third step, the moving picture contents providing system may edit the kth GOP to the (k + m) GOP (m is an integer of 10 or more) according to an embodiment of the present invention. GOP array, and provides the edited GOP array to the user terminal in any order so as to precede the k-1 GOP or to follow the k + m + 1 GOP.

In the arbitrary editing compressed moving picture content providing method according to an embodiment of the present invention, the first step may include analyzing VOP (Video Object Plane) of the image of the best reference non-reference frame and reference frame by the moving picture content providing system Wherein the moving picture content system defines the k < th > GOP to the k < + > m GOP sequentially including the same VOP as the editing GOP array .

In the arbitrary editing compressed moving picture content providing method according to the embodiment of the present invention, in the first step, the moving picture content providing system recognizes the face of the person included in the image of the best row non-reference frame and the reference frame And a second sub-step of associating the k < th > GOP to the k < th > m GOP in which the same facial identifier is consecutively associated, As shown in FIG.

According to the above configuration of the present invention, according to the system and method for providing arbitrary edited compressed moving picture contents according to the present invention, the moving picture content providing system stores compressed moving picture contents and provides them in units of GOP The user terminal reproduces the edited GOP edited from the best row non-reference frame which does not require the video information of the preceding frame to a certain point, thereby providing an effect of preventing the deterioration of image quality despite the arbitrary editing.

According to the system and method for providing arbitrary edited compressed moving picture contents according to the embodiment of the present invention, it is possible to enhance the unexpectedness of edited moving picture contents by detecting a scene change point and editing the non- And provides the effect of maintaining the context of the original video contents.

According to another embodiment of the present invention, there is provided a system and method for providing arbitrary edited compressed moving picture contents, the method comprising: arbitrarily editing a GOP including an IDR frame as a best row frame that does not affect the image quality of a preceding frame, It provides an effect of preventing degradation of the picture quality of the following frame in the preceding frame despite the gap of the GOP.

According to another embodiment of the present invention, there is provided a system and a method for providing arbitrary edited compressed moving picture contents, wherein consecutive GOPs other than a single GOP are defined as an editing GOP array and are arbitrarily arranged, And the effect of maintaining the context of the original moving picture contents is provided.

According to another embodiment of the present invention, there is provided a system and a method for providing arbitrary edited compressed moving picture contents, the definition of a GOP including consecutive VOPs as an edit GOP array, ≪ / RTI >

According to another embodiment of the present invention, there is provided a system and a method for providing arbitrary edited compressed moving picture contents, wherein GOPs including the same consecutive persons are defined as an edit GOP array, To provide the effect of maintaining the context of.

BRIEF DESCRIPTION OF THE DRAWINGS FIG. 1 is a configuration diagram illustrating a system for providing arbitrary editing compressed moving picture contents according to an embodiment of the present invention; FIG.
FIG. 2 is a block diagram showing a GOP structure of compressed moving picture contents. FIG.
3 is a block diagram showing a GOP structure of compressed moving picture contents arbitrarily edited according to an embodiment of the present invention;
4 is a block diagram showing a GOP structure of compressed video contents arbitrarily edited according to another embodiment of the present invention;
5 is a GOP diagram showing a GOP structure of compressed moving picture contents.
FIG. 6 is a GOP diagram showing a GOP structure of compressed moving picture contents arbitrarily edited in an edit GOP array unit according to another embodiment of the present invention; FIG.
FIG. 7 is a GOP diagram showing a GOP structure of compressed moving picture contents arbitrarily edited in an edit GOP array unit according to another embodiment of the present invention; FIG.
8 is a flowchart illustrating a method for providing arbitrary editing compressed moving picture contents according to the present invention.

The terms and words used in the present specification and claims should not be construed in an ordinary or dictionary sense, and the inventor shall, in order to best explain his invention in the best way, And should be construed as meaning and concept consistent with the technical idea of the invention.

Therefore, the embodiments described in the present specification and the configurations shown in the drawings correspond to the preferred embodiments of the present invention and do not represent all the technical ideas of the present invention, so that the configurations can be replaced at the time of filing of the present invention Various equivalents and variations may be present.

The specification that " comprises " any element in any specification throughout the specification does not exclude other elements, but may also include other elements, unless the context clearly indicates otherwise. The terms " module, " " part, " " system, " and the like, which are described in the specification, mean a unit for processing at least one function or operation, And may be included in one device or in another device.

Hereinafter, an arbitrary editing compressed video content providing system according to the present invention will be described with reference to the drawings. FIG. 1 illustrates a system for providing arbitrary editing compressed moving picture contents (content_compress) according to an embodiment of the present invention. The arbitrary editing compressed video content (content_compress) includes a moving image content storage unit 110 and a moving image content providing unit 120.

The moving picture content storage unit 110 and the moving picture content providing unit 120 may be implemented in a single server system or a server system configured as a separate server through a communication network. Also, it may be implemented in the form of a program code of software for controlling hardware or hardware such as a logic circuit, a memory, and a storage device in the user terminals 200-1 and 200-2 rather than a server-client system according to an embodiment.

The moving picture content storage unit 110 includes a best row reference frame (frame_intra_first) that does not refer to information of a preceding frame and a plurality of reference frames (frame_inter) that refer to information of a preceding or succeeding frame, (Content_compress) composed of one GOP (GOP_1st) to Nth GOP (GOP_Nth) (N is an integer of 2 or more).

2 shows a GOP structure of compressed video content (content_compress). A GOP is a set of frames, and is composed of at least one non-reference frame (frame_intra) and at least one reference frame (frame_inter). The number of frames included in the GOP can be specified by the user, and the number of frames can be determined to have a time of about 0.5 seconds, for example.

The size of the non-reference frame (frame_intra) is large because the frame itself includes data such as brightness, color, and the like of all pixels for the frame. According to a moving picture compression standard such as MPEG, an I frame (Infra Frame) and an IDR frame (Instantaneous Decoder Refresh Frame) described later correspond to a non-reference frame (frame_intra).

The size of the data is small because the reference frame (frame_inter) includes only information on pixels or VOP (Video Object Plane) changed in the preceding non-reference frame (frame_intra). Since the next frame in one frame in the video content is very short in time, the change of data between consecutive frames is very small, so that it is efficient in terms of data size to store only changed data. According to a moving picture compression standard such as MPEG, a P frame and a B frame correspond to a reference frame (frame_inter). The P frame refers to the preceding I frame or another P frame, and since the B frame refers to both the preceding and following frames, the size of the data is generally smaller than that of the P frame.

In the embodiment of FIG. 2, k k GOP (GOP_kth) is composed of I frame, B frame, B frame, P frame, B frame and B frame in order and an I frame which is a best row non- Four B frames and one P frame which are frame_inter. In this case, the I frame is implemented through the data of the I frame upon reproduction of the moving picture content. Next, the P frame, which is the third trailing frame in the I frame, refers to the I frame and then the B frame The moving picture content is reproduced in such a manner that the preceding I frame and the following P frame are referred to and then the I frame and the B frame preceding the B frame that is the third preceding frame in the I frame and the P frame following the B frame are referred to.

Meanwhile, the moving picture content storage unit 110 of the present invention may include a function of converting the non-compressed moving picture content into a compressed moving picture content (content_compress) and storing the compressed moving picture content (content_compress) .

The video content providing unit 120 provides the compressed video content (content_compress) to the user terminals 200-1 and 200-2 on a GOP basis through a communication network, and at least some GOPs are sequentially transmitted to the user terminal 200- 1) and k + 1 GOP (GOP_kth) preceding the k-1 GOP (GOP_k-1th) (k is an integer of 2 or more and N-1 or less) + 1th) to the user terminals 200-1 and 200-2 in an arbitrary order.

The video content providing unit 120 provides video content through a communication network such as the Internet or an intranet in the case of a server-client system. When the video content is implemented on a single client, the video content providing unit 120 provides video content through system- can do.

The video content providing unit 120 provides at least some GOPs to the user terminals 200-1 and 200-2 according to the order of the original moving picture contents, (200-1, 200-2). At this time, the ratios of the non-edit GOPs provided in order and the edit GOPs provided differently from the order may be variously applied according to the embodiment. For example, if you want to make many changes in the original video content, the ratio of edit GOP to non-edit GOP may be 90%. On the contrary, if you want to make some changes while maintaining the context of original video content, The ratio can be set to 10%. According to an embodiment, the compressed video content (content_compress) providing system can determine the ratio of the edited GOP to the non-edited GOP at an arbitrary ratio.

In order to eliminate the hassle of specifying a user's additional editing GOP, the determination of a GOP to be an editing GOP may be performed automatically by a system that provides a compressed video content (content_compress), not a user decision, .

The k-th GOP (GOP_kth) is arranged in front of or adjacent to the preceding k-1 GOP (GOP_k-1th) as shown in FIG. 3, 1) th GOP (GOP_k-1th) in front of the (k-1) GOP. On the contrary, the arbitrary kth GOP (GOP_kth) is arranged adjacent to or further behind the k + 1 GOP (GOP_k + 1th) (K + 1) th GOP (GOP_k + 1th) at the rear of the kth GOP.

On the other hand, the moving picture content providing unit 120 may perform arbitrary editing for a single compressed moving picture content (content_compress) or may perform arbitrary editing for a plurality of compressed moving picture contents (content_compress). The first compressed moving picture content (content_compress_1st) is composed of 150 GOPs, the second compressed moving picture content (content_compress_2nd) is composed of 300 GOPs, and the second compressed moving picture content (content_compress_2nd) If the compressed third video content (content_compress_3rd) is composed of 200 GOPs, the moving picture content providing unit 120 regards the compressed video content (content_compress) composed of a total of 650 GOPs as random content To the user terminals 200-1 and 200-2. According to this embodiment, the unexpectedness of the arbitrarily edited moving picture contents is further increased.

Recently, moving picture compression techniques have been developed to locate the scene change point through the scene change detection through image processing to locate the best row reference frame (frame_intra_first) of the GOP. This is because frame corelation between the preceding frame and the trailing frame is lowered when the scene change occurs in the middle of the GOP, and there is a problem of increasing the amount of data of the B frame or the P frame which is the reference frame (frame_inter) . In the embodiment of the present invention, a GOP defined by a scene change point as a best row non-reference frame (frame_intra_first) is used as an edit GOP in consideration of this point, It is possible to provide an effect of automatically generating moving picture contents. For this, the moving picture content providing unit 120 may be configured to select a GOP including an I frame located at a scene change point detected through scene change detection as a best row non-reference frame (frame_intra_first) as a kth GOP (GOP_kth) desirable.

Meanwhile, a method of arbitrarily editing the moving picture content and providing the moving picture content to the user is as follows. First, the moving picture content providing unit 120 may change the order of the GOP without changing the compressed video content (content_compress) Second, the moving picture content providing unit 120 changes the GOP order of the compressed video content (content_compress) file according to the result of arbitrary editing and stores the compressed content (content_compress) file in the buffer memory or compresses A copy of all or part of the content_compress file may be uploaded and provided.

In the former case, there is no problem, but in the latter case, the following data loss problem may occur depending on the change of the GOP order. 2, the last row frame of the k-1 GOP (GOP_k-1th), which is the immediately preceding frame of the I frame which is the best row reference frame (frame_intra_first) of the kth GOP (GOP_kth) Lt; / RTI > If the kth frame GOP_kth is shifted to another position as a result of the arbitrary editing of the moving picture content providing unit 120 as shown in FIG. 3, the B frame, which is the last row frame of the k-1 GOP (GOP_k-1th) There is a problem that the frame image is not completely generated due to the loss of the trailing frame.

According to the embodiment of the present invention, in order to solve the above problem, the GOP including the IDR frame not referred to by the preceding frames of the preceding GOP as the best row reference frame (frame_intra_first) is selected as the kth GOP (GOP_kth) It is possible to provide an effect of preventing the data loss in the network.

On the other hand, since one GOP is very short as several tens of seconds or a few seconds, it is impossible to convey the context of original video contents when only a single GOP is arbitrarily edited and moved to another position, there is a problem. In order to prevent such a problem, the moving picture content providing unit 120 according to the embodiment of the present invention includes an editing unit 120, which is defined as k k GOP (GOP_kth) to k + m GOP (GOP_k + mth) The GOP array (array_GOP_edit) is provided to the user terminals 200-1 and 200-2 in an arbitrary order so as to precede the k-1 GOP (GOP_k-1th) or follow the (m + 1) GOP (GOP_k + m + 1th) . At this time, at least eleven GOPs should be set as one edit GOP array (array_GOP_edit). Preferably, m is set to 10 or more because the object to be arbitrarily edited has a length of at least about 1 second to 10 seconds.

FIG. 5 is a GOP structure diagram showing a GOP structure of a compressed moving picture content (content_compress). FIGS. 6 and 7 illustrate compressed moving picture contents (content_compress) arbitrarily edited in units of an editing GOP array (array_GOP_edit) Is a GOP structure diagram showing the GOP structure of FIG. According to this embodiment, the moving picture content providing unit 120 defines (k = 3, m = 2) the third GOP (GOP_3rd) to the fifth GOP (GOP_5th) as an edit GOP array (array_GOP_edit). According to the embodiment, as shown in FIG. 6, the edit GOP array (array_GOP_edit) is arranged to be adjacent to the second GOP (GOP_2nd) which is the adjacent preceding GOP, or the edit GOP array (array_GOP_edit) (GOP_6th), which is an adjacent trailing GOP, to be traced.

According to this embodiment, the edited GOP array (array_GOP_edit) composed of at least 10 consecutive GOPs is edited to provide arbitrary edited moving picture contents that can increase the unexpectedness while maintaining the context of the original moving picture contents .

On the other hand, it is more preferable that the edit GOP arrays (array_GOP_edit) consist of GOPs successively including the same VOP (Video Object Plane) such as the same person, the same object, and the same background. To this end, the arbitrary editing compressed video content (content_compress) providing system includes a video object analyzing unit 130 for analyzing VOP (Video Object Planes) of an image of a best row reference frame (frame_intra_first) and a reference frame (frame_inter). At this time, the moving picture providing unit 120 may be configured to define the k-th GOP (GOP_kth) to (k + m) GOP (GOP_k + mth) in which the same VOP is consecutively defined as an edit GOP array (array_GOP_edit).

In another embodiment, it is preferable that the editing GOP arrays (array_GOP_edit) consist of GOPs continuously including the same person through facial recognition. To this end, a system for providing arbitrary editing compressed video content (content_compress) includes a facial recognition unit for recognizing a face of a person included in an image of a best row reference frame (frame_intra_first) and a reference frame (frame_inter) and associating it with a face identifier (id_facial) 140). In this case, the moving picture content providing unit 120 is characterized by defining a k-th GOP (GOP_kth) to (k + m) -th GOP_k + mth GOPs having the same facial identifiers (id_facial) consecutively as an editing GOP array (array_GOP_edit) .

In this case, the facial identifier (id_facial) is not particularly limited, and the name, alias, and identifier of the face recognized in advance through the internal memory or the external server and the corresponding person can be found and used as a facial identifier (id_facial) Or may be an identifier given in a specific order or at random depending on the minutiae of the face.

Hereinafter, a method of providing arbitrary editing compressed video content (content_compress) according to the present invention will be described with reference to FIG.

First, the moving picture content providing system 100 includes a best row reference frame (frame_intra_first) that does not refer to information of a preceding frame and a plurality of reference frames (frame_inter) that refer to information of a preceding or succeeding frame, (S10) of storing compressed video content (content_compress) composed of first GOP (GOP_1st) to Nth GOP (GOP_Nth) (N is an integer of 2 or more).

At this time, the best row reference frame (frame_intra_first) may be an I frame or an IDR frame, and the reference frame (frame_inter) may be a B frame or a P frame.

In the second step s20, the moving picture content providing system 100 receives an arbitrary editing request for requesting arbitrary editing of the compressed moving picture content (content_compress) from the user terminals 200-1 and 200-2 through a communication network, .

Finally, the moving picture contents providing system 100 provides the compressed video contents (content_compress) to the user terminals 200-1 and 200-2 in units of GOP through a communication network, (GOP_kth) (k is an integer of 2 or more and N-1 or less) to the k-1 GOP (GOP_k-1th) or a k + 1 GOP (S30) to the user terminals 200-1 and 200-2 in an arbitrary order that follows the GOP_k + 1th.

In step S30, the moving picture contents providing system 100 sets the I frame located at the scene change point detected through the scene change detection to the best row non-reference frame (frame_intra_first) As the k < th > GOP (GOP_kth).

In order to prevent data loss due to movement of the best row reference frame (frame_intra_first) of the kth GOP (GOP_kth) of the last row frame of the (k-1) th GOP (GOP_k-1th) preceding the kth GOP (GOP_kth) In the third step s30, it is preferable that the moving picture contents providing system 100 selects a GOP including an IDR (Instantaneous Decoder Refresh) frame as a best row reference frame (frame_intra_first) as a k-th GOP (GOP_kth).

In the third step s30, the moving picture contents providing system 100 sets the kth GOP (GOP_kth) to the k + m GOP (GOP_k + mth) (m is an integer of 10 or more) so that the automatically edited GOPs have an appropriate length. Is defined as an edit GOP array (array_GOP_edit) and the edit GOP array (array_GOP_edit) is preceded by the k-1 GOP (GOP_k-1th) or follows the k + m + 1 GOP (GOP_k + m + To the user terminals 200-1 and 200-2 in this order.

In the embodiment in which the editing is performed in units of the edit GOP array (array_GOP_edit), the first step (s10) for editing the GOPs consecutively included in the same VOP by the editing GOP array (array_GOP_edit) (S11) for analyzing video object planes (VOP) of an image of a best row reference frame (frame_intra_first) and a reference frame (frame_inter), and the second step (s20) It is preferable that the k < th > GOP (GOP_kth) to (k + m) GOP (GOP_k + mth) in which the VOPs are consecutively included is defined as an edit GOP array (array_GOP_edit).

In an embodiment in which editing is performed in units of an edit GOP array (array_GOP_edit), a first step (s10) for editing GOPs consecutively containing the same person in units of an edit GOP array (array_GOP_edit) (S12) of recognizing the face of the person included in the image of the best row reference frame (frame_intra_first) and the reference frame (frame_inter) and associating it with the face identifier (id_facial) It is preferable that the moving picture content system defines a k-th GOP (GOP_kth) to (k + m) GOP (GOP_k + mth) in which the same facial identifiers id_facial are consecutively associated as an editing GOP array (array_GOP_edit).

The description of the invention in this specification is for illustrative purposes only, and the invention is not limited to these embodiments. It will be understood by those skilled in the art that various changes in form and details may be made therein without departing from the spirit and scope of the invention as defined by the appended claims. .

Video contents providing system: 100 Video contents storing part: 110
Video content provider: 120 Video object analysis department: 130
Face recognition section: 140 User terminal: 200-1, 200-2

Claims (14)

(N being an integer of 2 or greater) composed of the best row non-reference frame which does not refer to the information of the preceding frame and a plurality of reference frames which refer to the information of the preceding or succeeding frame, A moving picture content storage unit for storing compressed moving picture contents; And
Wherein at least some of the GOPs included in the compressed moving picture content are not changed in order, at least a part of the GOPs are changed in order, (k + 1) -th GOP is arranged in any order such that the k < th > GOP is preceded by the (k + 1) And providing the moving picture contents to the user terminal,
Wherein the best row reference frame is an I frame and the reference frame is a Predicted Frame and / or a B-frame (Bidirectional Frame)
Wherein the moving picture content providing unit,
And selects a GOP including the I frame, which is located at the scene change point detected through scene change detection, as the best row non-reference frame as the k-th GOP.
delete delete The apparatus according to claim 1,
And selects a GOP including the IDR (Instantaneous Decoder Refresh) frame as the best row non-reference frame as the k-th GOP.
The apparatus according to claim 1,
(K + 1) -th GOP to the k-th GOP or to the k-th (k + 1) -th GOP after the k < th & Wherein the compressed moving picture content providing system comprises:
6. The system as claimed in claim 5,
And a video object analyzer for analyzing VOP (Video Object Planes) of the image of the best row non-reference frame and the reference frame,
Wherein the moving picture content providing unit defines the k < th > GOP to the k + m < th > GOP sequentially including the same VOP as the editing GOP array.
6. The system as claimed in claim 5,
And a facial recognition unit for recognizing the face of the person included in the image of the best row non-reference frame and the reference frame and associating the facial identifier with the facial identifier,
Wherein the moving picture content providing unit defines the k < th > GOP to the k + m < th > GOP in which the same facial identifiers are consecutively associated as the editing GOP array.
The moving picture content providing system includes a first best reference frame that does not refer to information of a preceding frame and a first to an Nth GOP that are respectively composed of a plurality of reference frames that refer to information of a preceding or succeeding frame N is an integer equal to or greater than 2);
A second step of the moving picture content providing system receiving an arbitrary editing request for requesting arbitrary editing of the compressed moving picture content from a user terminal through a communication network; And
Wherein the moving picture content providing system transmits the compressed moving picture content in units of GOP to the user terminal through a communication network, at least a part of the GOPs included in the compressed moving picture content are not changed in order, (K is an integer equal to or greater than 2 and less than or equal to N-1), and the compressed (k-1) GOP is arranged in any order so as to precede the k-1 GOP or follow the k + And providing the moving picture content to the user terminal,
In the first step,
The moving picture content providing system may be configured such that the best row non-reference frame is an I frame and the reference frame stores the compressed moving picture content, which is a P frame and / or a B frame,
In the third step,
Wherein the moving picture content providing system selects, as the kth GOP, a GOP including an I frame located at a scene change point detected through scene change detection as the best row non-reference frame. Way.
delete delete 9. The method according to claim 8,
Wherein the moving picture content providing system selects a GOP including the IDR (Instantaneous Decoder Refresh) frame as the best row non-reference frame as the kth GOP.
9. The method according to claim 8,
Wherein the moving picture content providing system defines the kth GOP through (k + m) GOPs (m is an integer of 10 or more) as an edit GOP array, and the edit GOP array is preceded by the (k + To the user terminal in an arbitrary order so as to follow the +1 GOP.
13. The method of claim 12,
The first step may further include a first sub-step of analyzing video object planes (VOPs) of an image of a best reference non-reference frame and a reference frame,
Wherein the moving picture contents providing system defines the k < th > GOP to the k + m < th > GOP in which the same VOP is consecutively included in the editing GOP array. .
13. The method of claim 12,
Wherein the first step further comprises a second sub-step of associating the facial identifier of the person included in the image of the best row non-reference frame and the reference frame with the facial identifier,
Wherein the moving picture content providing system defines the k < th > GOP to the k < + > m GOPs in which the same facial identifiers are consecutively associated as the editing GOP array .
KR1020160067016A 2016-05-31 2016-05-31 Random Editing System for Providing Compressed Video Contents and Method thereof KR101843017B1 (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
KR1020160067016A KR101843017B1 (en) 2016-05-31 2016-05-31 Random Editing System for Providing Compressed Video Contents and Method thereof
PCT/KR2017/005356 WO2017209432A1 (en) 2016-05-31 2017-05-23 Randomly-edited compressed video content provision system and provision method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
KR1020160067016A KR101843017B1 (en) 2016-05-31 2016-05-31 Random Editing System for Providing Compressed Video Contents and Method thereof

Publications (2)

Publication Number Publication Date
KR20170135299A KR20170135299A (en) 2017-12-08
KR101843017B1 true KR101843017B1 (en) 2018-03-29

Family

ID=60477656

Family Applications (1)

Application Number Title Priority Date Filing Date
KR1020160067016A KR101843017B1 (en) 2016-05-31 2016-05-31 Random Editing System for Providing Compressed Video Contents and Method thereof

Country Status (2)

Country Link
KR (1) KR101843017B1 (en)
WO (1) WO2017209432A1 (en)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113692781B (en) * 2020-03-10 2024-03-12 北京小米移动软件有限公司 Method, device, communication equipment and storage medium for transmitting data
CN111918121B (en) * 2020-06-23 2022-02-18 南斗六星系统集成有限公司 Accurate editing method for streaming media file

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2001094938A (en) * 1999-09-24 2001-04-06 Nec Corp Compression image reproduction method and device
JP2009522939A (en) * 2006-01-06 2009-06-11 グーグル インク. Dynamic media supply infrastructure
JP2012018727A (en) * 2010-07-08 2012-01-26 Sony Corp Information processor, and information processing method and program
KR101382954B1 (en) * 2006-07-04 2014-04-08 소니 주식회사 Information processing apparatus and method and recording medium for program

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR20080054475A (en) * 2006-12-13 2008-06-18 주식회사 대우일렉트로닉스 Reservation recording method by using video object plane and its system

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2001094938A (en) * 1999-09-24 2001-04-06 Nec Corp Compression image reproduction method and device
JP2009522939A (en) * 2006-01-06 2009-06-11 グーグル インク. Dynamic media supply infrastructure
KR101382954B1 (en) * 2006-07-04 2014-04-08 소니 주식회사 Information processing apparatus and method and recording medium for program
JP2012018727A (en) * 2010-07-08 2012-01-26 Sony Corp Information processor, and information processing method and program

Also Published As

Publication number Publication date
WO2017209432A1 (en) 2017-12-07
KR20170135299A (en) 2017-12-08

Similar Documents

Publication Publication Date Title
US8918533B2 (en) Video switching for streaming video data
US9992555B2 (en) Signaling random access points for streaming video data
US6389218B2 (en) Method and apparatus for simultaneously producing compressed play and trick play bitstreams from a video frame sequence
CN105359544B (en) Special play-back in digital video frequency flow transmission
US10009628B2 (en) Tuning video compression for high frame rate and variable frame rate capture
US20090052537A1 (en) Method and device for processing coded video data
US20080267290A1 (en) Coding Method Applied to Multimedia Data
WO2021147448A1 (en) Video data processing method and apparatus, and storage medium
HUE029013T2 (en) Arranging sub-track fragments for streaming video data
CN102598688A (en) Streaming encoded video data
CN111277826B (en) Video data processing method and device and storage medium
US20090086034A1 (en) Video Image Processing Device, Video Image Processing Method, and Video Image Processing Program
KR20140126372A (en) data, multimedia and video transmission updating system
CN103081488A (en) Signaling video samples for trick mode video representations
EP2306730A2 (en) Method and system for 3D video decoding using a tier system framework
KR101843017B1 (en) Random Editing System for Providing Compressed Video Contents and Method thereof
US10674111B2 (en) Systems and methods for profile based media segment rendering
CN105379281B (en) Picture reference control for video decoding using a graphics processor
US11910038B2 (en) Crop-based compression of videos
KR101829262B1 (en) Method for transmitting videos including text and graphics over ip packets and the apparatus thereof
KR102072576B1 (en) Apparatus and method for encoding and decoding of data
JPH09200772A (en) Compressed image data display device
JP7434561B2 (en) MPD expiration date processing model
CN115695918B (en) Multi-camera broadcast guide control method and device, readable storage medium and terminal equipment
KR20040039113A (en) PVR Set-top box system capable of indexing, searching and editing the moving picture

Legal Events

Date Code Title Description
A201 Request for examination
E902 Notification of reason for refusal
AMND Amendment
E601 Decision to refuse application
AMND Amendment
X701 Decision to grant (after re-examination)
GRNT Written decision to grant