WO2017209432A1 - Randomly-edited compressed video content provision system and provision method - Google Patents

Randomly-edited compressed video content provision system and provision method Download PDF

Info

Publication number
WO2017209432A1
WO2017209432A1 PCT/KR2017/005356 KR2017005356W WO2017209432A1 WO 2017209432 A1 WO2017209432 A1 WO 2017209432A1 KR 2017005356 W KR2017005356 W KR 2017005356W WO 2017209432 A1 WO2017209432 A1 WO 2017209432A1
Authority
WO
WIPO (PCT)
Prior art keywords
gop
video content
frame
edited
reference frame
Prior art date
Application number
PCT/KR2017/005356
Other languages
French (fr)
Korean (ko)
Inventor
이규영
천솔지
Original Assignee
(주)잼투고
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by (주)잼투고 filed Critical (주)잼투고
Publication of WO2017209432A1 publication Critical patent/WO2017209432A1/en

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/23Processing of content or additional data; Elementary server operations; Server middleware
    • H04N21/234Processing of video elementary streams, e.g. splicing of video streams or manipulating encoded video stream scene graphs
    • H04N21/2343Processing of video elementary streams, e.g. splicing of video streams or manipulating encoded video stream scene graphs involving reformatting operations of video signals for distribution or compliance with end-user requests or end-user device requirements
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/169Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
    • H04N19/177Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being a group of pictures [GOP]
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/23Processing of content or additional data; Elementary server operations; Server middleware
    • H04N21/234Processing of video elementary streams, e.g. splicing of video streams or manipulating encoded video stream scene graphs
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/23Processing of content or additional data; Elementary server operations; Server middleware
    • H04N21/234Processing of video elementary streams, e.g. splicing of video streams or manipulating encoded video stream scene graphs
    • H04N21/23412Processing of video elementary streams, e.g. splicing of video streams or manipulating encoded video stream scene graphs for generating or manipulating the scene composition of objects, e.g. MPEG-4 objects
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/23Processing of content or additional data; Elementary server operations; Server middleware
    • H04N21/234Processing of video elementary streams, e.g. splicing of video streams or manipulating encoded video stream scene graphs
    • H04N21/23418Processing of video elementary streams, e.g. splicing of video streams or manipulating encoded video stream scene graphs involving operations for analysing video streams, e.g. detecting features or characteristics
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/47End-user applications
    • H04N21/472End-user interface for requesting content, additional data or services; End-user interface for interacting with content, e.g. for content reservation or setting reminders, for requesting event notification, for manipulating displayed content
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/47End-user applications
    • H04N21/472End-user interface for requesting content, additional data or services; End-user interface for interacting with content, e.g. for content reservation or setting reminders, for requesting event notification, for manipulating displayed content
    • H04N21/47205End-user interface for requesting content, additional data or services; End-user interface for interacting with content, e.g. for content reservation or setting reminders, for requesting event notification, for manipulating displayed content for manipulating displayed content, e.g. interacting with MPEG-4 objects, editing locally

Definitions

  • the present invention relates to a system and a method for providing a random edited compressed video content, and more particularly, to a random edited compressed video content providing system which can be provided to a user by randomly editing in a GOP unit starting with a best-order non-reference frame during random editing. And a method of providing the same.
  • the uncompressed video content file which contains a complete image of every frame, has a problem of increasing network resources and data transmission costs required for video content transmission due to excessive file size. Accordingly, video storage format standardization organizations such as the Moving Picture Experts Group (MPEG) are proposing various compressed video content storage standards.
  • MPEG Moving Picture Experts Group
  • GOP Group of Picture
  • I frame having complete image information
  • frame before or after it consists of a reference frame (inter frame) such as a B frame or a P frame, which stores only information and has incomplete image information, but greatly reduces the amount of data.
  • the video compression technique contributes to the easy distribution of video contents through the communication network by reducing the amount of data of the video contents by the GOP structure composed of non-reference frames and reference frames.
  • UGC User Generated Contents
  • UGC User Generated Contents
  • the monotony of video content can be improved by editing a single video or a plurality of videos through an editing technique.
  • Korean Patent Application No. 2016-0034287 and Korean Patent Application No. 2016-0048882 filed by the present applicant provide a video content providing system that can enhance the unexpectedness of a video by automatically editing one video or a plurality of videos. It starts.
  • Ericsson's U.S. Patent No. 8,340,113 “Method and arrangement for improved media session management” indicates that the user terminal starts from the video content when the transmission mode is changed when a request is made to change the transmission mode from broadcast to unicast. By requesting a point, it provides an effect of viewing seamless video content despite a change in broadcast mode.
  • the user cannot watch the video content in frame order, so that the user cannot automatically receive the edited video content.
  • Patent Document 1 US Patent No. 9,319,448
  • Patent Document 2 US Patent No. 8,340,113
  • Non-Patent Document 1 Juurlink et al., “Scalable Parallel Programming Applied to H.264 / AVC Decoding”, 2012, pp. 5-11
  • the present invention is to solve the above problems, according to the random edited compressed video content providing system and method according to the present invention, when the video content providing system stores the compressed video content and provides the video content to the user, By providing the video content whose video frames are edited in GOP units according to the order of, the user terminal plays back the edited GOP edited at an arbitrary point from the best non-reference frame that does not require the video information of the preceding frame. It aims at preventing the fall of image quality.
  • the unexpected of the edited video content is to maintain the context of the original video content while improving gender.
  • a system and a method for providing an arbitrarily compressed video content may include: a GOP including an IDR frame (Instantaneous Decoder Refresh Frame) that does not affect the image quality of a preceding frame as a best-order non-reference frame;
  • IDR frame Instantaneous Decoder Refresh Frame
  • another object of the present invention is to prevent deterioration of the quality of a subsequent frame in a preceding frame despite the gap of a GOP edited at an arbitrary point.
  • a video section that is too short in the edited video content is defined by randomly arranging consecutive GOPs instead of a single GOP as an edited GOP array. Another purpose is to prevent them from being randomly distributed and to maintain the context of the original video content.
  • a GOP including the same GOP consecutively is defined as an edited GOP array, so that automatic random editing is performed on a per-object basis. Another purpose is to maintain the context of the original content.
  • the random edited compressed video content providing system includes a plurality of reference frames that refer to information of a best-order non-reference frame that does not refer to information of a preceding frame and information of a preceding or following frame.
  • a video content storage unit for storing compressed video content including first to Nth GOPs each configured to include N (an integer of 2 or more); And providing the compressed video content to a user terminal in units of GOPs through a communication network, wherein at least some of the GOPs are provided to the user terminal in order, and k th GOP (k is an integer of 2 or more and N-1 or less). And a video content providing unit provided to the user terminal in any order preceding the k-1 GOP or following the k + 1 GOP.
  • the best-order non-reference frame is an I frame (Infra Frame), and the reference frame is a P frame (Predicted Frame) and / or B frame (Bidirectional Frame) It is characterized by the).
  • the video content providing unit may include a GOP including an I frame located at a scene change point detected through scene change detection as the best-order non-reference frame.
  • the kth GOP is selected.
  • the video content providing unit selects a GOP including an Instantaneous Decoder Refresh (IDR) frame as the best non-reference frame as the k-th GOP. It features.
  • IDR Instantaneous Decoder Refresh
  • the video content providing unit may include the edited GOP array defined as the kth GOP to the k + m GOP (m is an integer of 10 or more). and providing the user terminal in any order to precede the k-1 GOP or follow the k + m + 1 GOP.
  • the random edited compressed video content providing system further includes a video object analyzer configured to analyze the VOPs (Video Object Planes) of the best non-reference frame and the image of the reference frame, and provide the video content.
  • the unit is characterized by defining the k-th GOP to the k + m GOP including the same VOP consecutively as the edited GOP array.
  • the random edited compressed video content providing system may further include a face recognizing unit that recognizes a face of a person included in an image of the best non-reference frame and the reference frame and associates it with a face identifier.
  • the video content providing unit may define the kth GOP to the k + m GOPs associated with the same face identifier consecutively as the edited GOP array.
  • the random edited compressed video content providing method includes a plurality of reference frames that refer to information of a best-order non-reference frame that does not refer to information of a preceding frame and information of a preceding or following frame in a video content providing system.
  • a second step of the video content providing system receiving a random edit request for requesting arbitrary edit of the compressed video content from a user terminal through a communication network; And the video content providing system to provide the compressed video content to the user terminal in units of GOPs through a communication network, wherein at least some of the GOPs are provided to the user terminal in order, and the kth GOP (k is equal to or greater than 2).
  • the video content providing system in the first step, includes: the best non-reference frame is an I frame (Infra Frame), and the reference frame is P; And store the compressed video content as a frame and / or a bidirectional frame.
  • the third step may include: deactivating the I frame located at the scene change point detected by the scene change detection by the video content providing system;
  • the GOP included in the frame is selected as the k-th GOP.
  • the third step may include the GOP including the Instantaneous Decoder Refresh (IDR) frame as the best non-reference frame.
  • IDR Instantaneous Decoder Refresh
  • the video content providing system edits the k th GOP to the k + m GOP (m is an integer of 10 or more).
  • a GOP array is defined, and the edited GOP array is provided to the user terminal in any order to precede the k-1 GOP or to follow the k + m + 1 GOP.
  • the first step is the video content providing system, the VOP (Video Object Planes) of the image of the best-order non-reference frame and the reference frame
  • a second sub step of analyzing wherein the video content system defines, as the edited GOP array, the k th GOPs to the k + m GOPs including the same VOP consecutively. It is characterized by.
  • the video content providing system recognizes the face of the person included in the image of the best-order non-reference frame and the reference frame. And a second sub-step of associating with a face identifier, wherein the second step includes: the editing GOP of the k th GOP to the k + m GOPs in which the same face identifier is continuously connected. It is characterized by defining as an array.
  • the video content providing system stores the compressed video content and provides the video content to the user in any order. Accordingly, by providing video content in which video frames are edited in units of GOPs, the user terminal plays back edited GOPs edited at random points from the best non-reference frame that does not require the image information of the preceding frame, thereby deteriorating image quality despite random editing. Provides the effect of preventing.
  • the unexpectedness of the edited video content by detecting a scene change point in the video content and editing the non-reference frame detected by the scene change to an arbitrary point It provides the effect of maintaining the context of the original video content while increasing the value.
  • the random point is randomly edited by randomly editing a GOP including an IDR frame as the best non-reference frame that does not affect the image quality of a preceding frame.
  • a GOP including an IDR frame as the best non-reference frame that does not affect the image quality of a preceding frame.
  • a video section that is too short in the edited video content is defined by randomly arranging consecutive GOPs instead of a single GOP as an edited GOP array. To prevent them from being randomly distributed and to maintain the context of the original video content.
  • a system and a method for providing a random edited compressed video content may include defining GOPs including the same VOP consecutively as an edited GOP array, thereby automatically editing the original content in units of a photographed object. It provides the effect of maintaining context.
  • FIG. 1 is a block diagram showing a system for providing arbitrarily compressed video content according to an embodiment of the present invention.
  • FIG. 2 is a frame configuration diagram illustrating a GOP structure of compressed video content.
  • FIG. 3 is a frame configuration diagram illustrating a GOP structure of arbitrarily edited compressed video content according to an embodiment of the present invention.
  • FIG. 4 is a frame diagram illustrating a GOP structure of arbitrarily edited compressed video content according to another embodiment of the present invention.
  • 5 is a GOP configuration diagram showing a GOP structure of compressed video content.
  • FIG. 6 is a GOP diagram illustrating a GOP structure of compressed video content arbitrarily edited in an edited GOP array unit according to another embodiment of the present invention.
  • FIG. 6 is a GOP diagram illustrating a GOP structure of compressed video content arbitrarily edited in an edited GOP array unit according to another embodiment of the present invention.
  • FIG. 7 is a GOP diagram illustrating a GOP structure of compressed video content arbitrarily edited in an edited GOP array unit according to another embodiment of the present invention.
  • FIG. 8 is a flowchart illustrating a method for providing arbitrary edited compressed video content according to the present invention.
  • Video content providing system 100 Video content storage: 110
  • Video content provider 120 Video object analyzer: 130
  • Facial recognition unit 140
  • User terminal 200-1, 200-2
  • the description that a part “includes” an element means that the element may further include other elements, except for the absence of a special objection thereto.
  • the terms “.. module”, “.. unit” and “.. system” described in the specification mean a unit that processes at least one function or operation, which is hardware or software or a combination of hardware and software. It may be implemented, and may be included in one device or each other device.
  • the random edited compressed video content includes a video content storage unit 110 and a video content providing unit 120.
  • the video content storage unit 110 and the video content providing unit 120 may be implemented in a single server system, or may be implemented as a server system configured as a separate server through a communication network.
  • hardware such as a logic circuit, a memory, a storage device, or the like, may be implemented in the form of program codes of software that controls hardware, such as a logic circuit, a memory, a storage device, and the like, rather than a server-client system.
  • the video content storage unit 110 includes a first reference frame (frame_inter_first) that does not refer to the information of the preceding frame and a plurality of reference frames (frame_inter) which refer to the information of the preceding or following frames, respectively.
  • a compressed video content (content_compress) including a GOP (GOP_1st) to an Nth GOP (GOP_Nth) (N is an integer of 2 or more) is stored.
  • a GOP is a set of frames and is composed of at least one reference frame (frame_inter) and at least one reference frame (frame_inter).
  • the number of frames included in the GOP can be specified by the user. For example, the number of frames can be determined to have a time of about 0.5 seconds.
  • the reference frame (frame_inter) itself contains data such as brightness, color, etc. of all the pixels for the frame, the size of the data is large.
  • a video compression standard such as MPEG
  • an I frame (Infra Frame) and an IDR frame (Instantaneous Decoder Refresh Frame) to be described later correspond to a reference frame (frame_inter).
  • the reference frame frame_inter includes only information about pixels or video object planes (VOPs) changed in the preceding reference frame (frame_inter), the data is small in size. Since the next frame in the video content is very short in time, the change of data between the successive frames is very small. Therefore, storing only the changed data is efficient in terms of data size.
  • a video compression standard such as MPEG
  • P frames Predicted Frame
  • B frames Bidirectional Frame
  • the P frame refers to the preceding I frame or another P frame
  • the B frame refers to both the preceding frames and the following frames, it is common that the size of the data is smaller than the P frame.
  • the k th GOP (GOP_kth) is composed of an I frame, a B frame, a B frame, a P frame, a B frame, and a B frame in order, an I frame that is a best-order reference frame (frame_inter_first), and a reference frame. It consists of four B frames and one P frame (frame_inter).
  • the I frame is implemented through the data of the I frame when the video content is played.
  • the P frame which is the third trailing frame in the I frame, refers to the I frame
  • the B frame which is the second trailing frame in the I frame.
  • the video content is played back in such a way that it refers to the preceding I frame and the following P frame
  • the B frame which is the third trailing frame in the I frame, refers to the preceding I frame and B frame and the following P frame.
  • the video content storage unit 110 of the present invention may be implemented to include a function of converting and storing uncompressed video content into compressed video content (content_compress), or to store compressed video content (content_compress) that is already compressed. May be
  • the video content providing unit 120 provides the compressed video content (content_compress) to the user terminals 200-1 and 200-2 in units of GOPs through a communication network, and at least some of the GOPs are in order. 1, 200-2), and precedes k-th GOP (GOP_kth) (k is an integer of 2 or more and N-1 or less) k-1 GOP (GOP_k-1th) or k + 1 GOP (GOP_k). + 1th) to the user terminal (200-1, 200-2) in any order that follows.
  • k-th GOP (k is an integer of 2 or more and N-1 or less) k-1 GOP (GOP_k-1th) or k + 1 GOP (GOP_k). + 1th) to the user terminal (200-1, 200-2) in any order that follows.
  • the video content providing unit 120 provides a video content through a communication network such as the Internet or an intranet in the case of a server-client system, and provides a video content through system internal data processing rather than a communication network when implemented on a single client. can do.
  • a communication network such as the Internet or an intranet in the case of a server-client system
  • the video content providing unit 120 provides at least some of the GOPs to the user terminals 200-1 and 200-2 in the order of the original video content, and at least some other GOPs are different from the order of the original video content.
  • the ratio of the non-edited GOPs provided in the order and the edited GOPs provided in the order may be variously applied according to an embodiment. For example, if you want to make a lot of changes in the original video content, the ratio of edited GOP to non-edited GOP may be 90% .In contrast, if you want to make some changes while maintaining the context of the original video content, The ratio can be set at 10%.
  • the compressed video content (content_compress) providing system may determine the ratio of the edited GOP to the non-edited GOP at an arbitrary ratio.
  • the determination of the GOPs to be edited GOPs is automatically performed by the system for providing compressed video content (content_compress) rather than user decision or by a predetermined algorithm through image processing. Configure to decide.
  • Any kth GOP may be disposed adjacent to or in front of a preceding k-1 GOP (GOP_k-1th) as shown in FIG. 3, for example, a k-20 GOP (GOP_k-20th). It may be spaced apart from the k-1th GOP (GOP_k-1th) in front of the. Conversely, any k GOP (GOP_kth) is arranged adjacent to or behind the following k + 1 GOPs (GOP_k + 1th) as shown in FIG. 4, for example k + 20 GOP (GOP_k + 20th). ) May be spaced apart from the k + 1th GOP (GOP_k + 1th) at the rear of the.
  • the video content providing unit 120 may randomly edit a single compressed video content (content_compress) or may arbitrarily edit a plurality of compressed video contents (content_compress).
  • content_compress When randomly editing a plurality of compressed video contents (content_compress), the first compressed video content (content_compress_1st) is composed of 150 GOPs, and the second compressed video content (content_compress_2nd) is composed of 300 GOPs.
  • the third compressed video content (content_compress_3rd) is composed of 200 GOPs, the video content providing unit 120 regards the compressed video content (content_compress) composed of a total of 650 GOPs according to an arbitrary compressed video content order and edits randomly. This may be provided to the user terminals 200-1 and 200-2. According to this embodiment, the unexpectedness of the randomly edited video content is increased.
  • the recent video compression technology has been developed to position the scene reference point (frame_inter_first) of the GOP through the scene change detection through image processing. This is because when the scene transformation occurs in the middle of the GOP, frame correlation between the preceding frame and the following frame is lowered, and thus, there is a problem of increasing the data amount of the B frame or P frame, which is the reference frame (frame_inter). .
  • the scene change point is edited by scene unit by using the GOP that defines the transition point as the best reference frame (frame_inter_first) as the editing GOP through scene change detection, so that it is edited in units of scenes. It can provide the effect of automatically generating content.
  • the video content providing unit 120 may be configured to select a GOP including the I frame located at the scene change point detected through scene change detection as the best reference frame (frame_inter_first) as the k th GOP (GOP_kth). Do.
  • the video content providing unit 120 may change the order of the GOP without changing the compressed video content (content_compress) file and change the order of the GOP to provide a video transmission format.
  • the video content providing unit 120 changes the GOP order of the compressed video content (content_compress) file and stores the compressed video content (content_compress) file according to an arbitrary editing result or compresses the video according to the changed GOP order in the buffer memory.
  • a copy of all or part of the content (content_compress) file may be uploaded and provided.
  • the last frame of the k-1 GOP (GOP_k-1th), which is the frame immediately before the I frame that is the best reference frame (frame_inter_first) of the k GOP (GOP_kth), is the bidirectional reference frame (frame_inter).
  • B frame As shown in FIG. 3, if the k-th frame GOP_kth is moved to another position as a result of the arbitrary editing of the video content providing unit 120, the B frame, which is the last frame of the k-1 GOP GP_k-1th, is referred to. The trailing frame may be lost and the frame image may not be completely created.
  • data in some frames is selected by selecting a GOP including the IDR frame, which is not referenced by the preceding frames of the preceding GOP, as the best-order non-reference frame as the kth GOP (GOP_kth). It can provide the effect of preventing the loss.
  • the video content providing unit 120 is edited as defined by k th GOP (GOP_kth) to k + m GOP (GOP_k + mth) (m is an integer of 10 or more).
  • the user terminals 200-1 and 200-2 in any order to precede the GOP array array_GOP_edit or to follow the k-1 m GOP (GOP_k-1 th) or the k + m + 1 GOP (GOP_k + m + 1 th). It is desirable to provide to. In this case, at least 11 GOPs should be set to one edit GOP array (array_GOP_edit). Since the object to be arbitrarily edited has a length of at least 1 second to 10 seconds, m is preferably set to 10 or more.
  • FIG. 5 is a GOP diagram illustrating a GOP structure of compressed video content (content_compress), and FIGS. 6 and 7 are compressed video content (content_compress) arbitrarily edited in an edited GOP array (array_GOP_edit) according to an embodiment of the present invention.
  • the editing GOP array array_GOP_edit is arranged to be adjacent to a second GOP (GOP_2nd) which is an adjacent preceding GOP, or as shown in FIG. 7, the editing GOP array (array_GOP_edit) may be
  • the sixth GOP (GOP_6th) which is an adjacent trailing GOP, may be disposed to be trailed apart from each other.
  • the edited GOP array (array_GOP_edit) of at least 10 consecutive GOPs are edited to provide users with randomly edited video content that can increase the unexpectedness while maintaining the context of the original video content. It is effective.
  • the editing GOP arrays is more preferably composed of GOPs containing the same video object plane (VOP), such as the same person, the same object, the same background.
  • the random edited compressed video content (content_compress) providing system includes a video object analyzer 130 for analyzing VOPs (Video Object Planes) of the image of the best reference frame (frame_inter_first) and the reference frame (frame_inter).
  • the video content providing unit 120 may be configured to define a kth GOP (GOP_kth) to a k + m GOP (GOP_k + mth) including the same VOP consecutively as an edited GOP array (array_GOP_edit).
  • the editing GOP arrays are composed of GOPs that continuously include the same person through face recognition.
  • the arbitrary editing compressed video content (content_compress) providing system recognizes the face of the person included in the image of the best line reference frame (frame_inter_first) and the reference frame (frame_inter) and associates it with the face identifier (id_facial). More).
  • the video content providing unit 120 defines k-th GOP (GOP_kth) to k + m (GOP_k + mth) GOPs continuously associated with the same face identifier (id_facial) as an edited GOP array (array_GOP_edit). .
  • the facial identifier is not particularly limited, and the name, alias, and identifier of a previously recognized face and its corresponding person can be found through the internal memory or an external server, and used as the face identifier (id_facial). It may be an identifier assigned in a specific order or arbitrarily according to facial features.
  • the video content providing system 100 includes a best reference frame (frame_inter_first) that does not refer to the information of the preceding frame and a plurality of reference frames (frame_inter) which refer to the information of the preceding or following frames, respectively.
  • a first step S10 of storing compressed video content (content_compress) including first GOP (GOP_1st) to Nth GOP (GOP_Nth) (N is an integer of 2 or more) is performed.
  • the reference frame (frame_inter_first) may be an I frame or an IDR frame
  • the reference frame (frame_inter) may be a B frame or a P frame.
  • a second step (s20) in which the video content providing system 100 receives a random edit request for requesting arbitrary editing of the compressed video content (content_compress) from the user terminals 200-1 and 200-2 through a communication network.
  • the video content providing system 100 provides the compressed video content (content_compress) to the user terminals 200-1 and 200-2 in units of GOPs through a communication network, and at least some of the GOPs are provided in order. 200-1, 200-2, and precede kth G-1 GOP (GOP_k-1th) with kth GOP (GOP_kth), where k is an integer of 2 or more and N-1 or less.
  • a third step s30 is provided to the user terminals 200-1 and 200-2 in any order following GOP_k + 1th.
  • the video content providing system 100 converts the I frame located at the scene change point detected by the scene change detection into the best reference frame (frame_inter_first). It is preferable to select the containing GOP as the kth GOP (GOP_kth).
  • step S30 the video content providing system 100 selects a GOP including an IDR frame (Instantaneous Decoder Refresh) as the best-order reference frame (frame_inter_first) as the k th GOP (GOP_kth).
  • IDR frame Instantaneous Decoder Refresh
  • the video content providing system 100 performs k GOP (GOP_kth) to k + m GOP (GOP_k + mth) (m is an integer of 10 or more) so that the editing GOPs which are automatically edited have an appropriate length.
  • k GOP k + m GOP
  • m is an integer of 10 or more
  • the editing GOPs which are automatically edited have an appropriate length.
  • the first step (s10) is performed by the video content providing system 100 to edit GOPs including the same VOP consecutively in an edited GOP array (array_GOP_edit) unit.
  • the method further includes a first substep s11 for analyzing the VOPs (Video Object Planes) of the image of the best row reference frame (frame_inter_first) and the reference frame (frame_inter), and the second step (s20) includes a VOP having the same video content system. It is preferable to define the kth GOP (GOP_kth) to k + m GOP (GOP_k + mth) including the consecutively as an edited GOP array (array_GOP_edit).
  • the first step (s10) is performed by the video content providing system 100 to edit GOPs containing the same person consecutively in an edited GOP array (array_GOP_edit) unit. And further including a second substep s12 for recognizing the face of the person included in the image of the best row reference frame frame_inter_first and the reference frame frame_inter and associating it with the face identifier id_facial.
  • the video content system defines a kth GOP (GOP_kth) to a k + m GOP (GOP_k + mth) continuously associated with the same face identifier (id_facial) as an edited GOP array (array_GOP_edit).

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Databases & Information Systems (AREA)
  • Human Computer Interaction (AREA)
  • Television Signal Processing For Recording (AREA)
  • Compression Or Coding Systems Of Tv Signals (AREA)

Abstract

The present invention relates to a randomly-edited compressed video provision system and provision method. The randomly-edited compressed video content provision system, according to the present invention, comprises: a video content storage unit for storing compressed video content comprising a first GOP to an Nth GOP (N is an integer greater than or equal to 2), each GOP comprising a first non-reference frame which does not refer to information on a preceding frame, and multiple reference frames which refer to information on a preceding or succeeding frame; and a video content provision unit for providing the compressed video content in GOP units to a user terminal via a communication network, wherein at least some GOPs are provided to the user terminal in order, and a kth GOP (k is an integer greater than or equal to 2 and less than or equal to N-1) is provided to the user terminal in a random order preceding a (k-1)th GOP or succeeding a (k+1)th GOP.

Description

임의편집 압축 동영상 컨텐츠 제공 시스템 및 제공방법Arbitrary editing compressed video contents provision system and method
본 발명은 임의편집 압축 동영상 컨텐츠 제공 시스템 및 제공방법에 관한 것으로 보다 상세하게는 임의편집시 최선행 비참조 프레임으로 시작되는 GOP 단위로 임의편집하여 사용자에게 제공할 수 있는 임의편집 압축 동영상 컨텐츠 제공 시스템 및 제공방법에 관한 것이다.The present invention relates to a system and a method for providing a random edited compressed video content, and more particularly, to a random edited compressed video content providing system which can be provided to a user by randomly editing in a GOP unit starting with a best-order non-reference frame during random editing. And a method of providing the same.
통신망을 통한 다양한 동영상 컨텐츠의 전송이 일반화되었다. 모든 프레임이 완전한 이미지를 포함하는 비압축 동영상 컨텐츠 파일은 과도한 파일 크기로 인해 통신망 전송시 동영상 컨텐츠 전송에 필요한 망 자원의 점유가 높아지고 데이터 전송 비용이 높아지는 문제가 있었다. 이에 따라 MPEG(Moving Picture Experts Group)과 같은 동영상 저장포맷 표준화 단체들은 다양한 압축 동영상 컨텐츠 저장표준을 제안하고 있다.Transmission of various video contents through a communication network has become common. The uncompressed video content file, which contains a complete image of every frame, has a problem of increasing network resources and data transmission costs required for video content transmission due to excessive file size. Accordingly, video storage format standardization organizations such as the Moving Picture Experts Group (MPEG) are proposing various compressed video content storage standards.
MPEG-2, MPEG-4, ITU-T H.263, ITU-T H.264/MPEG-4, Part 10, Advanced Video Codec(AVC) 등의 동영상 압축표준에 따르면 도 2에 도시된 바와 같이 동영상 컨텐츠를 예컨대 0.5초 내외의 길이를 갖는 GOP(Group of Picture) 단위로 분할하고, 하나의 GOP는 완전한 영상정보를 갖는 I 프레임 등의 비참조 프레임(intra frame) 및 앞 또는 뒤의 프레임에서의 변경정보만 저장하여 불완전한 영상정보를 갖지만 데이터 양을 대폭 감소시킨 B 프레임 또는 P 프레임 등의 참조 프레임(inter frame)으로 구성된다. 동영상 압축 기법은 비참조 프레임과 참조 프레임으로 구성되는 GOP 구조에 의해 동영상 컨텐츠의 데이터 양을 저감하여 동영상 컨텐츠가 통신망을 통해 쉽게 유통되도록 기여했다. (참고문헌, Scalable Parallel Programming Applied to H.264/AVC Decoding, pp. 5-15)According to the video compression standards such as MPEG-2, MPEG-4, ITU-T H.263, ITU-T H.264 / MPEG-4, Part 10, Advanced Video Codec (AVC), as shown in FIG. The content is divided into, for example, GOP (Group of Picture) units having a length of about 0.5 seconds, and one GOP is changed in an intra frame, such as an I frame having complete image information, and a frame before or after it. It consists of a reference frame (inter frame) such as a B frame or a P frame, which stores only information and has incomplete image information, but greatly reduces the amount of data. The video compression technique contributes to the easy distribution of video contents through the communication network by reducing the amount of data of the video contents by the GOP structure composed of non-reference frames and reference frames. (Reference, Scalable Parallel Programming Applied to H.264 / AVC Decoding, pp. 5-15)
한편 전문가가 아닌 일반 사용자가 제작한 UGC(User Generated Contents)는 일반 사용자의 촬영기술, 촬영장비, 편집기술 및 편집장비의 한계로 인해 단조로운 영상의 연속으로 구성되는 경우가 많아 다른 사용자들의 흥미를 끌기 어려운 한계가 있었다. 이러한 문제를 해결하기 위해서는 편집기술을 통해 하나의 동영상 또는 다수의 동영상을 편집함으로써 동영상 컨텐츠의 단조로움을 개선할 수 있다. 본 출원인에 의해 출원된 한국 특허출원 제2016-0034287호 및 한국 특허출원 제2016-0048882호는 하나의 동영상 또는 복수의 동영상을 자동으로 임의편집함으로써 동영상의 의외성을 높일 수 있는 동영상 컨텐츠 제공 시스템을 개시한다.On the other hand, UGC (User Generated Contents) produced by non-professional users is often composed of a series of monotonous images due to limitations of general users' shooting technology, shooting equipment, editing technology and editing equipment. There was a hard limit. In order to solve this problem, the monotony of video content can be improved by editing a single video or a plurality of videos through an editing technique. Korean Patent Application No. 2016-0034287 and Korean Patent Application No. 2016-0048882 filed by the present applicant provide a video content providing system that can enhance the unexpectedness of a video by automatically editing one video or a plurality of videos. It starts.
한편, 비압축 동영상 컨텐츠가 아닌 압축 동영상 컨텐츠의 경우 임의편집되는 시작 지점이 비참조 프레임인 경우 선행하는 참조 프레임의 부재로 인해 임의편집된 구간의 시작지점에서 참조 데이터의 유실로 인한 화질저하가 발생하는 문제점이 있다.On the other hand, in the case of compressed video contents other than uncompressed video contents, if the starting point to be randomly edited is an unreferenced frame, image quality deterioration occurs due to the loss of reference data at the beginning of the randomly edited section due to the absence of a preceding reference frame. There is a problem.
Qualcomm사의 미국등록특허 제9,319,448호 “Trick modes for network streaming of coded multimedia data”는 동영상 컨텐츠 파일의 임의접근(random access)을 위해 참조 프레임의 위치를 별도의 정보인 임의접근점(Random Access Point)으로 저장함으로써 사용자 단말기에서 동영상 컨텐츠 파일의 구간검색시 비참조 프레임으로 용이하게 접근하는 효과를 제공한다. 하지만 이러한 선행기술에 의하면 사용자에게 동영상 컨텐츠의 임의 지점에 대한 접근만 제공할 뿐 자동으로 임의편집된 동영상 컨텐츠를 제공할 수 없는 문제가 있다.Qualcomm U.S. Patent No. 9,319,448, "Trick modes for network streaming of coded multimedia data," refers to the location of a reference frame as random information (Random Access Point) for random access to video content files. By storing, the user terminal provides an effect of easily accessing a non-reference frame when searching a section of a video content file. However, according to the prior art, there is a problem in that it is not possible to automatically provide video content that is automatically edited only to provide the user with access to any point of the video content.
Ericsson사의 미국등록특허 제8,340,113호 “Method and arrangement for improved media session management”는 브로드캐스트(broadcast)에서 유니캐스트(unicast)로 동영상 전송 모드 변경의 요청이 있는 경우 사용자 단말기는 전송 모드 변환시 동영상 컨텐츠에서 시작지점을 요청함으로써 방송모드 변경에도 불구하고 끊김 없는(seamless) 동영상 컨텐츠 시청을 할 수 있는 효과를 제공한다. 하지만 이러한 선행기술에 의하면 사용자는 동영상 컨텐츠를 프레임 순서대로 시청하기 때문에 자동으로 임의편집된 동영상 컨텐츠를 제공받을 수 없는 문제가 있다.Ericsson's U.S. Patent No. 8,340,113 “Method and arrangement for improved media session management” indicates that the user terminal starts from the video content when the transmission mode is changed when a request is made to change the transmission mode from broadcast to unicast. By requesting a point, it provides an effect of viewing seamless video content despite a change in broadcast mode. However, according to the prior art, the user cannot watch the video content in frame order, so that the user cannot automatically receive the edited video content.
[특허문헌][Patent Documents]
(특허문헌 1) 미국등록특허 제9,319,448호(Patent Document 1) US Patent No. 9,319,448
(특허문헌 2) 미국등록특허 제8,340,113호(Patent Document 2) US Patent No. 8,340,113
[비특허문헌][Non-Patent Documents]
(비특허문헌 1) Juurlink et al., “Scalable Parallel Programming Applied to H.264/AVC Decoding”, 2012, pp. 5-11(Non-Patent Document 1) Juurlink et al., “Scalable Parallel Programming Applied to H.264 / AVC Decoding”, 2012, pp. 5-11
본 발명은 상기의 문제를 해결하기 위한 것으로, 본 발명에 따른 임의편집 압축 동영상 컨텐츠 제공 시스템 및 제공방법에 따르면, 동영상 컨텐츠 제공 시스템은 압축 동영상 컨텐츠를 저장하고 사용자에게 동영상 컨텐츠를 제공하는 경우, 임의의 순서에 따라 GOP 단위로 동영상 프레임이 편집된 동영상 컨텐츠를 제공함으로써 사용자 단말기는 선행하는 프레임의 영상정보를 요하지 않는 최선행 비참조 프레임으로부터 임의지점으로 편집된 편집 GOP를 재생하여 임의편집에도 불구하고 화질의 저하를 방지하는 것을 목적으로 한다.The present invention is to solve the above problems, according to the random edited compressed video content providing system and method according to the present invention, when the video content providing system stores the compressed video content and provides the video content to the user, By providing the video content whose video frames are edited in GOP units according to the order of, the user terminal plays back the edited GOP edited at an arbitrary point from the best non-reference frame that does not require the video information of the preceding frame. It aims at preventing the fall of image quality.
본 발명의 실시예에 따른 임의편집 압축 동영상 컨텐츠 제공 시스템 및 제공방법에 따르면, 동영상 컨텐츠 내의 장면변환 지점을 검출하여 장면변화로 검출된 비참조 프레임을 임의지점으로 편집함으로써, 편집된 동영상 컨텐츠의 의외성을 높이면서도 원본 동영상 컨텐츠의 맥락을 유지하는 것을 다른 목적으로 한다.According to the system and method for providing a random edited compressed video content according to an embodiment of the present invention, by detecting a scene change point in the video content and editing the non-reference frame detected by the scene change to an arbitrary point, the unexpected of the edited video content Its purpose is to maintain the context of the original video content while improving gender.
본 발명의 다른 실시예에 따른 임의편집 압축 동영상 컨텐츠 제공 시스템 및 제공방법에 따르면, 선행하는 프레임의 화질에 영향을 주지 않는 IDR 프레임(Instantaneous Decoder Refresh Frame)을 최선행 비참조 프레임으로 포함하는 GOP를 임의편집함으로써, 임의지점으로 편집된 GOP의 공백에도 불구하고 선행하는 프레임에서의 후행 프레임의 화질 저하를 방지하는 것을 다른 목적으로 한다.According to another exemplary embodiment of the present invention, a system and a method for providing an arbitrarily compressed video content may include: a GOP including an IDR frame (Instantaneous Decoder Refresh Frame) that does not affect the image quality of a preceding frame as a best-order non-reference frame; By arbitrary editing, another object of the present invention is to prevent deterioration of the quality of a subsequent frame in a preceding frame despite the gap of a GOP edited at an arbitrary point.
본 발명의 다른 실시예에 따른 임의편집 압축 동영상 컨텐츠 제공 시스템 및 제공방법에 따르면, 단일의 GOP가 아닌 연속하는 GOP들을 편집 GOP 어레이로 정의하여 이를 임의배치함으로써, 편집된 동영상 컨텐츠에서 너무 짧은 동영상 구간들이 임의로 분산되는 것을 방지하고 원본 동영상 컨텐츠의 맥락을 유지하는 것을 다른 목적으로 한다.According to another embodiment of the present invention and a method for providing a random edited compressed video content, a video section that is too short in the edited video content is defined by randomly arranging consecutive GOPs instead of a single GOP as an edited GOP array. Another purpose is to prevent them from being randomly distributed and to maintain the context of the original video content.
본 발명의 다른 실시예에 따른 임의편집 압축 동영상 컨텐츠 제공 시스템 및 제공방법에 따르면, 동일한 GOP가 연속적으로 포함된 GOP들을 편집 GOP 어레이(array)로 정의함으로써, 피촬영객체 단위로 자동 임의 편집을 통해 원본 컨텐츠의 맥락을 유지할 수 있는 것을 다른 목적으로 한다.According to another embodiment of the present invention and a system for providing a random edited compressed video content, a GOP including the same GOP consecutively is defined as an edited GOP array, so that automatic random editing is performed on a per-object basis. Another purpose is to maintain the context of the original content.
마지막으로, 본 발명의 다른 실시예에 따른 임의편집 압축 동영상 컨텐츠 제공 시스템 및 제공방법에 따르면, 동일한 인물이 연속적으로 포함된 GOP들을 편집 GOP 어레이로 정의함으로써, 등장인물 단위로 자동 임의 편집을 통해 원본 컨텐츠의 맥락을 유지할 수 있는 것을 다른 목적으로 한다.Finally, according to another system and method for providing a random edited compressed video content according to another embodiment of the present invention, by defining a GOP array containing the same person continuously as an edited GOP array, the original through automatic random editing in the character unit Another purpose is to be able to maintain the context of the content.
상기의 목적을 달성하기 위해 본 발명에 따른 임의편집 압축 동영상 컨텐츠 제공 시스템은, 선행하는 프레임의 정보를 참조하지 않는 최선행 비참조 프레임 및 선행 또는 후행하는 프레임의 정보를 참조하는 복수의 참조 프레임들을 포함하여 각각 구성되는 제 1 GOP 내지 제 N GOP로(N은 2 이상의 정수) 구성되는 압축 동영상 컨텐츠를 저장하는 동영상 컨텐츠 저장부; 및 통신망을 통해 상기 압축 동영상 컨텐츠를 GOP 단위로 사용자 단말기에게 제공하되, 적어도 일부의 GOP들은 순서에 따라 상기 사용자 단말기에게 제공하고, 제 k GOP를(k는 2 이상, N-1 이하의 정수) 제 k-1 GOP에 선행하거나 제 k+1 GOP에 후행하는 임의의 순서로 사용자 단말기에게 제공하는 동영상 컨텐츠 제공부;를 포함하여 구성되는 것을 특징으로 한다.In order to achieve the above object, the random edited compressed video content providing system according to the present invention includes a plurality of reference frames that refer to information of a best-order non-reference frame that does not refer to information of a preceding frame and information of a preceding or following frame. A video content storage unit for storing compressed video content including first to Nth GOPs each configured to include N (an integer of 2 or more); And providing the compressed video content to a user terminal in units of GOPs through a communication network, wherein at least some of the GOPs are provided to the user terminal in order, and k th GOP (k is an integer of 2 or more and N-1 or less). And a video content providing unit provided to the user terminal in any order preceding the k-1 GOP or following the k + 1 GOP.
본 발명의 실시예에 따른 임의편집 압축 동영상 컨텐츠 제공 시스템에 있어서, 상기 최선행 비참조 프레임은 I 프레임(Infra Frame)이고, 상기 참조 프레임은 P 프레임(Predicted Frame) 및/또는 B 프레임(Bidirectional Frame)인 것을 특징으로 한다.In the arbitrary edited compressed video content providing system according to an embodiment of the present invention, the best-order non-reference frame is an I frame (Infra Frame), and the reference frame is a P frame (Predicted Frame) and / or B frame (Bidirectional Frame) It is characterized by the).
본 발명의 실시예에 따른 임의편집 압축 동영상 컨텐츠 제공 시스템에 있어서, 상기 동영상 컨텐츠 제공부는, 장면변환 검출을 통해 검출된 장면변환 지점에 위치한 I 프레임을 상기 최선행 비참조 프레임으로 포함하는 GOP를 상기 제 k GOP로 선택하는 것을 특징으로 한다.In the arbitrary edited compressed video content providing system according to an embodiment of the present invention, the video content providing unit may include a GOP including an I frame located at a scene change point detected through scene change detection as the best-order non-reference frame. The kth GOP is selected.
본 발명의 실시예에 따른 임의편집 압축 동영상 컨텐츠 제공 시스템에 있어서, 상기 동영상 컨텐츠 제공부는, IDR 프레임(Instantaneous Decoder Refresh)을 상기 최선행 비참조 프레임으로 포함하는 GOP를 상기 제 k GOP로 선택하는 것을 특징으로 한다.In the arbitrary edited compressed video content providing system according to an embodiment of the present invention, the video content providing unit selects a GOP including an Instantaneous Decoder Refresh (IDR) frame as the best non-reference frame as the k-th GOP. It features.
본 발명의 실시예에 따른 임의편집 압축 동영상 컨텐츠 제공 시스템에 있어서, 상기 동영상 컨텐츠 제공부는, 상기 제 k GOP 내지 상기 제 k+m GOP(m은 10 이상의 정수)로 정의되는 편집 GOP 어레이를 상기 제 k-1 GOP에 선행하거나 k+m+1 GOP에 후행하도록 임의의 순서로 사용자 단말기에게 제공하는 것을 특징으로 한다.In the arbitrary edited compressed video content providing system according to an embodiment of the present invention, the video content providing unit may include the edited GOP array defined as the kth GOP to the k + m GOP (m is an integer of 10 or more). and providing the user terminal in any order to precede the k-1 GOP or follow the k + m + 1 GOP.
본 발명의 실시예에 따른 임의편집 압축 동영상 컨텐츠 제공 시스템은, 상기 최선행 비참조 프레임 및 상기 참조 프레임의 이미지의 VOP(Video Object Plane)들을 분석하는 비디오 객체 분석부를 더 포함하고, 상기 동영상 컨텐츠 제공부는, 동일한 VOP가 연속적으로 포함된 상기 제 k GOP 내지 상기 제 k+m GOP를 상기 편집 GOP 어레이로 정의하는 것을 특징으로 한다.The random edited compressed video content providing system according to an exemplary embodiment of the present invention further includes a video object analyzer configured to analyze the VOPs (Video Object Planes) of the best non-reference frame and the image of the reference frame, and provide the video content. The unit is characterized by defining the k-th GOP to the k + m GOP including the same VOP consecutively as the edited GOP array.
본 발명의 실시예에 따른 임의편집 압축 동영상 컨텐츠 제공 시스템은, 상기 최선행 비참조 프레임 및 상기 참조 프레임의 이미지에 포함된 인물의 안면을 인식하여 안면 식별자와 연관시키는 안면 인식부를 더 포함하고, 상기 동영상 컨텐츠 제공부는, 동일한 안면 식별자가 연속적으로 연관된 상기 제 k GOP 내지 상기 제 k+m GOP를 상기 편집 GOP 어레이로 정의하는 것을 특징으로 한다.The random edited compressed video content providing system according to an exemplary embodiment of the present invention may further include a face recognizing unit that recognizes a face of a person included in an image of the best non-reference frame and the reference frame and associates it with a face identifier. The video content providing unit may define the kth GOP to the k + m GOPs associated with the same face identifier consecutively as the edited GOP array.
본 발명에 따른 임의편집 압축 동영상 컨텐츠 제공방법은, 동영상 컨텐츠 제공 시스템이, 선행하는 프레임의 정보를 참조하지 않는 최선행 비참조 프레임 및 선행 또는 후행하는 프레임의 정보를 참조하는 복수의 참조 프레임들을 포함하여 각각 구성되는 제 1 GOP 내지 제 N GOP로(N은 2 이상의 정수) 구성되는 압축 동영상 컨텐츠를 저장하는 제 1 단계; 상기 동영상 컨텐츠 제공 시스템이, 통신망을 통해 사용자 단말기로부터 상기 압축 동영상 컨텐츠의 임의편집을 요청하는 임의편집 요청을 수신하는 제 2 단계; 및 상기 동영상 컨텐츠 제공 시스템이, 통신망을 통해 상기 압축 동영상 컨텐츠를 GOP 단위로 상기 사용자 단말기에게 제공하되, 적어도 일부의 GOP들은 순서에 따라 상기 사용자 단말기에게 제공하고, 제 k GOP를(k는 2 이상, N-1 이하의 정수) 제 k-1 GOP에 선행하거나 제 k+1 GOP에 후행하는 임의의 순서로 사용자 단말기에게 제공하는 제 3 단계;를 포함하여 구성되는 것을 특징으로 한다.The random edited compressed video content providing method according to the present invention includes a plurality of reference frames that refer to information of a best-order non-reference frame that does not refer to information of a preceding frame and information of a preceding or following frame in a video content providing system. A first step of storing the compressed video content composed of the first to Nth GOPs (N is an integer of 2 or more), respectively; A second step of the video content providing system receiving a random edit request for requesting arbitrary edit of the compressed video content from a user terminal through a communication network; And the video content providing system to provide the compressed video content to the user terminal in units of GOPs through a communication network, wherein at least some of the GOPs are provided to the user terminal in order, and the kth GOP (k is equal to or greater than 2). A third step of providing the user terminal in any order preceding the k-1 GOP or following the k + 1 GOP.
본 발명의 실시예에 따른 임의편집 압축 동영상 컨텐츠 제공방법에 있어서, 상기 제 1 단계는, 상기 동영상 컨텐츠 제공 시스템이, 상기 최선행 비참조 프레임은 I 프레임(Infra Frame)이고, 상기 참조 프레임은 P 프레임(Predicted Frame) 및/또는 B 프레임(Bidirectional Frame)인 상기 압축 동영상 컨텐츠를 저장하는 것을 특징으로 한다.In the random edited compressed video content providing method according to an embodiment of the present invention, in the first step, the video content providing system includes: the best non-reference frame is an I frame (Infra Frame), and the reference frame is P; And store the compressed video content as a frame and / or a bidirectional frame.
본 발명의 실시예에 따른 임의편집 압축 동영상 컨텐츠 제공방법에 있어서, 상기 제 3 단계는, 상기 동영상 컨텐츠 제공 시스템이, 장면변환 검출을 통해 검출된 장면변환 지점에 위치한 I 프레임을 상기 최선행 비참조 프레임으로 포함하는 GOP를 상기 제 k GOP로 선택하는 것을 특징으로 한다.In the random edited compressed video content providing method according to an embodiment of the present invention, the third step may include: deactivating the I frame located at the scene change point detected by the scene change detection by the video content providing system; The GOP included in the frame is selected as the k-th GOP.
본 발명의 실시예에 따른 임의편집 압축 동영상 컨텐츠 제공방법에 있어서, 상기 제 3 단계는, 상기 동영상 컨텐츠 제공 시스템이, IDR 프레임(Instantaneous Decoder Refresh)을 상기 최선행 비참조 프레임으로 포함하는 GOP를 상기 제 k GOP로 선택하는 것을 특징으로 한다.In the random edited compressed video content providing method according to an embodiment of the present invention, the third step may include the GOP including the Instantaneous Decoder Refresh (IDR) frame as the best non-reference frame. The kth GOP is selected.
본 발명의 실시예에 따른 임의편집 압축 동영상 컨텐츠 제공방법에 있어서, 상기 제 3 단계는, 상기 동영상 컨텐츠 제공 시스템이, 상기 제 k GOP 내지 상기 제 k+m GOP(m은 10 이상의 정수)를 편집 GOP 어레이로 정의하고, 상기 편집 GOP 어레이를 상기 제 k-1 GOP에 선행하거나 k+m+1 GOP에 후행하도록 임의의 순서로 사용자 단말기에게 제공하는 것을 특징으로 한다.In the random editing compressed video content providing method according to an embodiment of the present invention, in the third step, the video content providing system edits the k th GOP to the k + m GOP (m is an integer of 10 or more). A GOP array is defined, and the edited GOP array is provided to the user terminal in any order to precede the k-1 GOP or to follow the k + m + 1 GOP.
본 발명의 실시예에 따른 임의편집 압축 동영상 컨텐츠 제공방법에 있어서, 상기 제 1 단계는, 상기 동영상 컨텐츠 제공 시스템이, 상기 최선행 비참조 프레임 및 상기 참조 프레임의 이미지의 VOP(Video Object Plane)들을 분석하는 제 1 부단계;를 더 포함하고, 상기 제 2 단계는, 상기 동영상 컨텐츠 시스템이, 동일한 VOP가 연속적으로 포함된 상기 제 k GOP 내지 상기 제 k+m GOP를 상기 편집 GOP 어레이로 정의하는 것을 특징으로 한다.In the random edited compressed video content providing method according to an embodiment of the present invention, the first step is the video content providing system, the VOP (Video Object Planes) of the image of the best-order non-reference frame and the reference frame And a second sub step of analyzing, wherein the video content system defines, as the edited GOP array, the k th GOPs to the k + m GOPs including the same VOP consecutively. It is characterized by.
본 발명의 실시예에 따른 임의편집 압축 동영상 컨텐츠 제공방법에 있어서, 상기 제 1 단계는, 상기 동영상 컨텐츠 제공 시스템이, 상기 최선행 비참조 프레임 및 상기 참조 프레임의 이미지에 포함된 인물의 안면을 인식하여 안면 식별자와 연관시키는 제 2 부단계;를 더 포함하고, 상기 제 2 단계는, 상기 동영상 컨텐츠 시스템이, 동일한 안면 식별자가 연속적으로 연관된 상기 제 k GOP 내지 상기 제 k+m GOP를 상기 편집 GOP 어레이로 정의하는 것을 특징으로 한다.In the random edited compressed video content providing method according to an embodiment of the present invention, in the first step, the video content providing system recognizes the face of the person included in the image of the best-order non-reference frame and the reference frame. And a second sub-step of associating with a face identifier, wherein the second step includes: the editing GOP of the k th GOP to the k + m GOPs in which the same face identifier is continuously connected. It is characterized by defining as an array.
본 발명의 상기의 구성을 통해, 본 발명에 따른 임의편집 압축 동영상 컨텐츠 제공 시스템 및 제공방법에 따르면, 동영상 컨텐츠 제공 시스템은 압축 동영상 컨텐츠를 저장하고 사용자에게 동영상 컨텐츠를 제공하는 경우, 임의의 순서에 따라 GOP 단위로 동영상 프레임이 편집된 동영상 컨텐츠를 제공함으로써 사용자 단말기는 선행하는 프레임의 영상정보를 요하지 않는 최선행 비참조 프레임으로부터 임의지점으로 편집된 편집 GOP를 재생하여 임의편집에도 불구하고 화질의 저하를 방지하는 효과를 제공한다.According to the above-described configuration of the present invention, according to the system and method for providing arbitrary edited compressed video content according to the present invention, the video content providing system stores the compressed video content and provides the video content to the user in any order. Accordingly, by providing video content in which video frames are edited in units of GOPs, the user terminal plays back edited GOPs edited at random points from the best non-reference frame that does not require the image information of the preceding frame, thereby deteriorating image quality despite random editing. Provides the effect of preventing.
본 발명의 실시예에 따른 임의편집 압축 동영상 컨텐츠 제공 시스템 및 제공방법에 따르면, 동영상 컨텐츠 내의 장면변환 지점을 검출하여 장면변화로 검출된 비참조 프레임을 임의지점으로 편집함으로써 편집된 동영상 컨텐츠의 의외성을 높이면서도 원본 동영상 컨텐츠의 맥락을 유지하는 효과를 제공한다.According to an arbitrary editing compressed video content providing system and a method according to an embodiment of the present invention, the unexpectedness of the edited video content by detecting a scene change point in the video content and editing the non-reference frame detected by the scene change to an arbitrary point It provides the effect of maintaining the context of the original video content while increasing the value.
본 발명의 다른 실시예에 따른 임의편집 압축 동영상 컨텐츠 제공 시스템 및 제공방법에 따르면, 선행하는 프레임의 화질에 영향을 주지 않는 IDR 프레임을 최선행 비참조 프레임으로 포함하는 GOP를 임의편집함으로써, 임의지점으로 편집된 GOP의 공백에도 불구하고 선행하는 프레임에서의 후행 프레임의 화질 저하를 방지하는 효과를 제공한다.According to another embodiment of the present invention and a method for providing a random edited compressed video content, the random point is randomly edited by randomly editing a GOP including an IDR frame as the best non-reference frame that does not affect the image quality of a preceding frame. In spite of the gap of the edited GOP, it is possible to prevent the deterioration of the quality of the following frame in the preceding frame.
본 발명의 다른 실시예에 따른 임의편집 압축 동영상 컨텐츠 제공 시스템 및 제공방법에 따르면, 단일의 GOP가 아닌 연속하는 GOP들을 편집 GOP 어레이로 정의하여 이를 임의배치함으로써, 편집된 동영상 컨텐츠에서 너무 짧은 동영상 구간들이 임의로 분산되는 것을 방지하고 원본 동영상 컨텐츠의 맥락을 유지하는 효과를 제공한다.According to another embodiment of the present invention and a method for providing a random edited compressed video content, a video section that is too short in the edited video content is defined by randomly arranging consecutive GOPs instead of a single GOP as an edited GOP array. To prevent them from being randomly distributed and to maintain the context of the original video content.
본 발명의 다른 실시예에 따른 임의편집 압축 동영상 컨텐츠 제공 시스템 및 제공방법에 따르면, 동일한 VOP가 연속적으로 포함된 GOP들을 편집 GOP 어레이로 정의함으로써, 피촬영객체 단위로 자동 임의 편집을 통해 원본 컨텐츠의 맥락을 유지할 수 있는 효과를 제공한다.According to another exemplary embodiment of the present invention, a system and a method for providing a random edited compressed video content may include defining GOPs including the same VOP consecutively as an edited GOP array, thereby automatically editing the original content in units of a photographed object. It provides the effect of maintaining context.
마지막으로, 본 발명의 다른 실시예에 따른 임의편집 압축 동영상 컨텐츠 제공 시스템 및 제공방법에 따르면, 동일한 인물이 연속적으로 포함된 GOP들을 편집 GOP 어레이로 정의함으로써 등장인물 단위로 자동 임의 편집을 통해 원본 컨텐츠의 맥락을 유지할 수 있는 효과를 제공한다.Finally, according to another system and method for providing a random edited compressed video content according to another embodiment of the present invention, by defining the edited GOP array of GOPs containing the same person in succession, the original content through automatic random editing in the character unit Provides the effect of maintaining the context of
도 1은 본 발명의 실시예에 따른 임의편집 압축 동영상 컨텐츠 제공 시스템을 도시하는 구성도.1 is a block diagram showing a system for providing arbitrarily compressed video content according to an embodiment of the present invention.
도 2는 압축 동영상 컨텐츠의 GOP 구조를 도시하는 프레임 구성도.2 is a frame configuration diagram illustrating a GOP structure of compressed video content.
도 3은 본 발명의 실시예에 따라 임의편집된 압축 동영상 컨텐츠의 GOP 구조를 도시하는 프레임 구성도.3 is a frame configuration diagram illustrating a GOP structure of arbitrarily edited compressed video content according to an embodiment of the present invention.
도 4는 본 발명의 다른 실시예에 따라 임의편집된 압축 동영상 컨텐츠의 GOP 구조를 도시하는 프레임 구성도.4 is a frame diagram illustrating a GOP structure of arbitrarily edited compressed video content according to another embodiment of the present invention.
도 5는 압축 동영상 컨텐츠의 GOP 구조를 도시하는 GOP 구성도.5 is a GOP configuration diagram showing a GOP structure of compressed video content.
도 6은 본 발명의 다른 실시예에 따라 편집 GOP 어레이 단위로 임의편집된 압축 동영상 컨텐츠의 GOP 구조를 도시하는 GOP 구성도.FIG. 6 is a GOP diagram illustrating a GOP structure of compressed video content arbitrarily edited in an edited GOP array unit according to another embodiment of the present invention. FIG.
도 7은 본 발명의 다른 실시예에 따라 편집 GOP 어레이 단위로 임의편집된 압축 동영상 컨텐츠의 GOP 구조를 도시하는 GOP 구성도.7 is a GOP diagram illustrating a GOP structure of compressed video content arbitrarily edited in an edited GOP array unit according to another embodiment of the present invention.
도 8은 본 발명에 따른 임의편집 압축 동영상 컨텐츠 제공방법을 도시하는 처리흐름도.8 is a flowchart illustrating a method for providing arbitrary edited compressed video content according to the present invention.
[부호의 설명][Description of the code]
동영상 컨텐츠 제공 시스템 : 100 동영상 컨텐츠 저장부 : 110Video content providing system: 100 Video content storage: 110
동영상 컨텐츠 제공부 : 120 비디오 객체 분석부 : 130Video content provider: 120 Video object analyzer: 130
안면 인식부 : 140 사용자 단말기 : 200-1, 200-2Facial recognition unit: 140 User terminal: 200-1, 200-2
본 명세서 및 청구범위에 사용된 용어나 단어는 통상적이거나 사전적인 의미로 한정 해석되어서는 안되며, 발명자는 자신의 발명을 최선의 방법으로 설명하기 위해 용어와 개념을 정의할 수 있는 원칙에 입각하여 본 발명의 기술적 사상에 부합하는 의미와 개념으로 해석되어야 한다.The terms or words used in this specification and claims are not to be construed as limiting in their usual or dictionary meanings, and the inventors shall refer to the principles on which terms and concepts may be defined in order to best explain their inventions. It should be interpreted as meanings and concepts corresponding to the technical spirit of the invention.
따라서, 본 명세서에 기재된 실시예와 도면에 도시된 구성은 본 발명의 바람직한 일 실시예에 해당하며, 본 발명의 기술적 사상을 모두 대변하는 것이 아니므로 해당 구성은 본 발명의 출원시점에서 이를 대체할 다양한 균등물과 변형예가 있을 수 있다.Therefore, the configuration shown in the embodiments and drawings described in this specification corresponds to a preferred embodiment of the present invention, and does not represent all of the technical spirit of the present invention, the configuration will be replaced at the time of filing of the present invention. There may be various equivalents and variations.
명세서 전반에서 어떠한 부분이 어떤 구성요소를 “포함”한다는 기재는, 이에 대한 특별한 반대되는 기재가 없는 한 다른 구성요소를 제외하는 것이 아니라, 다른 구성요소를 더 포함할 수 있는 것을 의미한다. 또한 명세서에 기재된 “..모듈”, “..부”, “..시스템” 등의 용어는 적어도 하나의 기능이나 동작을 처리하는 단위를 의미하며, 이는 하드웨어나 소프트웨어 또는 하드웨어 및 소프트웨어의 결합으로 구현될 수 있으며, 하나의 장치 또는 각각 다른 장치에 포함될 수 있다.Throughout the specification, the description that a part “includes” an element means that the element may further include other elements, except for the absence of a special objection thereto. In addition, the terms “.. module”, “.. unit” and “.. system” described in the specification mean a unit that processes at least one function or operation, which is hardware or software or a combination of hardware and software. It may be implemented, and may be included in one device or each other device.
이하에서는 도면을 참조하여 본 발명에 따른 임의편집 압축 동영상 컨텐츠(content_compress) 제공 시스템을 설명한다. 도 1은 본 발명의 실시예에 따른 임의편집 압축 동영상 컨텐츠(content_compress) 제공 시스템을 도시한다. 임의편집 압축 동영상 컨텐츠(content_compress)는 동영상 컨텐츠 저장부(110) 및 동영상 컨텐츠 제공부(120)를 포함하여 구성된다.Hereinafter, a system for providing arbitrarily compressed video content (content_compress) according to the present invention will be described with reference to the accompanying drawings. 1 illustrates a system for providing arbitrary edited compressed video content (content_compress) according to an exemplary embodiment of the present invention. The random edited compressed video content (content_compress) includes a video content storage unit 110 and a video content providing unit 120.
실시예에 따라서 동영상 컨텐츠 저장부(110) 및 동영상 컨텐츠 제공부(120)는 단일의 서버 시스템에서 구현될 수도 있고, 서로 통신망을 통해 각각 별도의 서버로 구성되는 서버 시스템으로 구현될 수도 있다. 또한 실시예에 따라서 서버-클라이언트 시스템이 아닌 사용자 단말기(200-1, 200-2) 내부에 논리회로, 메모리, 저장장치 등의 하드웨어 또는 하드웨어를 제어하는 소프트웨어의 프로그램 코드 형태로 구현될 수도 있다. According to an exemplary embodiment, the video content storage unit 110 and the video content providing unit 120 may be implemented in a single server system, or may be implemented as a server system configured as a separate server through a communication network. Further, according to the exemplary embodiment, hardware, such as a logic circuit, a memory, a storage device, or the like, may be implemented in the form of program codes of software that controls hardware, such as a logic circuit, a memory, a storage device, and the like, rather than a server-client system.
동영상 컨텐츠 저장부(110)는 선행하는 프레임의 정보를 참조하지 않는 최선행 참조 프레임(frame_inter_first) 및 선행 또는 후행하는 프레임의 정보를 참조하는 복수의 참조 프레임(frame_inter)들을 포함하여 각각 구성되는 제 1 GOP(GOP_1st) 내지 제 N GOP(GOP_Nth)로(N은 2 이상의 정수) 구성되는 압축 동영상 컨텐츠(content_compress)를 저장하는 기능을 수행한다.The video content storage unit 110 includes a first reference frame (frame_inter_first) that does not refer to the information of the preceding frame and a plurality of reference frames (frame_inter) which refer to the information of the preceding or following frames, respectively. A compressed video content (content_compress) including a GOP (GOP_1st) to an Nth GOP (GOP_Nth) (N is an integer of 2 or more) is stored.
도 2는 압축 동영상 컨텐츠(content_compress)의 GOP 구조를 도시한다. GOP는 프레임의 집합으로서, 적어도 하나 이상의 참조 프레임(frame_inter)과 적어도 하나 이상의 참조 프레임(frame_inter)으로 구성된다. GOP에 포함되는 프레임의 수는 사용자가 지정할 수 있으며 예컨대 0.5초 정도의 시간을 갖도록 프레임 수를 정할 수 있다.2 illustrates a GOP structure of compressed video content (content_compress). A GOP is a set of frames and is composed of at least one reference frame (frame_inter) and at least one reference frame (frame_inter). The number of frames included in the GOP can be specified by the user. For example, the number of frames can be determined to have a time of about 0.5 seconds.
참조 프레임(frame_inter)은 그 자체가 해당 프레임에 대해 모든 픽셀의 밝기, 색상 등의 데이터를 포함하기 때문에 데이터의 크기가 크다. MPEG 등의 동영상 압축 표준에 따르면 I 프레임(Infra Frame)과 후술하는 IDR 프레임(Instantaneous Decoder Refresh Frame)이 참조 프레임(frame_inter)에 해당한다. Since the reference frame (frame_inter) itself contains data such as brightness, color, etc. of all the pixels for the frame, the size of the data is large. According to a video compression standard such as MPEG, an I frame (Infra Frame) and an IDR frame (Instantaneous Decoder Refresh Frame) to be described later correspond to a reference frame (frame_inter).
참조 프레임(frame_inter)은 예컨대 선행하는 참조 프레임(frame_inter)에서 변화된 픽셀들 또는 VOP(Video Object Plane)에 대한 정보만을 포함하기 때문에 데이터의 크기가 적다. 동영상 컨텐츠에서 하나의 프레임에서 다음 프레임은 시간적으로 매우 짧기 때문에 연속하는 프레임들간에는 데이터의 변화가 매우 적기 때문에 변화된 데이터만 저장하는 것이 데이터의 크기면에서 효율적이다. MPEG 등의 동영상 압축 표준에 따르면 P 프레임(Predicted Frame) 및 B 프레임(Bidirectional Frame)이 참조 프레임(frame_inter)에 해당한다. P 프레임은 선행하는 I 프레임 또는 다른 P 프레임을 참조하고, B 프레임은 선행하는 프레임들 및 후행하는 프레임들을 모두 참조하기 때문에 데이터의 크기가 P 프레임보다도 작은 것이 일반적이다.Since the reference frame frame_inter includes only information about pixels or video object planes (VOPs) changed in the preceding reference frame (frame_inter), the data is small in size. Since the next frame in the video content is very short in time, the change of data between the successive frames is very small. Therefore, storing only the changed data is efficient in terms of data size. According to a video compression standard such as MPEG, P frames (Predicted Frame) and B frames (Bidirectional Frame) correspond to the reference frame (frame_inter). Since the P frame refers to the preceding I frame or another P frame, and the B frame refers to both the preceding frames and the following frames, it is common that the size of the data is smaller than the P frame.
도 2의 실시예의 경우 제 k GOP(GOP_kth)는 순서대로 I 프레임, B 프레임, B 프레임, P 프레임, B 프레임, B 프레임으로 구성되며, 최선행 참조 프레임(frame_inter_first)인 I 프레임과, 참조 프레임(frame_inter)인 4개의 B 프레임들 및 1개의 P 프레임들로 구성된다. 이 경우 동영상 컨텐츠의 재생시 I 프레임의 데이터를 통해 I 프레임이 구현되고, 다음으로 I 프레임에서 세 번째 후행 프레임인 P 프레임이 I 프레임을 참조하고, 다음으로 I 프레임에서 두 번째 후행 프레임인 B 프레임이 선행하는 I 프레임 및 후행하는 P 프레임을 참조하고, 다음으로 I 프레임에서 세 번째 후행 프레임인 B 프레임이 선행하는 I 프레임 및 B 프레임과 후행하는 P 프레임을 참조하는 방식으로 동영상 컨텐츠가 재생된다.In the case of the embodiment of FIG. 2, the k th GOP (GOP_kth) is composed of an I frame, a B frame, a B frame, a P frame, a B frame, and a B frame in order, an I frame that is a best-order reference frame (frame_inter_first), and a reference frame. It consists of four B frames and one P frame (frame_inter). In this case, the I frame is implemented through the data of the I frame when the video content is played. Then, the P frame, which is the third trailing frame in the I frame, refers to the I frame, and then the B frame, which is the second trailing frame in the I frame. The video content is played back in such a way that it refers to the preceding I frame and the following P frame, and then the B frame, which is the third trailing frame in the I frame, refers to the preceding I frame and B frame and the following P frame.
한편 본 발명의 동영상 컨텐츠 저장부(110)는 비압축 동영상 컨텐츠를 압축 동영상 컨텐츠(content_compress)로 변환하여 저장하는 기능을 포함하여 구현될 수도 있고, 이미 압축된 압축 동영상 컨텐츠(content_compress)를 저장하도록 구현될 수도 있다. Meanwhile, the video content storage unit 110 of the present invention may be implemented to include a function of converting and storing uncompressed video content into compressed video content (content_compress), or to store compressed video content (content_compress) that is already compressed. May be
동영상 컨텐츠 제공부(120)는, 통신망을 통해 압축 동영상 컨텐츠(content_compress)를 GOP 단위로 사용자 단말기(200-1, 200-2)에게 제공하되, 적어도 일부의 GOP들은 순서에 따라 사용자 단말기(200-1, 200-2)에게 제공하고, 제 k GOP(GOP_kth)를(k는 2 이상, N-1 이하의 정수) 제 k-1 GOP(GOP_k-1th)에 선행하거나 제 k+1 GOP(GOP_k+1th)에 후행하는 임의의 순서로 사용자 단말기(200-1, 200-2)에게 제공하는 기능을 수행한다.The video content providing unit 120 provides the compressed video content (content_compress) to the user terminals 200-1 and 200-2 in units of GOPs through a communication network, and at least some of the GOPs are in order. 1, 200-2), and precedes k-th GOP (GOP_kth) (k is an integer of 2 or more and N-1 or less) k-1 GOP (GOP_k-1th) or k + 1 GOP (GOP_k). + 1th) to the user terminal (200-1, 200-2) in any order that follows.
동영상 컨텐츠 제공부(120)는 서버-클라이언트 시스템의 경우 인터넷, 인트라넷 등의 통신망을 통해 동영상 컨텐츠를 제공하고, 단일의 클라이언트 상에 구현된 경우 통신망이 아닌 시스템 내부적인 데이터 처리를 통해 동영상 컨텐츠를 제공할 수 있다.The video content providing unit 120 provides a video content through a communication network such as the Internet or an intranet in the case of a server-client system, and provides a video content through system internal data processing rather than a communication network when implemented on a single client. can do.
동영상 컨텐츠 제공부(120)는 적어도 일부의 GOP들은 원본 동영상 컨텐츠의 순서에 따라 사용자 단말기(200-1, 200-2)에게 제공하고, 적어도 다른 일부의 GOP들은 원본 동영상 컨텐츠의 순서와 다르게 사용자 단말기(200-1, 200-2)에게 제공한다. 이때 순서에 따라 제공되는 비편집 GOP들과 순서와 다르게 제공되는 편집 GOP들의 비율은 실시예에 따라 다양하게 적용될 수 있다. 예컨대 원본 동영상 컨텐츠에서 많은 변화를 주고자 하는 경우 편집 GOP 대 비편집 GOP의 비율은 90%일 수도 있고, 반대로 원본 동영상 컨텐츠의 맥락을 유지하면서 일부 변화를 주고자 하는 경우 편집 GOP 대 비편집 GOP의 비율은 10%로 설정할 수 있다. 실시예에 따라서는 편집 GOP 대 비편집 GOP의 비율을 임의의 비율로 압축 동영상 컨텐츠(content_compress) 제공 시스템이 결정할 수 있다.The video content providing unit 120 provides at least some of the GOPs to the user terminals 200-1 and 200-2 in the order of the original video content, and at least some other GOPs are different from the order of the original video content. To (200-1, 200-2). In this case, the ratio of the non-edited GOPs provided in the order and the edited GOPs provided in the order may be variously applied according to an embodiment. For example, if you want to make a lot of changes in the original video content, the ratio of edited GOP to non-edited GOP may be 90% .In contrast, if you want to make some changes while maintaining the context of the original video content, The ratio can be set at 10%. According to an exemplary embodiment, the compressed video content (content_compress) providing system may determine the ratio of the edited GOP to the non-edited GOP at an arbitrary ratio.
사용자의 부가적인 편집 GOP의 지정이 요구되는 번거로움을 제거하기 위해 편집 GOP가 되는 GOP의 결정은 사용자 결정이 아닌 압축 동영상 컨텐츠(content_compress) 제공 시스템이 임의로 또는 이미지 프로세싱을 통한 소정의 알고리즘에 따라 자동으로 결정하도록 구성한다.In order to eliminate the need to specify additional editing GOPs by the user, the determination of the GOPs to be edited GOPs is automatically performed by the system for providing compressed video content (content_compress) rather than user decision or by a predetermined algorithm through image processing. Configure to decide.
임의의 제 k GOP(GOP_kth)는 도 3에 도시된 바와 같이 선행하는 제 k-1 GOP(GOP_k-1th)의 앞쪽에 인접하여 배치되거나 그 보다 더욱 선행하는 예컨대 k-20 GOP(GOP_k-20th)의 앞쪽에 제 k-1 GOP(GOP_k-1th)와 이격되어 배치될 수 있다. 반대로 임의의 제 k GOP(GOP_kth)는 도 4에 도시된 바와 같이 후행하는 제 k+1 GOP(GOP_k+1th)의 뒤쪽에 인접하여 배치되거나 그 보다 더욱 후행하는 예컨대 k+20 GOP(GOP_k+20th)의 뒤쪽에 제 k+1 GOP(GOP_k+1th)와 이격되어 배치될 수 있다. Any kth GOP (GOP_kth) may be disposed adjacent to or in front of a preceding k-1 GOP (GOP_k-1th) as shown in FIG. 3, for example, a k-20 GOP (GOP_k-20th). It may be spaced apart from the k-1th GOP (GOP_k-1th) in front of the. Conversely, any k GOP (GOP_kth) is arranged adjacent to or behind the following k + 1 GOPs (GOP_k + 1th) as shown in FIG. 4, for example k + 20 GOP (GOP_k + 20th). ) May be spaced apart from the k + 1th GOP (GOP_k + 1th) at the rear of the.
한편 동영상 컨텐츠 제공부(120)는 단일의 압축 동영상 컨텐츠(content_compress)에 대해서 임의편집을 수행할 수도 있고 복수의 압축 동영상 컨텐츠(content_compress)들에 대해서 임의편집을 수행할 수도 있다. 복수의 압축 동영상 컨텐츠(content_compress)들에 대해 임의편집을 하는 경우 제 1 압축 동영상 컨텐츠(content_compress_1st)가 150개의 GOP들들로 구성되고, 제 2 압축 동영상 컨텐츠(content_compress_2nd)가 300개의 GOP들로 구성되고, 제 압축 3 동영상 컨텐츠(content_compress_3rd)가 200개의 GOP들로 구성된다면, 동영상 컨텐츠 제공부(120)는 임의의 압축 동영상 컨텐츠 순서에 따라 총 650 GOP들로 구성된 압축 동영상 컨텐츠(content_compress)로 간주하여 임의편집을 수행하여 사용자 단말기(200-1, 200-2)에게 제공할 수 있다. 이러한 실시예에 따르면 임의편집된 동영상 컨텐츠의 의외성이 보다 증가되는 효과를 제공한다.Meanwhile, the video content providing unit 120 may randomly edit a single compressed video content (content_compress) or may arbitrarily edit a plurality of compressed video contents (content_compress). When randomly editing a plurality of compressed video contents (content_compress), the first compressed video content (content_compress_1st) is composed of 150 GOPs, and the second compressed video content (content_compress_2nd) is composed of 300 GOPs. If the third compressed video content (content_compress_3rd) is composed of 200 GOPs, the video content providing unit 120 regards the compressed video content (content_compress) composed of a total of 650 GOPs according to an arbitrary compressed video content order and edits randomly. This may be provided to the user terminals 200-1 and 200-2. According to this embodiment, the unexpectedness of the randomly edited video content is increased.
한편, 최근의 동영상 압축 기술은 이미지 프로세싱을 통한 장면변환 검출을 통해 장면변환 지점을 GOP의 최선행 참조 프레임(frame_inter_first)을 위치시키도록 발전하였다. 이는 장면변환이 GOP 중간에서 일어나는 경우 선행 프레임과 후행 프레임간의 프레임 코릴레이션(frame correlation)이 낮아지게 되고, 결국 참조 프레임(frame_inter)인 B 프레임 또는 P 프레임의 데이터양을 증가시키는 문제가 있기 때문이다. 본 발명의 실시예에서는 이 점에 착안하여 장면전환 검출을 통해 장면전환 지점을 최선행 참조 프레임(frame_inter_first)으로 정의한 GOP를 편집 GOP로 활용함으로써 임의편집임에도 불구하고 장면단위로 편집되어 보다 자연스러운 편집 동영상 컨텐츠를 자동으로 생성하는 효과를 제공할 수 있다. 이를 위해 동영상 컨텐츠 제공부(120)는 장면변환 검출을 통해 검출된 장면변환 지점에 위치한 I 프레임을 최선행 참조 프레임(frame_inter_first)으로 포함하는 GOP를 제 k GOP(GOP_kth)로 선택하도록 구성하는 것이 바람직하다.On the other hand, the recent video compression technology has been developed to position the scene reference point (frame_inter_first) of the GOP through the scene change detection through image processing. This is because when the scene transformation occurs in the middle of the GOP, frame correlation between the preceding frame and the following frame is lowered, and thus, there is a problem of increasing the data amount of the B frame or P frame, which is the reference frame (frame_inter). . In the embodiment of the present invention, by focusing on this point, the scene change point is edited by scene unit by using the GOP that defines the transition point as the best reference frame (frame_inter_first) as the editing GOP through scene change detection, so that it is edited in units of scenes. It can provide the effect of automatically generating content. To this end, the video content providing unit 120 may be configured to select a GOP including the I frame located at the scene change point detected through scene change detection as the best reference frame (frame_inter_first) as the k th GOP (GOP_kth). Do.
한편, 동영상 컨텐츠를 임의편집하여 사용자에게 제공하는 방식은 첫째, 동영상 컨텐츠 제공부(120)가 압축 동영상 컨텐츠(content_compress) 파일을 변경하지 않고 GOP의 순서만 변경하여 동영상 전송 포맷으로 변경하여 제공할 수도 있고, 둘째 동영상 컨텐츠 제공부(120)가 압축 동영상 컨텐츠(content_compress) 파일을 임의의 편집결과에 따라 압축 동영상 컨텐츠(content_compress) 파일의 GOP 순서를 변경하여 저장하거나 버퍼 메모리에 변경된 GOP 순서에 따라 압축 동영상 컨텐츠(content_compress) 파일의 전체 또는 일부의 복사본을 업로드하여 제공할 수도 있다.Meanwhile, a method of arbitrarily editing video content and providing the same to a user may be provided. First, the video content providing unit 120 may change the order of the GOP without changing the compressed video content (content_compress) file and change the order of the GOP to provide a video transmission format. Second, the video content providing unit 120 changes the GOP order of the compressed video content (content_compress) file and stores the compressed video content (content_compress) file according to an arbitrary editing result or compresses the video according to the changed GOP order in the buffer memory. A copy of all or part of the content (content_compress) file may be uploaded and provided.
전자의 경우라면 문제가 없지만 후자의 경우라면 GOP 순서의 변경에 따라 다음과 같은 데이터 유실의 문제가 발생할 수 있다. 도 2에 도시된 바에 따르면 제 k GOP(GOP_kth)의 최선행 참조 프레임(frame_inter_first)인 I 프레임의 직전 프레임인 제 k-1 GOP(GOP_k-1th)의 최후행 프레임은 양방향 참조 프레임(frame_inter)인 B 프레임이다. 도 3에 도시된 바와 같이 동영상 컨텐츠 제공부(120)의 임의편집 결과 제 k 프레임(GOP_kth)이 다른 위치로 옮겨지게 된다면 제 k-1 GOP(GOP_k-1th)의 최후행 프레임인 B 프레임은 참조할 후행 프레임이 유실되어 해당 프레임 이미지를 완전하게 생성하지 못하는 문제가 발생할 수 있다.In the former case, there is no problem. In the latter case, the following data loss problem may occur according to the change of the GOP order. As shown in FIG. 2, the last frame of the k-1 GOP (GOP_k-1th), which is the frame immediately before the I frame that is the best reference frame (frame_inter_first) of the k GOP (GOP_kth), is the bidirectional reference frame (frame_inter). B frame. As shown in FIG. 3, if the k-th frame GOP_kth is moved to another position as a result of the arbitrary editing of the video content providing unit 120, the B frame, which is the last frame of the k-1 GOP GP_k-1th, is referred to. The trailing frame may be lost and the frame image may not be completely created.
본 발명의 실시예에 따르면 상기의 문제를 해결하기 위해 선행 GOP의 선행 프레임들이 참조하진 않는 IDR 프레임을 최선행 비참조 프레임으로 포함하는 GOP를 제 k GOP(GOP_kth)로 선택함으로써 일부 프레임에서의 데이터 유실을 방지하는 효과를 제공할 수 있다.According to an embodiment of the present invention, in order to solve the above problem, data in some frames is selected by selecting a GOP including the IDR frame, which is not referenced by the preceding frames of the preceding GOP, as the best-order non-reference frame as the kth GOP (GOP_kth). It can provide the effect of preventing the loss.
한편, 하나의 GOP는 수십 분의 1초 또는 수 초 정도로 매우 짧기 때문에 단일의 GOP만 임의편집되어 다른 위치로 이동하는 경우 원본 동영상 컨텐츠의 맥락을 전달하기가 불가능할 뿐 아니라 사용자에게는 노이즈로 인식될 수 있는 문제가 있다. 이러한 문제를 방지하기 위해 본 발명의 실시예에 따른 동영상 컨텐츠 제공부(120)는, 제 k GOP(GOP_kth) 내지 제 k+m GOP(GOP_k+mth)(m은 10 이상의 정수)로 정의되는 편집 GOP 어레이(array_GOP_edit)를 제 k-1 GOP(GOP_k-1th)에 선행하거나 k+m+1 GOP(GOP_k+m+1th)에 후행하도록 임의의 순서로 사용자 단말기(200-1, 200-2)에게 제공하는 것이 바람직하다. 이때 적어도 11개의 GOP가 하나의 편집 GOP 어레이(array_GOP_edit)로 설정되어야지 임의편집되는 대상이 적어도 1초 내지 10초 정도의 길이를 갖기 때문에 m은 10 이상으로 설정하는 것이 바람직하다.On the other hand, a single GOP is very short, such as a few tenths or a few seconds, so if only a single GOP is randomly edited and moved to another location, it may not be possible to convey the context of the original video content and may be perceived as noise to the user. There is a problem. In order to prevent such a problem, the video content providing unit 120 according to an embodiment of the present invention is edited as defined by k th GOP (GOP_kth) to k + m GOP (GOP_k + mth) (m is an integer of 10 or more). The user terminals 200-1 and 200-2 in any order to precede the GOP array array_GOP_edit or to follow the k-1 m GOP (GOP_k-1 th) or the k + m + 1 GOP (GOP_k + m + 1 th). It is desirable to provide to. In this case, at least 11 GOPs should be set to one edit GOP array (array_GOP_edit). Since the object to be arbitrarily edited has a length of at least 1 second to 10 seconds, m is preferably set to 10 or more.
도 5는 압축 동영상 컨텐츠(content_compress)의 GOP 구조를 도시하는 GOP 구성도이고, 도 6 및 도 7은 본 발명의 실시예에 따라 편집 GOP 어레이(array_GOP_edit) 단위로 임의편집된 압축 동영상 컨텐츠(content_compress)의 GOP 구조를 도시하는 GOP 구성도이다. 이러한 실시예에 따르면, 동영상 컨텐츠 제공부(120)는 제 3 GOP(GOP_3rd) 내지 제 5 GOP(GOP_5th)를 편집 GOP 어레이(array_GOP_edit)로 정의한다(k=3, m=2). 실시예에 따라 도 6에 도시된 바와 같이 편집 GOP 어레이(array_GOP_edit)는 인접 선행 GOP인 제 2 GOP(GOP_2nd)에 인접하여 선행하도록 배치되거나, 도 7에 도시된 바와 같이 편집 GOP 어레이(array_GOP_edit)는 인접 후행 GOP인 제 6 GOP(GOP_6th)에 이격되어 후행하도록 배치될 수 있다.5 is a GOP diagram illustrating a GOP structure of compressed video content (content_compress), and FIGS. 6 and 7 are compressed video content (content_compress) arbitrarily edited in an edited GOP array (array_GOP_edit) according to an embodiment of the present invention. A GOP configuration diagram showing the GOP structure of the. According to this embodiment, the video content providing unit 120 defines the third GOP (GOP_3rd) to the fifth GOP (GOP_5th) as an edited GOP array (array_GOP_edit) (k = 3, m = 2). According to an embodiment, as shown in FIG. 6, the editing GOP array array_GOP_edit is arranged to be adjacent to a second GOP (GOP_2nd) which is an adjacent preceding GOP, or as shown in FIG. 7, the editing GOP array (array_GOP_edit) may be The sixth GOP (GOP_6th), which is an adjacent trailing GOP, may be disposed to be trailed apart from each other.
이러한 실시예에 따르면, 적어도 10개 이상의 연속된 GOP들로 구성된 편집 GOP 어레이(array_GOP_edit) 단위로 편집되기 때문에 원본 동영상 컨텐츠의 맥락을 유지하면서 의외성을 높일 수 있는 임의편집된 동영상 컨텐츠를 사용자에게 제공하는 효과가 있다.According to this embodiment, since the edited GOP array (array_GOP_edit) of at least 10 consecutive GOPs are edited to provide users with randomly edited video content that can increase the unexpectedness while maintaining the context of the original video content. It is effective.
한편 편집 GOP 어레이(array_GOP_edit)들이 동일한 인물, 동일한 물건, 동일한 배경 등의 동일한 VOP(Video Object Plane)를 연속적으로 포함하는 GOP들로 구성되는 것이 보다 바람직하다. 이를 위해 임의편집 압축 동영상 컨텐츠(content_compress) 제공 시스템은 최선행 참조 프레임(frame_inter_first) 및 참조 프레임(frame_inter)의 이미지의 VOP(Video Object Plane)들을 분석하는 비디오 객체 분석부(130)를 포함한다. 이때 동영상 컨텐츠 제공부(120)는 동일한 VOP가 연속적으로 포함된 제 k GOP(GOP_kth) 내지 제 k+m GOP(GOP_k+mth)를 편집 GOP 어레이(array_GOP_edit)로 정의하도록 구성할 수 있다.On the other hand, the editing GOP arrays (array_GOP_edit) is more preferably composed of GOPs containing the same video object plane (VOP), such as the same person, the same object, the same background. To this end, the random edited compressed video content (content_compress) providing system includes a video object analyzer 130 for analyzing VOPs (Video Object Planes) of the image of the best reference frame (frame_inter_first) and the reference frame (frame_inter). In this case, the video content providing unit 120 may be configured to define a kth GOP (GOP_kth) to a k + m GOP (GOP_k + mth) including the same VOP consecutively as an edited GOP array (array_GOP_edit).
또 다른 실시예로는 편집 GOP 어레이(array_GOP_edit)들이 안면인식을 통해 동일한 인물을 연속적으로 포함하는 GOP들로 구성되는 것이 바람직하다. 이를 위해 임의편집 압축 동영상 컨텐츠(content_compress) 제공 시스템은 최선행 참조 프레임(frame_inter_first) 및 참조 프레임(frame_inter)의 이미지에 포함된 인물의 안면을 인식하여 안면 식별자(id_facial)와 연관시키는 안면 인식부(140)를 더 포함한다. 이때 동영상 컨텐츠 제공부(120)는 동일한 안면 식별자(id_facial)가 연속적으로 연관된 제 k GOP(GOP_kth) 내지 제 k+m(GOP_k+mth) GOP를 편집 GOP 어레이(array_GOP_edit)로 정의하는 것을 특징으로 한다.In another embodiment, it is preferable that the editing GOP arrays (array_GOP_edit) are composed of GOPs that continuously include the same person through face recognition. To this end, the arbitrary editing compressed video content (content_compress) providing system recognizes the face of the person included in the image of the best line reference frame (frame_inter_first) and the reference frame (frame_inter) and associates it with the face identifier (id_facial). More). In this case, the video content providing unit 120 defines k-th GOP (GOP_kth) to k + m (GOP_k + mth) GOPs continuously associated with the same face identifier (id_facial) as an edited GOP array (array_GOP_edit). .
이때 안면 식별자(id_facial)는 특별히 제한되지 않으며 내부 메모리 또는 외부 서버를 통해 사전에 인식된 안면과 이에 해당하는 인물의 이름, 별칭, 식별자를 찾아내서 이를 안면 식별자(id_facial)로 사용할 수도 있고, 인식된 안면의 특징점에 따라 특정한 순서 또는 임의로 부여되는 식별자일 수도 있다.At this time, the facial identifier (id_facial) is not particularly limited, and the name, alias, and identifier of a previously recognized face and its corresponding person can be found through the internal memory or an external server, and used as the face identifier (id_facial). It may be an identifier assigned in a specific order or arbitrarily according to facial features.
이하에서는 도 8을 참조하여 본 발명에 따른 임의편집 압축 동영상 컨텐츠(content_compress) 제공방법을 설명한다.Hereinafter, a method for providing arbitrary edited compressed video content (content_compress) according to the present invention will be described with reference to FIG. 8.
먼저, 동영상 컨텐츠 제공 시스템(100)이 선행하는 프레임의 정보를 참조하지 않는 최선행 참조 프레임(frame_inter_first) 및 선행 또는 후행하는 프레임의 정보를 참조하는 복수의 참조 프레임(frame_inter)들을 포함하여 각각 구성되는 제 1 GOP(GOP_1st) 내지 제 N GOP(GOP_Nth)로(N은 2 이상의 정수) 구성되는 압축 동영상 컨텐츠(content_compress)를 저장하는 제 1 단계(S10)를 수행한다.First, the video content providing system 100 includes a best reference frame (frame_inter_first) that does not refer to the information of the preceding frame and a plurality of reference frames (frame_inter) which refer to the information of the preceding or following frames, respectively. A first step S10 of storing compressed video content (content_compress) including first GOP (GOP_1st) to Nth GOP (GOP_Nth) (N is an integer of 2 or more) is performed.
이때 참조 프레임(frame_inter_first)은 I 프레임 또는 IDR 프레임일 수 있고, 참조 프레임(frame_inter)은 B 프레임 또는 P 프레임일 수 있다.In this case, the reference frame (frame_inter_first) may be an I frame or an IDR frame, and the reference frame (frame_inter) may be a B frame or a P frame.
다음으로, 동영상 컨텐츠 제공 시스템(100)이 통신망을 통해 사용자 단말기(200-1, 200-2)로부터 압축 동영상 컨텐츠(content_compress)의 임의편집을 요청하는 임의편집 요청을 수신하는 제 2 단계(s20)를 수행한다.Next, a second step (s20) in which the video content providing system 100 receives a random edit request for requesting arbitrary editing of the compressed video content (content_compress) from the user terminals 200-1 and 200-2 through a communication network. Perform
마지막으로, 동영상 컨텐츠 제공 시스템(100)이 통신망을 통해 압축 동영상 컨텐츠(content_compress)를 GOP 단위로 사용자 단말기(200-1, 200-2)에게 제공하되, 적어도 일부의 GOP들은 순서에 따라 사용자 단말기(200-1, 200-2)에게 제공하고, 제 k GOP(GOP_kth)를(k는 2 이상, N-1 이하의 정수) 제 k-1 GOP(GOP_k-1th)에 선행하거나 제 k+1 GOP(GOP_k+1th)에 후행하는 임의의 순서로 사용자 단말기(200-1, 200-2)에게 제공하는 제 3 단계(s30)를 수행한다.Finally, the video content providing system 100 provides the compressed video content (content_compress) to the user terminals 200-1 and 200-2 in units of GOPs through a communication network, and at least some of the GOPs are provided in order. 200-1, 200-2, and precede kth G-1 GOP (GOP_k-1th) with kth GOP (GOP_kth), where k is an integer of 2 or more and N-1 or less. A third step s30 is provided to the user terminals 200-1 and 200-2 in any order following GOP_k + 1th.
한편 장면전환된 지점에서 자동으로 편집이 이루어지도록 제 3 단계(s30)는 동영상 컨텐츠 제공 시스템(100)이 장면변환 검출을 통해 검출된 장면변환 지점에 위치한 I 프레임을 최선행 참조 프레임(frame_inter_first)으로 포함하는 GOP를 제 k GOP(GOP_kth)로 선택하는 것이 바람직하다.In the third step s30, the video content providing system 100 converts the I frame located at the scene change point detected by the scene change detection into the best reference frame (frame_inter_first). It is preferable to select the containing GOP as the kth GOP (GOP_kth).
제k GOP(GOP_kth)에 선행하는 제 k-1 GOP(GOP_k-1th)의 최후행 프레임이 제 k GOP(GOP_kth)의 최선행 참조 프레임(frame_inter_first)의 이동으로 인한 데이터 유실을 방지하기 위해, 제 3 단계(s30)는 동영상 컨텐츠 제공 시스템(100)이 IDR 프레임(Instantaneous Decoder Refresh)을 최선행 참조 프레임(frame_inter_first)으로 포함하는 GOP를 제 k GOP(GOP_kth)로 선택하는 것이 바람직하다.In order to prevent data loss due to the movement of the best-order reference frame (frame_inter_first) of the k-th GOP (GOP_kth) prior to the k-th GOP (GOP_kth), In step S30, the video content providing system 100 selects a GOP including an IDR frame (Instantaneous Decoder Refresh) as the best-order reference frame (frame_inter_first) as the k th GOP (GOP_kth).
자동으로 편집되는 편집 GOP들이 적정한 길이를 갖도록 제 3 단계(s30)는 동영상 컨텐츠 제공 시스템(100)이 제 k GOP(GOP_kth) 내지 제 k+m GOP(GOP_k+mth)(m은 10 이상의 정수)를 편집 GOP 어레이(array_GOP_edit)로 정의하고, 편집 GOP 어레이(array_GOP_edit)를 제 k-1 GOP(GOP_k-1th)에 선행하거나 k+m+1 GOP(GOP_k+m+1th)에 후행하도록 임의의 순서로 사용자 단말기(200-1, 200-2)에게 제공하는 것이 바람직하다.In the third step s30, the video content providing system 100 performs k GOP (GOP_kth) to k + m GOP (GOP_k + mth) (m is an integer of 10 or more) so that the editing GOPs which are automatically edited have an appropriate length. Is defined as the editing GOP array (array_GOP_edit), and the arbitrary order to precede the editing GOP array (array_GOP_edit) to k-1 GOP (GOP_k-1th) or to k + m + 1 GOP (GOP_k + m + 1th). It is desirable to provide to the user terminal (200-1, 200-2).
편집 GOP 어레이(array_GOP_edit) 단위로 자동편집하는 실시예에 있어서 동일한 VOP가 연속적으로 포함된 GOP들을 편집 GOP 어레이(array_GOP_edit) 단위로 편집하기 위해 제 1 단계(s10)는 동영상 컨텐츠 제공 시스템(100)이 최선행 참조 프레임(frame_inter_first) 및 참조 프레임(frame_inter)의 이미지의 VOP(Video Object Plane)들을 분석하는 제 1 부단계(s11)를 더 포함하고, 제 2 단계(s20)는 동영상 컨텐츠 시스템이 동일한 VOP가 연속적으로 포함된 제 k GOP(GOP_kth) 내지 제 k+m GOP(GOP_k+mth)를 편집 GOP 어레이(array_GOP_edit)로 정의하는 것이 바람직하다.In an embodiment of auto editing in an edited GOP array (array_GOP_edit) unit, the first step (s10) is performed by the video content providing system 100 to edit GOPs including the same VOP consecutively in an edited GOP array (array_GOP_edit) unit. The method further includes a first substep s11 for analyzing the VOPs (Video Object Planes) of the image of the best row reference frame (frame_inter_first) and the reference frame (frame_inter), and the second step (s20) includes a VOP having the same video content system. It is preferable to define the kth GOP (GOP_kth) to k + m GOP (GOP_k + mth) including the consecutively as an edited GOP array (array_GOP_edit).
편집 GOP 어레이(array_GOP_edit) 단위로 자동편집하는 실시예에 있어서 동일한 인물이 연속적으로 포함된 GOP들을 편집 GOP 어레이(array_GOP_edit) 단위로 편집하기 위해 제 1 단계(s10)는 동영상 컨텐츠 제공 시스템(100)이 최선행 참조 프레임(frame_inter_first) 및 참조 프레임(frame_inter)의 이미지에 포함된 인물의 안면을 인식하여 안면 식별자(id_facial)와 연관시키는 제 2 부단계(s12)를 더 포함하고, 제 2 단계(s20)는 동영상 컨텐츠 시스템이 동일한 안면 식별자(id_facial)가 연속적으로 연관된 제 k GOP(GOP_kth) 내지 제 k+m GOP(GOP_k+mth)를 편집 GOP 어레이(array_GOP_edit)로 정의하는 것이 바람직하다.In an embodiment of auto editing in an edited GOP array (array_GOP_edit) unit, the first step (s10) is performed by the video content providing system 100 to edit GOPs containing the same person consecutively in an edited GOP array (array_GOP_edit) unit. And further including a second substep s12 for recognizing the face of the person included in the image of the best row reference frame frame_inter_first and the reference frame frame_inter and associating it with the face identifier id_facial. Preferably, the video content system defines a kth GOP (GOP_kth) to a k + m GOP (GOP_k + mth) continuously associated with the same face identifier (id_facial) as an edited GOP array (array_GOP_edit).
본 명세서에서의 발명의 설명은 바람직한 실시예를 설명하는 것으로, 본 발명은 이러한 실시예에 한정되지 않는다. 본 발명이 속하는 기술분야에서 통상의 지식을 가진 자는 본 발명의 기술적 사상을 벗어나지 않는 범위 내에서 이상의 실시예에 대한 다양한 변경과 수정이 가능하고, 본 발명의 기술적 사상은 이러한 다양한 변경과 수정을 모두 포함한다.The description of the invention herein describes the preferred embodiment, and the invention is not limited to this embodiment. Those skilled in the art to which the present invention pertains can make various changes and modifications to the above embodiments without departing from the technical spirit of the present invention, the technical idea of the present invention is to make all such various changes and modifications Include.

Claims (14)

  1. 선행하는 프레임의 정보를 참조하지 않는 최선행 비참조 프레임 및 선행 또는 후행하는 프레임의 정보를 참조하는 복수의 참조 프레임들을 포함하여 각각 구성되는 제 1 GOP 내지 제 N GOP로(N은 2 이상의 정수) 구성되는 압축 동영상 컨텐츠를 저장하는 동영상 컨텐츠 저장부; 및The first GOP to the Nth GOPs each configured to include a best-order non-reference frame not referring to the information of the preceding frame and a plurality of reference frames referencing the information of the preceding or following frame (where N is an integer of 2 or more). A video content storage unit for storing the compressed video content; And
    통신망을 통해 상기 압축 동영상 컨텐츠를 GOP 단위로 사용자 단말기에게 제공하되, 적어도 일부의 GOP들은 순서에 따라 상기 사용자 단말기에게 제공하고, 제 k GOP를(k는 2 이상, N-1 이하의 정수) 제 k-1 GOP에 선행하거나 제 k+1 GOP에 후행하는 임의의 순서로 사용자 단말기에게 제공하는 동영상 컨텐츠 제공부;를 포함하여 구성되는 것을 특징으로 하는 임의편집 압축 동영상 컨텐츠 제공 시스템.The compressed video content is provided to the user terminal in units of GOPs through a communication network, and at least some of the GOPs are provided to the user terminal in order, and kth GOP (k is an integer of 2 or more and N-1 or less) is made. and a video content providing unit provided to the user terminal in any order preceding the k-1 GOP or following the k + 1 GOP.
  2. 제 1 항에 있어서,The method of claim 1,
    상기 최선행 비참조 프레임은 I 프레임(Infra Frame)이고, 상기 참조 프레임은 P 프레임(Predicted Frame) 및/또는 B 프레임(Bidirectional Frame)인 것을 특징으로 하는 임의편집 압축 동영상 컨텐츠 제공 시스템.And the best-order dereference frame is an I frame, and the reference frame is a P frame and / or a B-directional frame.
  3. 제 2 항에 있어서, 상기 동영상 컨텐츠 제공부는,The method of claim 2, wherein the video content providing unit,
    장면변환 검출을 통해 검출된 장면변환 지점에 위치한 I 프레임을 상기 최선행 비참조 프레임으로 포함하는 GOP를 상기 제 k GOP로 선택하는 것을 특징으로 하는 임의편집 압축 동영상 컨텐츠 제공 시스템.And a GOP including the I frame located at the scene change point detected through scene change detection as the k-th non-reference frame, as the k-th GOP.
  4. 제 2 항에 있어서, 상기 동영상 컨텐츠 제공부는,The method of claim 2, wherein the video content providing unit,
    IDR 프레임(Instantaneous Decoder Refresh)을 상기 최선행 비참조 프레임으로 포함하는 GOP를 상기 제 k GOP로 선택하는 것을 특징으로 하는 임의편집 압축 동영상 컨텐츠 제공 시스템.And a GOP including an InstantRane Decoder Refresh (IDR) frame as the best non-reference frame as the k-th GOP.
  5. 제 1 항에 있어서, 상기 동영상 컨텐츠 제공부는,According to claim 1, The video content providing unit,
    상기 제 k GOP 내지 상기 제 k+m GOP(m은 10 이상의 정수)로 정의되는 편집 GOP 어레이를 상기 제 k-1 GOP에 선행하거나 k+m+1 GOP에 후행하도록 임의의 순서로 사용자 단말기에게 제공하는 것을 특징으로 하는 임의편집 압축 동영상 컨텐츠 제공 시스템.The user terminal in any order to precede the k-1 GOP or to follow the k + m + 1 GOP to the editing GOP array defined by the k th GOP to the k + m GOP (m is an integer of 10 or more) Random edited compressed video content providing system, characterized in that provided.
  6. 제 5 항에 있어서, 상기 임의편집 압축 동영상 컨텐츠 제공 시스템은,The system of claim 5, wherein the random edited compressed video content providing system comprises:
    상기 최선행 비참조 프레임 및 상기 참조 프레임의 이미지의 VOP(Video Object Plane)들을 분석하는 비디오 객체 분석부;를 더 포함하고,And a video object analyzer configured to analyze video object planes (VOPs) of the best-order non-reference frame and the image of the reference frame.
    상기 동영상 컨텐츠 제공부는, 동일한 VOP가 연속적으로 포함된 상기 제 k GOP 내지 상기 제 k+m GOP를 상기 편집 GOP 어레이로 정의하는 것을 특징으로 하는 임의편집 압축 동영상 컨텐츠 제공 시스템. And the video content providing unit defines the k-th GOP to the k + m GOP including consecutively the same VOP as the edited GOP array.
  7. 제 5 항에 있어서, 상기 임의편집 압축 동영상 컨텐츠 제공 시스템은,The system of claim 5, wherein the random edited compressed video content providing system comprises:
    상기 최선행 비참조 프레임 및 상기 참조 프레임의 이미지에 포함된 인물의 안면을 인식하여 안면 식별자와 연관시키는 안면 인식부;를 더 포함하고,And a face recognizing unit recognizing a face of a person included in the image of the reference frame and the best non-reference frame and associating it with a face identifier.
    상기 동영상 컨텐츠 제공부는, 동일한 안면 식별자가 연속적으로 연관된 상기 제 k GOP 내지 상기 제 k+m GOP를 상기 편집 GOP 어레이로 정의하는 것을 특징으로 하는 임의편집 압축 동영상 컨텐츠 제공 시스템.And the moving picture contents providing unit defines the k th GOP to the k + m GOPs associated with the same face identifier consecutively as the edited GOP array.
  8. 동영상 컨텐츠 제공 시스템이, 선행하는 프레임의 정보를 참조하지 않는 최선행 비참조 프레임 및 선행 또는 후행하는 프레임의 정보를 참조하는 복수의 참조 프레임들을 포함하여 각각 구성되는 제 1 GOP 내지 제 N GOP로(N은 2 이상의 정수) 구성되는 압축 동영상 컨텐츠를 저장하는 제 1 단계;The video content providing system includes a first GOP to an Nth GOP, each of which includes a plurality of reference frames that refer to information of a best-order non-reference frame that does not refer to information of a preceding frame and information of a preceding or following frame ( N is an integer of 2 or more); a first step of storing the compressed video content;
    상기 동영상 컨텐츠 제공 시스템이, 통신망을 통해 사용자 단말기로부터 상기 압축 동영상 컨텐츠의 임의편집을 요청하는 임의편집 요청을 수신하는 제 2 단계; 및A second step of the video content providing system receiving a random edit request for requesting arbitrary edit of the compressed video content from a user terminal through a communication network; And
    상기 동영상 컨텐츠 제공 시스템이, 통신망을 통해 상기 압축 동영상 컨텐츠를 GOP 단위로 상기 사용자 단말기에게 제공하되, 적어도 일부의 GOP들은 순서에 따라 상기 사용자 단말기에게 제공하고, 제 k GOP를(k는 2 이상, N-1 이하의 정수) 제 k-1 GOP에 선행하거나 제 k+1 GOP에 후행하는 임의의 순서로 사용자 단말기에게 제공하는 제 3 단계;를 포함하여 구성되는 것을 특징으로 하는 임의편집 압축 동영상 컨텐츠 제공방법.The video content providing system provides the compressed video content to the user terminal in units of GOPs through a communication network, wherein at least some of the GOPs are provided to the user terminal in order, and the kth GOP (k is equal to or greater than 2). An integer less than or equal to N-1) a third step of providing the user terminal in any order preceding the k-1 GOP or following the k + 1 GOP; How to Provide.
  9. 제 8 항에 있어서, 상기 제 1 단계는,The method of claim 8, wherein the first step,
    상기 동영상 컨텐츠 제공 시스템이, 상기 최선행 비참조 프레임은 I 프레임(Infra Frame)이고, 상기 참조 프레임은 P 프레임(Predicted Frame) 및/또는 B 프레임(Bidirectional Frame)인 상기 압축 동영상 컨텐츠를 저장하는 것을 특징으로 하는 임의편집 압축 동영상 컨텐츠 제공방법.The video content providing system stores the compressed video content, wherein the best-order non-reference frame is an I frame, and the reference frame is a P frame and / or a bidirectional frame. Random edited compressed video content providing method characterized in that.
  10. 제 9 항에 있어서, 상기 제 3 단계는,The method of claim 9, wherein the third step,
    상기 동영상 컨텐츠 제공 시스템이, 장면변환 검출을 통해 검출된 장면변환 지점에 위치한 I 프레임을 상기 최선행 비참조 프레임으로 포함하는 GOP를 상기 제 k GOP로 선택하는 것을 특징으로 하는 임의편집 압축 동영상 컨텐츠 제공방법.The video content providing system provides a randomized edited video content including a GOP including an I frame located at a scene change point detected through scene change detection as the best non-reference frame as the k th GOP. Way.
  11. 제 9 항에 있어서, 상기 제 3 단계는,The method of claim 9, wherein the third step,
    상기 동영상 컨텐츠 제공 시스템이, IDR 프레임(Instantaneous Decoder Refresh)을 상기 최선행 비참조 프레임으로 포함하는 GOP를 상기 제 k GOP로 선택하는 것을 특징으로 하는 임의편집 압축 동영상 컨텐츠 제공방법.And the video content providing system selects a GOP including an Instantaneous Decoder Refresh (IDR) frame as the best non-reference frame as the k-th GOP.
  12. 제 8 항에 있어서, 상기 제 3 단계는,The method of claim 8, wherein the third step,
    상기 동영상 컨텐츠 제공 시스템이, 상기 제 k GOP 내지 상기 제 k+m GOP(m은 10 이상의 정수)를 편집 GOP 어레이로 정의하고, 상기 편집 GOP 어레이를 상기 제 k-1 GOP에 선행하거나 k+m+1 GOP에 후행하도록 임의의 순서로 사용자 단말기에게 제공하는 것을 특징으로 하는 임의편집 압축 동영상 컨텐츠 제공방법.The video content providing system defines the k th GOP to the k + m GOP (m is an integer of 10 or more) as an edited GOP array, and the edited GOP array precedes the k-1 GOP or k + m. And providing the user terminal in any order so as to follow the +1 GOP.
  13. 제 12 항에 있어서,The method of claim 12,
    상기 제 1 단계는, 상기 동영상 컨텐츠 제공 시스템이, 상기 최선행 비참조 프레임 및 상기 참조 프레임의 이미지의 VOP(Video Object Plane)들을 분석하는 제 1 부단계;를 더 포함하고,The first step may further include a first sub-step of the video content providing system analyzing VOPs (Video Object Planes) of the best non-reference frame and the image of the reference frame.
    상기 제 2 단계는, 상기 동영상 컨텐츠 시스템이, 동일한 GOP가 연속적으로 포함된 상기 제 k GOP 내지 상기 제 k+m GOP를 상기 편집 GOP 어레이로 정의하는 것을 특징으로 하는 임의편집 압축 동영상 컨텐츠 제공방법. In the second step, the video content system defines the kth GOP to the k + m GOP including the same GOP consecutively as the edited GOP array, characterized in that the edited GOP array method.
  14. 제 12 항에 있어서,The method of claim 12,
    상기 제 1 단계는, 상기 동영상 컨텐츠 제공 시스템이, 상기 최선행 비참조 프레임 및 상기 참조 프레임의 이미지에 포함된 인물의 안면을 인식하여 안면 식별자와 연관시키는 제 2 부단계;를 더 포함하고,The first step may further include a second sub-step of the video content providing system recognizing a face of a person included in an image of the best non-reference frame and the reference frame and associating it with a face identifier.
    상기 제 2 단계는, 상기 동영상 컨텐츠 시스템이, 동일한 안면 식별자가 연속적으로 연관된 상기 제 k GOP 내지 상기 제 k+m GOP를 상기 편집 GOP 어레이로 정의하는 것을 특징으로 하는 임의편집 압축 동영상 컨텐츠 제공방법.In the second step, the video content system defines the kth GOP to the k + m GOPs associated with the same face identifier consecutively as the edited GOP array.
PCT/KR2017/005356 2016-05-31 2017-05-23 Randomly-edited compressed video content provision system and provision method WO2017209432A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
KR1020160067016A KR101843017B1 (en) 2016-05-31 2016-05-31 Random Editing System for Providing Compressed Video Contents and Method thereof
KR10-2016-0067016 2016-05-31

Publications (1)

Publication Number Publication Date
WO2017209432A1 true WO2017209432A1 (en) 2017-12-07

Family

ID=60477656

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/KR2017/005356 WO2017209432A1 (en) 2016-05-31 2017-05-23 Randomly-edited compressed video content provision system and provision method

Country Status (2)

Country Link
KR (1) KR101843017B1 (en)
WO (1) WO2017209432A1 (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111918121A (en) * 2020-06-23 2020-11-10 南斗六星系统集成有限公司 Method and device for accurately editing streaming media file
CN113692781A (en) * 2020-03-10 2021-11-23 北京小米移动软件有限公司 Method, device, communication equipment and storage medium for transmitting data

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2001094938A (en) * 1999-09-24 2001-04-06 Nec Corp Compression image reproduction method and device
KR20080054475A (en) * 2006-12-13 2008-06-18 주식회사 대우일렉트로닉스 Reservation recording method by using video object plane and its system
JP2009522939A (en) * 2006-01-06 2009-06-11 グーグル インク. Dynamic media supply infrastructure
JP2012018727A (en) * 2010-07-08 2012-01-26 Sony Corp Information processor, and information processing method and program
KR101382954B1 (en) * 2006-07-04 2014-04-08 소니 주식회사 Information processing apparatus and method and recording medium for program

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2001094938A (en) * 1999-09-24 2001-04-06 Nec Corp Compression image reproduction method and device
JP2009522939A (en) * 2006-01-06 2009-06-11 グーグル インク. Dynamic media supply infrastructure
KR101382954B1 (en) * 2006-07-04 2014-04-08 소니 주식회사 Information processing apparatus and method and recording medium for program
KR20080054475A (en) * 2006-12-13 2008-06-18 주식회사 대우일렉트로닉스 Reservation recording method by using video object plane and its system
JP2012018727A (en) * 2010-07-08 2012-01-26 Sony Corp Information processor, and information processing method and program

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113692781A (en) * 2020-03-10 2021-11-23 北京小米移动软件有限公司 Method, device, communication equipment and storage medium for transmitting data
CN113692781B (en) * 2020-03-10 2024-03-12 北京小米移动软件有限公司 Method, device, communication equipment and storage medium for transmitting data
CN111918121A (en) * 2020-06-23 2020-11-10 南斗六星系统集成有限公司 Method and device for accurately editing streaming media file

Also Published As

Publication number Publication date
KR101843017B1 (en) 2018-03-29
KR20170135299A (en) 2017-12-08

Similar Documents

Publication Publication Date Title
KR101354833B1 (en) Techniques for variable resolution encoding and decoding of digital video
WO2010027143A2 (en) Media transmission system and method
US10009628B2 (en) Tuning video compression for high frame rate and variable frame rate capture
US20210409752A1 (en) Personal Video Recorder
RU2479937C2 (en) Information processing apparatus and method
US6389218B2 (en) Method and apparatus for simultaneously producing compressed play and trick play bitstreams from a video frame sequence
US20090052537A1 (en) Method and device for processing coded video data
US20090106807A1 (en) Video Distribution System for Switching Video Streams
HUE029013T2 (en) Arranging sub-track fragments for streaming video data
WO2011111987A2 (en) Apparatus and method for playing media content data
US10283167B2 (en) Image decoding device, image decoding method, image encoding device, and image encoding method
CN108965986B (en) Video recording and playing method and system
WO2017209432A1 (en) Randomly-edited compressed video content provision system and provision method
WO2012176979A1 (en) High quality video streaming service method and system
CN115250356A (en) Multi-camera switchable virtual camera of mobile phone
CN102326403A (en) Accelerating channel change time with external picture property markings
CN111526363A (en) Encoding method and apparatus, terminal and storage medium
JPH09247614A (en) Image signal processing unit
WO2019004783A1 (en) Transmission system for multi-channel image, control method therefor, and multi-channel image playback method and apparatus
CN110392275B (en) Sharing method and device for manuscript demonstration and video networking soft terminal
JP2009171294A (en) Video distribution system, video relay apparatus, and video relay method
US20230328308A1 (en) Synchronization of multiple content streams
WO2019004498A1 (en) Multichannel image generation method, multichannel image playing method, and multichannel image playing program
CN111193956B (en) Video data processing method and video playing device
WO2019209008A1 (en) System for improving video quality by using changed macroblock extraction technique

Legal Events

Date Code Title Description
NENP Non-entry into the national phase

Ref country code: DE

121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 17806925

Country of ref document: EP

Kind code of ref document: A1

122 Ep: pct application non-entry in european phase

Ref document number: 17806925

Country of ref document: EP

Kind code of ref document: A1